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ACTCTTTTGA 
ACTTTAAATG 
TTGTAAACCA 
AAGCTTGCTA 
CTACCATTAT 
ACTACTATTG 
ACGCCCGCAC 
CCAGCATTTC 
TTTCTATATT 
TCTGCAATTT 
TTTATTAATA 
ATGGCTGGTG 
TTGGTTATAC 
AAATCCTTCC 
TTTTTCATTT 
AACCGCAGGT 
AGATTTTCGA 
CGTAACCAAT 
GTTCTGTCAA 
CCCACTCTGC 
CAGGTAAAGC 
TGACCAAAAG 
CTTTTGTTAC 
TAAAAGCCTT 
AACGAAGCTT 
GGCATCAAAG 
CCCCGCAAAA 
GCTTTTTAAT 
ATTTACTTCT 



ATTCCTTTAC 
TCCCAGGAAG 
ACACTGCTGT 
CTACAGCTCC 
GAATCATTAC 
GAATATTGAA 
TATCTACTCC 
CTCTAATCGC 
CTCCTGCTCC 
TCAGAGCAGC 
AACCACCAGC 
AGGCGCCCAG 
AAATGACCAA 
TTTTCACTTT 
ACATTGAATG 
TGCTCAAAAC 
AGAGTCTTAC 
ATCAAAAAAA 
ATCTTCTCGA 
AAAGAAAGGA 
CAAATCTTTA 
ATGGGCATAG 
AGGGTGCAAA 
TTGGAAATAA 
GGATTCAAGT 
AAGTGATAAT 
AGTTCCTTAA 
AATTCTCCAA 
CCTCTCTTTA 



ATCTTCATCT 
ATGTCCATTT 
TTTTAATGAT 
TAATAAGCAT 
CACTTCTCCA 
CTTCTCATTT 
TTTAGATGCC 
TGCTAGTTTA 
ACAGGCTACA 
TTGGTATAAT 
ATACTTTAAC 
TGCTACTAAT 
TGGTGCTTTT 
ATACGATCTA 
ATTTATACTC 
ACTGTTTTGA 
CTCATCAAAT 
GCTAATTCTA 
TTTTTACCGA 
CCTCTGTGGA 
GAACCCCAGT 
TCTGATGAAG 
GTCACTTCTA 
CCATCTTCAT 
TTATCTGCGT 
CTTGAATCAG 
CAAAACGATG 
CAGTCAAACC 
CTTGTTAGTA 



441 
GTAATCTCGT 
GGACGTTTTT 
TTCAATTCTT 
CCTGTCCCAA 
TTAACAGCAA 
GCTGCTAGAG 
ACATCTATTC 
TAATTGTTGA 
GGATCTAAAA 
TTCCAATTTT 
AAATCCTCTA 
CCATTTGCTG 
TCTTTTAATA 
CTAATTTCGA 
AATGAAAATC 
GGTTGTGGAT 
TTGTAAATAT 
AAGCGACTGC 
CACGATTGAG 
GATTGATCCA 
CTAAATAGAG 
CCACCGCCGA 
GATAGTCATT 
CTGCTTCAAG 
GACTACGCAG 
ATAGTCCTTT 
ATTGATTGCA 
TGGCTGAAAT 
ATTAATAAAA 



ATAAGGCATT 
CTGCTATTTC 
GACCTTTTTC 
TGACTTTCGG 
TGGCATCCAC 
CAATTTCGTC 
CTACTAAAGA 
TTAGATCATC 
CTGCTGGGAC 
CATCTGTCAA 
AATCTGCTGG 
TGAAATTTTT 
ATTTTAAACT 
TTTATCTTTA 
AAAGAGCAAA 
AGAACTGACG 
CATGAGCCTT 
TTGATTCCAG 
TTCGTCAACC 
TTCCGAATGA 
ACCTTCTGCA 
ATACATTAGA 
CTCTGCTACT 
AAAGCCTAGT 
GCACCCAGCA 
AAGACCTTAT 
GCCTGCCAAT 
GCATAGTCTT 
CACCAAGAAA 



TATAAATTCA 
TCCAGCGATA 
TAGTCCGATA 
CATCATAGCA 
TTCACCTGTT 
AATATTATCT 
GGCAATCTCG 
TGCTACTTTT 
ATTATATTTC 
TGTTCCTATG 
AAACTCACTC 
TACTACATCA 
TGTCATATTG 
GTTGAGAATT 
CTAGGAGGCT 
TGGTTTGAAG 
CTCTAGACAT 
CGTTGCTGAA 
AGAAATTGAA 
ATATAGACTT 
ATGACCAGCA 
TCCTGAAAGG 
TTTAACTCTT 
TGCTTGGCAA 
TGGATAAGAA 
TCTCAATTGT 
CCTTCTGACT 
GTGTTTCCAT 
TATCAAGCAA 



17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18600 
18660 
18720 
18780 
18840 
18900 
18960 
19020 
19080 
19140 
19200 
19260 
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AATCGTAATT CCACTTGATC CTTTTAAAGC ACATCGAGAG CATTTGCAGA GAGCTAACTA 19320 

AACAAGCCTA TCCAGTTTAT ATAAACAAAA AACTCCAATT ACAATCAAGA ATTAGAGTTG 19380 

ACTTACAAGA TTAGACCGTT CATTTCACCA TACGAAAAAA CTGTTCACAT TTCCCTTCGC 19440 

CAGTCTTAAC TGTATCAGGT TCAATGGGTA TTATCTCAGC CTAAAGCACC CCAAATGTCT 19500 

CTATTATTTA ACTACTGAAC CAGTATAGCA AAAAATGAAA GCCCTAGCAA GATATTTGAC 19560 

CGAAAAATAT CTTTATATAT AATATATTGA AACTAGAATA GTACACCTCT ACTTATAAAA 19620 

CATTGTTAGA AATCGATTTG ACTGTCCTGA TTGATTTGTC CTATTCTTAT TTCATTTTAC 19680 

TATAGTTTTC GATAGCAATT TATTCTTCCA ATACACGAAG AAAAACCTCC ACATTCAGTG 19740 

GAGGCAATCT GTTTTATCAA TACAATTTTA AGTCACGAGG GTCAACTGGG AAGGTTGGGT 19800 

TGTATGGATT GTGACGGAGC TTGAAGTGTT TGACATCTTC AATGGTCTGA GTTCCAGACA 19860 

ATTGCATAAC TGTCTTCAAT TCCGCATTCA AGTGTTCAAA GACTTGACGC ACACCGACAC 19920 

TACCACCGAG AGCCAAGCCA TAGATGACAG GGCGTCCAAT AGCAACCAAG TCTGCTCCTG 19980 

ATGCCAAGGC TTTAAAGACG TGTTGACCAC GACGAACACC AGAGTCAAAG ACAATCGGCA 20040 

CACGTCTATC AACTGCTTCT GCCACTTCTT GAAGCGAGTC AAAGGCAGCT GGTCCACCGT 20100 

CGATTTGACG ACCACCGTGG TTGGTTACCC AGATACCAGA AGCTCCTGCA GCAAGCGAAC 20160 

GTTCAACGTC CTCACGGCAT TGTGGTCCCT TGACATACAC AGGAAGACCA GAGTATTCAG 20220 

CGATAAATTC TACATCGCGT GGAGACAAGC GTTGTTTAGC TGATTTGTAA ACAAAGTCCA 20280 

TTGATTTACC AGCACCTTCT GGCAGGTATT CTTCAACAAT CGGCATGCCA ACTGGGAAGA 20340 

CAAAACCATT ACGCTTATCC ACTTCACGAT TCCCCCCTAC AGTAGCATCT GCCGTCAAGA 20400 

CAATCGCTTT ATAACCTTCA GCCTTCACAC GGTCCATGAT GTGGCGGTTG ATACCGTCAT 20460 

CCTTACTAAA GTAAAATTGA AACCAATGAG GTGTCCCTTG GAGGGCTTCA GAAATCTCTG 20520 

GAAGGTCAAC AGTAGAGTAA GAACTGGTTG TATAAAGAGA ACCAAACTCA TGCACACCAC 20580 

GCGCAGTCGC CACTTCCCCC TGTTCATTTG CCAATTTATG AGCCGCAACA GGTGCCATAA 20640 

TGATTGGAGA AGATAGTTTT TCACCTGCAA ATTCAATCTC TGTACTTGGA TTTTCTACAT 20700 

TGCAAAGTGT ATGAGGAACG ATGAGCTTGT GGTTAAAGGC ACGGATATTC TCTCTTAAAG 20760 

TGAAAGTATC TTCCGCCCCA CTAGCGATAT AGCCAAATGC TGCTTTAGGA ATAACTTGTT 20820 

GCGCCATTGG CTCCAAATCA TAGGTATTGA TGAArTCTAC ATGACCTTCT GCATTGCTTG 20880 

TTTTGTATGA CATAAAATGT CCTCCTTAAT AAGTAAGCGT TTACTTTGTG TATTACAAAA 20940 

ATATCTTAAC TCTTTTTCAA AACTTTTAAA ATATTTTGTT TGGAAATTTC AGAAATTTTA 21000 

TGTCTATGAT AAAAATCCTT ATAACGGCAA TAAAAAATAG ATATTATCCA AAGAAGATTT 21060 
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TAAGTGCTAC 


AATAACTGTA 


TTATTTCTAG 


ATGGGAGGTT 


CTATTTTTGG 


ATTGATCCAT 


21120 


TGTTGAACAA 


TATCTACCAC 


TATATCAAAA 


GGCATTCTTT 


CTGACCTTGC 


ATATTGCAGT 


21180 


TTGGGGAATT 


TTGGGATCCT 


TTCTGCTCGG 


TTTAATCGTT 


AGTATCATCC 


GACATTATCG 


21240 


AATCCTTGTT 


TTGGCGCAAG 


TAGCGACAGC 


CTACATTGAA 


TTGTCACGTA 


ATACGCCCCT 


21300 


TTTGATTCAA 


CTCTTCTTTC 


TCTACTTCGG 


TCTTCCCCGA 


ATCGGGATTG 


TCCTATCTTC 


21360 


AGAAGTCTGT 


GCAACGCTTG 


GGCTTGTCTT 


TTTAGGAGGC 


TCCTATATGG CAGAATCTTT 


21420 


CCGAAGTGGG 


CTGGAAGCCA 


TCAGTCAAAC 


CCAGCAGGAG 


ATTGGCCTCG 


CTATTGGTCT 


21480 


GACACCTCTA 


CAGGTCTTTT 


ACTATGTGGT 


TCTTCCGCAA 


GCAACAGCGG 


TGGCACTCCC 


21540 


CTCCTTTAGT 


GCCAATGTCA 


TTTTCCTTAT 


CAAGGAAACC 


TCTGTTTTCT 


CAGCAGTGGC 


21600 


TTTGGCCGAC 


CTCATGTACG 


TCGCCAAGGA 


TTTGATTGGT 


CTCTACTATG 


AGACAGACAT 


21660 


TGCGCTAGCT 


ATGTTGGTAG 


TTGCTTATCT 


AATCATGCTG 


CTACCCATCT 


CACTGGTCTT 


21720 


TAGCTGGATA 


GAAAGGAGGC 


TCCGCCATGC 


AGGATTCGGG 


AATCCAAGTA 


CTCTTTCAAG 


21780 


GAAATAATCT 


CCTGAGAATC 


TTACAGGGAT 


TGGGCGTTAC 


GATTGGGATA 


TCCATCCTGT 


21840 


CTGTCCTCTT 


ATCCATGATG 


TTCAGAACAG 


TCATGGGAAT 


CATCATGACC 


TCCCATTCTA 


21900 


GAATCATACG 


ATTTTTAACA 


CGATTGTATC 


TGGAATTTAT 


CCGTATCATG 


CCCCAGCTGG 


21960 


TGCTACTCTT 


CATCGTTTAC 


TTTGGCTTGG 


CTCGAAACTT 


TAATATCAAT 


ATCTCAGGTG 


22020 


AGACTTCAGC 


TATTATCGTT 


TTTACCCTCT 


GGGGAACAGC 


TGAAATGGGA 


GACTTGGTAC 


22080 


GTGGAGCTAT 


CACTTCTCTC 


CCTAAACATC 


AGTTTGAAAG 


TGGACAGGCA 


CTCGGCTTGA 


22140 


CTAATGTTCA 


ACTTTACTAC 


CACATCATCA 


TCCCACAAGT 


CTTAAGAAGA 


CTGCTACCGC 


22200 


AGGCTATCAA 


TCTTGTCACT 


CGGATGATTA 


AAACCACTTC 


ATTAGTTGTT 


TTGATTGGGG 


22260 


TTGTGGAAGT 


GACCAAAGTT 


GGACAACAAA 


TCATCGATAG 


CAATCGCCTG 


ACCATCCCAA 


22320 


CTGCTTCATT 


TTGGATTTAT 


GGAACCATTC 


TAATCTTATA 


TTTCGCAGTT 


TGCTACCCTA 


22380 


TTTCCAAACT 


ATCCACTCAC 


TTAGAAAAAC 


ATTGGAGAAA 


CTAAATGTCT 


GAAACTATCT 


22440 


TAGAAATCAA 


GGAACTAAAA 


AAATCCTTCG 


GAGACAATCC 


CATCCTCCAA 


GGACTTTCTC 


22500 


TAGAAATCAA AAAAGGGGAA GTTGTTGTCA 


TCCTAGGGCC 


ATCTGGTTGT 


GGGAAAAGTA 


22560 


CCCTCCTTCG 


TTGCCTCAAC 


GGCTTAGAAA 


GTATTCAAGG TGGAGATATT CTTCTGGATG 


22620 


GTCAGTCTAT 


CGTTGAAAAT 


AAAAAAGATT 


TTCACCTAGT 


TCGCCAAAAG 


ATTGGCATGG 


22680 


TCTTTCAAAG 


TTATGAACTC 


TTTCCCCATC 


TGGATGTCTT 


ACAAAACCTC 


ATCCTAGGCC 


22740 


CTATCAAAGC 


TCAAGGAAGG 


GACAAGAAAG 


AAGTAACGGA AGAAGCTTTG 


CAATTACTAG 


22800 
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AGCGTGTCGG TTTGCTGGAT AAACAACATA GCTTTGCCCG TCAATTATCT GGTGGACAGA 22860 

AGCAACGTGT TGCAATTGTC CGTGCCCTCC TAATGCATCC AGAAATCATC CTTTTTGACG 22920 

AGGTGACTGC TTCGCTGGAT CCAGAAATGG TGCGTGAGGT GCTGGAACTT ATCAATGATT 22980 

TGGCCCAAGA AGGCCGTACC ATGATTTTAG TAACCCACGA AATGCAGTTT GCCCAAGCCA 23040 

TTACTGACCG GATTATCTTC CTCGACCAAG GGAAAATCGC TGAAGAAGGA ACAGCTCAAG 23100 

CCTTCTTTAC CAATCCGCAA ACCAAACGAG CCCAGGAATT TTTAAACGTC TTTGACTTTA 23160 

GCCAATTCGG CTCATATCTA TAAAGGAGAT TCTTATGAAA CTATTCAAAC CACTCTTAAC 23220 

TGTTTTAGCA CTTGCCTTTG CCCTTATCTT TATCACTGCT TGTAGCTCAG GTGGAAACGC 23280 

TGGTTCATCC TCTGGAAAAA CAACTGCCAA AGCTCGCACT ATCGATGAAA TCAAAAAAAG 23340 

CGGTGAACTG CGAATCGCCG TGTTTGGAGA TAAAAAACCG TTTGGCTACG TTGACAATGA 23400 

TGGTTCTTAC CAAGGCTACG CTACGATATT GAACTAGGGA ACCAACTAGC TCAAGACCTT 23460 

GGTGTCAAGG TTAAATACAT TTCAGTCGAT GCTGCCAACC GTGCGGAATA CTTGATTTCA 23520 

AACAAGGTAG ATATTACTCT TGCTAACTTT ACAGTAACTG ACGAACGTAA GAAACAAGTT 23580 

GATTTTGCCC TTCCATATAT GAAAGTTTCT CTGGGTGTCG TATCACCTAA GACTGGTCTC 23640 

ATTACAGACG TCAAACAACT TGAAGGTAAA ACCTTAATTG TCACAAAAGG AACGACTGCT 23700 

GAGACTTATT TTGAAAAGAA TCATCCAGAA ATCAAACTCC AAAAATACGA CCAATACAGT 23760 

GACTCTTACC AAGCTCTTCT TGACGGACGT GGAGATGCCT TTTCAACTGA CAATACGGAA 23820 

GTTCTAGCTT GGGCGCTTGA AAATAAAGGA TTTGAAGTAG GAATTACTTC CCTCGGTGAT 23880 

CCCGATACCA TTGCGGCAGC AGTTCAAAAA GGCAACCAAG AATTGCTAGA CTTCATCAAT 23940 

AAAGATATTG AAAAATTAGG CAAGGAAAAC TTCTTCCACA AGGCCTATGA AAAGACACTT 24000 

CACCCAACCT ACGGTGACGC TGCTAAAGCA GATGACCTGG TTGTTGAAGG TGGAAAAGTT 24060 

GATTAGTCAT TAACTCTTAA AAGGAACTGG ATTTTAAGCT CCAATCCCTT TTTAAGATTT 24120 

TACCTATAAC ATCCTGAGTC TATCTAAGAT GTTCAATCTG AACACAGTGT ACATACTTTA 24180 

TCTTCTATTG CATATACTTT ATCACATAAG ATACGAATAT CCTCTTCACT ATGACTAGCA 24240 

ATCAAAATTG TTGTCCCTTT TTCACTAGAG AGCTTTCTAA ACAATGTTCT CATATTTTCT 24300 

ACACTTGATT TATCCAAGGC ATTCATAGGT TCATCTAGTA AAAGAATAGA GGGATTCTCC 24360 

ATAATTGCTT GAGCAATCCC TAGCTTTTTC CTCATACCTA GCGAATAAGT TTTAACTTTC 24420 

TGGTCTTTTT GCTCATATAG ACCAACTATT TTCAGTGTAT CATTGATTTC CTGATTACCA 24480 

ACTACTCCTC GTATGCTTGC CAAATATTGT AAATTCTTAA AGCCACTATA ATAATTTATA 24540 

AAACCAGGTT CTTCAATCAA AGCTCCCAAA TTAGCTGGAA TTTTTCTCTC AGGAACAATA 24600 
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TTTTCCCCAT TGATTAACAC TTCTCCATAA GACGGACTAT ATAAACCAGC TATTAATTTA 24660 

AACAATACAC TTTTCCCTGA GCCATTCGCA CCAGTAATTC CTATAATTTC CCCCTGTTTA 24720 

CAACTAAAGT TAAGGTTTTG AAAAACACAT GTCTTTTTTA ATTTCAACTC AATATTTTTT 24780 

AATGTAATTA TTTCATTCAT TCTATAAACC TCCTCTTTTG ACGAGTGAAA TAGAAAATGC 24840 

TTTGAAAAAG AAAGACTAAA AATAGCAACT GAAGAAATAA ATCTCGTCCT ATATCTCCAT 24900 

TCCCTCGATT CAAAATATAA AATAGATAAT TAGTTCGATT TCCTACAAAT AGACCACCAA 24960 

ACACAATCAT GAGTAAAAAG AAACTAACGC AAGCAAAGTT CG 25002 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11443 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



CAGGTACGGT 


GAGGCGCAAC 


TAAAATATAA 


TTTTCATCTT 


GATTAGGAAT 


TTTATCAGTA 


60 


TTATGATAGT 


GAGCATTGCC 


ATTGATGGAC 


CATAAGAGCA 


ATACAACTAA 


TCCACGCAAA 


120 


TAAGTATAAA 


ACATGCGATC 


TCCTTCGATT 


GTTTTCTTGT 


TATTATTATA CCTTATCAAA 


180 


GGAGGGCTGG 


CAAACTTTTC 


CCTTGACTAG 


ATACATATTT 


AGGATGAAAT 


TAGAATTCTG 


240 


TTAAAAAAAA 


TGATATAATA 


GAATTTATGG 


ATAAAAATAA 


GATTATGGGA 


TTAACCCAAA 


300 


GAGAAGTCAA 


GGAAAGACAG 


GCTGAGGGTT 


TGGTCAATGA 


CTTTACCGCA 


TCAGCCAGTA 


360 


CCAGCACTTG 


GCAAATCGTT 


AAACGAAATG 


TCTTTACCCT 


TTTTAACGCT 


TTGAACTTTG 


420 


CCATTGCTTT 


GGCTCTTGCC 


TTTGTGCAGG 


CTTGGAGCAA 


TCTGGTCTTC 


TTTGCTGTTA 


480 


TCTGCTTTAA 


CGCTTTTTCT 


GGGATTGTGA 


CCGAGCTACG 


AGCCAAACAC 


ATGGTGGACA 


540 


AGCTCAATCT 


CATGACCAAG 


GAAAAGGTCA 


AAACCATCCG 


TGATGGTCAG 


GAAGTTGCTC 


600 


TTAATCCTGA 


AGAATTAGTG 


CTAGGAGATG 


TCATTCGTTT 


GTCTGCAGGA 


GAGCAGATTC 


660 


CTAGTGATGC 


CTTGGTTTTG 


GAAGGCTTTG 


CGGAAGTCAA 


TGAAGCCATG 


TTAACGGGAG 


720 


AAAGTGATTT 


GGTGCAAAAG 


GAAGTTGACG 


GCTTACTTTT 


GTCAGGAAGT 


TTCCTAGCCA 


780 


GTGGGTCAGT 


TTTATCTCAA 


GTTCACCATG 


TCGGTGCAGA 


CAACTATGCT 


GCCAAACTCA 


840 


TGCTTGAGGC 


TAAGACCGTT 


AAACCCATCA 


ACTCCCGTAT 


CATGAAATCG 


CTGGACAAGT 


900 


TGGCTGGTTT 


TACTGGGAAG 


ATTATCATTC 


CCTTTGGTCT 


GGCTCTCTTG 


CTGGAAGCCT 


960 
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TGCTTTTAAA AGGCCTGCCT CTCAAGTCAT CCGTTGTAAA CTCGTCGACA GCTCTTTTGG 1020 

GAATGTTGCC TAAGGGAATT GCCCTTTTGA CCATTACTTC GCTCTTGACT GCAGTGATTA 1080 

AGTTGGGCTT GAAAAAGGTC TTGGTGCAGG AGATGTACTC TGTTGAGACC TTGGCGCGCG 1140 

TGGATATGCT CTGTCTGGAC AAGACGGGTA CCATCACCCA AGGAAAGATG CAGGTGGAGG 1200 

CTGTTCTTCC GTTGACGGAA ACGTATGGTG AAGAGGCTAT TGCCAGCATC TTGACTAGCT 1260 

ACATGGCCCA TAGTGAGGAT AAGAATCCAA CTGCCCAAGC CATTCGCCAG CGTTTTGTGG 1320 

GAGATGTTGC TTATCCTATG ATTTCCAATC TTCCCTTCTC GAGCGACCGC AAGTGGGGGG 1380 

CTATGGAGTT AGAAGGCTTG GGGACAGTTT TCTTAGGGGC ACCTGAGATG TTGCTTGATT 1440 

CTGAAGTCCC AGAAGCTAGG GAGGCCTTGG AGAGAGGATC ACGTGTCTTG GTCTTAGCTC 1500 

TCAGTCAGGA GAAATTAGAC CATCACAAAC CACAGAAACC ATCTGATATT CAGGCTCTAG 1560 

CCTTGCTGGA AATCTTGGAC CCCATTCGAG AGGGAGCAGC AGAGACGCTG GACTATCTCC 1620 

GTTCTCAGGA GGTGGGACTC AAGATTATCT CTGGTGACAA TCCAGTTACG GTGTCCAGCA 1680 

TTGCCCAGAA GGCTGGTTTT GCGGACTATC ACAGCTATGT AGATTGCTCA AAAATCACCG 1740 

ATGAGGAATT GATGGCCATG GCGGAGGAGA CAGCTATTTT CGGACGTGTT TCCCCTCATC 1800 

AAAAGAAACT CATCATCCAA ACGTTGAAAA AAGCGGGACA TACAACGGCT ATGACAGGGG 1860 

ACGGGGTTAA TGATATCTTG GCCCTTCGTG AGGCGGATTG TTCTATCGTG ATGGCGGAGG 1920 

GGGATCCAGC AACCCGTCAG ATTGCCAATC TGGTTCTCTT GAACTCAGAC TTTAATGATG 1980 

TTCCTGAGAT TCTCTTCGAG GGTCGTCGCG TGGTCAATAA CATTGCCCAC ATCGCCCCGA 2040 

TTTTCTTGAT AAAGACCATC TATTCCTTCC TGTTAGCAGT CATCTGTATT GCCAGTGCTT 2100 

TACTAGGTCG GTCAGAGTGG ATTTTGATTT TCCCCTTCAT TCCGATCCAG ATTACCATGA 2160 

TTGACCAGTT TGTGGAAGGT TTCCCACCAT TCGTTCTGAC TTTTGAGCGA AATATCAAAC 2220 

CTGTTGAGCA GAATTTCCTC AGAAAATCCA TGCTTCGTGC CCTACCAAGC GCTCTCATGG 2280 

TCGTCTTCAG CGTCCTGTTT GTGAAAATGT TTGGCGCGAG TCAAGGTTGG TCTGAGTTAG 2340 

AAATCTCAAC TCTACTCTAT TATCTCTTGG GGTCAATTGG TTTCTTATCC GTATTTAGAG 2400 

CCTGCATGCC ATTTACCCTA TGGCGTGTCC TCTTGATTGT TTGGTCAGTA GGAGGTTTCC 2460 

TAGCCACAGC TCTCTTCCCA AGAATTCAAA AACTGCTTGA AATTTCAACC TTAACAGAAC 2520 

AAACGTTGCC TGTTTATGGT GTCATGATGT TGGTCTTTAC CGTGATTTTC ATCCTGACCA 2580 

GTCGTTACCA AGCGAAAAAA TAAATCAAAA CCACCAGTGT GAACTGGTGG TTTGTTCTGC 2640 

GGCTATAAGC CGCTTCTACC GGCCAGGGCC AAAGGCCCAC CGAAATAGCT TCCTCGCGCA 2700 

CCACTTTCCC GAGCAGGTGC TAAAGCACCT TAGTTACTTC CTCTTATTTA TTTCGCCAGT 2760 
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AAACGGATCT ACTGACTCGA ATAACGTGAG CTGGTCTGCT ACTCTGTCTT CTTGTAATTG 2820 

ATTCTGAATA TATTCAGCTA TCACTTTCTG ATTACGGCCT ACCGTATCTA CATAATAGCC 2880 

TCTACACCAA AACTTGCGAT TGCCATATTT GTATTTTAAA TTCGCATGCT TATCAAAAAT 2940 

CATCAAACTG CTCTTGCCCT TTAAATAGCC CATAAAGGAC GAAACACTAA GTTTCGGAGG 3000 

AATACTGATA AGCATGTGAA TATGGTCTGA ACAAGCATTC GCTTCATGGA TTATTACACC 3060 

CTTACGCTCA CATAAGTCAC GTATGATTCT TCCGATACTA GCTTTGTATC TGCCATAAAT 3120 

GATTTGACGA CGATATTTGG GTGCAAAAAC AATATGATAT TTACAATTCC ATGTGGTATG 3180 

TGATAAACTT TGATTATCCT CTCTCATGAG GTACCTCCTG TATGATATGT TGTAGTGGCG 3240 

GAGAAACCAC TTCTATCTTA TCATTTTAGG AGGTTCTTTT TGTTACCACG CTAAAAGCTC 3300 

TATGGAAcCA CTAGCATAGC TAGTGGTTTT CGGGAGACAA CAAGAAAGAC TGCAATCTGT 3360 

GGATTGCAGT TTTTTATACG ATGGATCTAT CGTAGATCTG ATGTGCAAGG CCTACGTGCC 3420 

GATCATCTAT CGGTGAACCC AAGAGCGACC CTCAAGCCTG CTTGGATTGA GGTAATAGAT 3480 

TCAAATATCT GTAGTTAGAC TATTTGAAGT TTGATGTAAG AAAGAGAAAG CGACAGATTG 3540 

AAGTAATTTT AACTCTCTTC TATTGCTAGA ACAAATGGTC GGATAGGTTG GTAGTTTGAA 3600 

AATGAAGATG CTATCTATTG TTAAATGGAA CATAGTGTTA TTTATTAGAA AATCGTTTGG 3660 

TTTATTTCTT ATCAAATACG AAAAGCAACT TAAATATTTC AACTAAAATA GATGTTATGA 3720 

AGAAAAGGTA AAATGATTTT GGCATAGTGA GGTTCTGTTC TATTTGATAT CATATTTTTG 3780 

ATAAAAACAA AAATGTCCAT TGCAAAGGAC AAAATGCGAA GT AT ATT ATT TTTTGAAAGC 3840 

GATATAATGG ATTCATAAAG GAGGTGTATC GTGTCTAGAA AACAAGAACA AATGGAAACG 3900 

TTGTTGCTCC TTTTGCGAGA TAGTAAGGAT TATATATCTG CTAAAGTATT GGGAGAAAAA 3960 

TTAAATTGCT CTGATAAAAC GGTTTATCGC CTTGTCAAGG GAATCAACAA AGATTGTCCG 4020 

GTAGAAGCAT TCATTTTATC TGAAAAAGGC AGAGGTTTCA AATTAAATCC AAGAAGTTCC 4080 

CTCGTGGACG TTGATGGGAA TTTTACAGAG GCTTTTGATC CTGAAGTAAG GCGTGAAAAA 4140 

TTACTAGAAC GTCTCTTGTT GACTGCTCCT AAGCCACATT CTATTTATGA TTTAGGAGAG 4200 

GAATTCTACG TAAGCGAGTC AGTAGTACTA AAAGATCGTC AGATATTACA AGAGAGTCTA 4260 

GCAATTTATG GGTTAGATTT AAAAATGAGA CAACGAAAGC TTTTTATTGA TGGGGATGAG 4320 

GCTCAAATTC GTTCAGCCAT TCTAAATCTA CTGCCAATGT TTAATCAGTT GGATTTAGAG 4380 

CAAATTACAC AGAATAAGGT TCAGCCTCTT GACGGAGAAC TTGCTCACTT TTGTTTGGGA 4440 

TTACTGATTA CACTTGAGAG AGAATTGGGG GTAAACATTC CCTATCCATA TAATATAAAT 4500 
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ATTTTCTCTC ACCTGTATAT TTTTATCAGT AGGAATCGTC GTAGTACTAG TATTCATGTT 4560 

GTAGCACCTT CAAAACCTAC TATTGTTGAT GAGAAAATTT ACAGTGTCTG TCAAAAAATT 4620 

ATTCAAGAAA TTGAACAATA TTTTAGGATG AAGGTTGATG CAGTTGAGAT TGACTATCTT 4680 

TATCAATACG TTGTATCTTC GAGATTGCAA AAACCATTTT CTTCCGGGAA GCTTCCTTTT 4740 

TCTCAGCGAG TTTTAGATGT CACTCATTAC TATTTTAGCC GTATGTGTAT GGACAATAGA 4800 

GAGATTGAAA CGACAGATCC TGACTTTGTT GACTTGGCGA GTCATATCAG TCCCTTACTG 4860 

AGGAGATTAG ATAATAGAGT ACAGATTAAG AATAGTCTTT TATCACAAAT TCTTTTAACC 4920 

TATCCTAATC TGGTTAAAGA GTTAACAACT ATTTCTAAAG AAGTGAGTCT AGTATTTGGT 4980 

TTTGCTTCCT TGAGTCTGGA CGAGATTGGT TTTCTAGTCT TATATTTTGC ACGGTTTCAA 5040 

GAAAAGCGAG CACGTCCTCT AAAAACAGTA GTGATGTGTA CATCAGGTGT CGGAACTTCA 5100 

GAGCTTTTAC GAGCACGATT AGAAAAGCAA TTTTCTGAAT TGGATATTAT TGATGTAGTT 5160 

GCTTATCATC AATTAGATGA GCTGATAAAT CTATATCCAG ATTTAGATTT CATTGTGACG 5220 

ACGGTAGCTT TGCAGGAACC AGCAAGTGTC CCGTTTGTCC TAGTTAGTGT TTTTCTAACC 5280 

GAGGGTGATA AACAACGTCT TCAAGCAAAA ATTCAGGAGA TAAACTATGA ATAATCTTTC 5340 

GCTTGTCCTT ATGGATATAT CTGTTCAAAA TCGTCAAGAA GCCTACAAAG AATTAGCAAA 54O0 

TCAAATCAGC CTTCTTGTTT CTGAAGATAC AGAAAAAATA GAAGAGCTTC TATATTACCG 5460 

TGAGAGACAG GGAAGTATAG AGGTTGCTAA AGGTGTTCTT CTACCACATT GTGAAGGAAA 5520 

CTTTCAACAT CATGTCTTAG TGATTACTAG ATTAAAATCA CCTATCAGAG AATGGTCGAA 5580 

GGATATCCAG TGTGTTGACC TTATTATCGG TTTGGCCATT GCAGTATCAC AGGACAAGTC 5640 

ATGTATTAAA ACATTGATGA GAAGACTAGC AGATGAATCA TTCATAAATC AATTAAAACA 5700 

GTTAACAAAA GAAGAATTAC GGGAGATAAT ATATGGAAAT CAAAGATATT CTTAATGTGA 5760 

GTCTGATCCA GACGGATTTA CAGATGCAGA GCAAAGAAGA GGTTTTTGAG GCATTAGCTC 5820 

AACTATTGGT TGAGACGGGT TATGTGTCTG ATAGAGACCA ATTTATCGAA GGTCTTTATC 5880 

AGAGAGAGGC AGAAGGACAG ACCGGTATTG GGAATTATAT TGCTATTCCC CATAGCAAGA 5940 

GTTCTGCTGT GGAGAAGGCG GGGGTAGTCA TAGCTATAAA TCACAATGAG ATTCCTTGGG 6000 

AGACCATTGA TGGGAAAGGG GTCAAAGTAA TTGTACTCTT TGCAGTTGGT GATGATACAG 6060 

AAGCTGCTAG GGAGCATTTG AAGACCTTAT CACTCTTTGC TCGAAAACTT GGTAATGACG 6120 

AAGTTGTTGC CAAATTAGTT CGGGCTCAGA CATCTGATGA TGTGATTGCA GCTTTTTGTT 6180 

AATAAGAAAA AATTTTGGAG GGTATCCGTA TGAAAATTGT TGGTGTTGCA GCTTGTACTG 6240 

TGGGAATTGC CCACACTTAT ATTGCACAGG AAAAATTAGA GAATGCCGCA AAGGTAGCTG 6300 
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GACATGTGAT TCATGTTGAG ACTCAGGGGA CAATAGGGGT AGAAAATGAA TTGAGTCAAG 6360 

AGCAGATTGA TGCAGCGGAT GTAGTTATTT TAGCAGTTGA TGTTAAGATT TCTGGTATGG 6420 

AACGCTTTGA GGGTAAAAAG ATTATCAAGG TTCCAACAGA AGTGGCAGTC AAATCTCCCA 6480 

ATAAACTGAT TGCTAAAGCT GTTGAGATTG TTACGAAATA ACTGAAAATA TTTAAGGAGA 6540 

AAATATATGT TGAAACACTT AAACTTAAAA GGTCACTTAT TGACAGCCAT TTCCTATATG 6600 

ATTCCAATTG TTTGTGGTGC AGGATTCTTA GTTGCCATTG GTTTAGCAAT GGGGGGTGGT 6660 

GTTCCTGACG CTCTTGTAGC AGGAAAATTC ACTATCTGGG ATGCTTTAGC AACTATGGGT 6720 

GGTAAAGCCC TTGGTCTCTT GCCAGTTGTT ATTGCTACAG GTTTGTCTTA CTCGATTGCT 6780 

GGTAAGCCAG GGATTGCACC AGGTTTTGTT GTTGGTCTAA TTGCCAATTC TGTTGGTTCA 6840 

GGGTTTATCG GTGGTATCTT GGGAGGTTAT ATAGCTGGTT TCTTGGTTCA AGCGATTATT 6900 

AAAAAGGTCA AAGTACCAAA CTGGATTAAA GGTTTAATGC CAACCTTGAT TATTCCTTTT 6960 

GTAGCCTCTT TGGTAAGTAG TTTGATTATG ATTTATATTA TTGGAGCGCC TATCGCAGCC 7020 

TTTACCAACT GGTTGACGAG CTTATTACAA AGCTTGGGAA GTGCTTCAAA TGGTTTGATG 7080 

GGGGCAGTTA TTGGAATTCT CAGTGCTGTT GACTTTGGTG GCCCACTTAA TAAAACAGTC 7140 

TATGCGTTTG TGTTGACTTT ACAGGCTGAA GGTGTGAAAG AACCATTGAC TGCTTTACAA 7200 

TTGGTGAATA CTGCTACACC AGTTGGATTT GGATTGGCCT ATTTTATCGC GAAATTACTC 7260 

AAAAAAAATA TCTATACTCA AGAGGAAATC GAAACATTGA AATCGGCTGT TCCTATGGGG 7320 

ATTGTCAATA TTGTTGAAGG TGTAATTCCG ATTGTTATGA ATAACTTGGT TCCAGGTCTC 7380 

ATTGCAACAG GTATCGGTGG TGCTGTTGGT GGTGCTGTTT CTTTGACAAT GGGTGCTGAT 7440 

TCTGCTGTGC CATTTGGTGG AGTGCTTATG TTACCAACCA TGACTCGTCC AGTAGCTGGT 7500 

ATTTGTGCCT TGTTAGCTAA CATTGTAGTC ACAGGACTTG TCTACGCGAT TTTGAAAAAA 7560 

CCAATAAAAC ATGCAGAACC AGTTATGACT GTTGAAGAAG AGATTGATTT GTCAGATATT 7620 

GAAATTTTGT AAGAGGGTAA CGATGTCAAG AATTGAATTT TCACCATCTT TGATGACCAT 7680 

GGATTTGGAC AAATTCAAAG AGCAGATTAC TTTTTTGAAT GATAAAGTAG CATCTTATCA 7740 

TATCGATATT ATGGATGGCC ATTTTGTTCC CAATATTACC TTGTCTCCTT GGTTCATTCA 7800 

AGAAGTTCAA AAAATTAGTG ACACACCTTT ATCAGTTCAT CTGATGGTCA CAGACCCAAC 7860 

CTTTTGGGTA GATCAAGTTC TCGATTTACA ATGTGAGTAT ATTTGTATTC ATGCTGAAGT 7920 

TCTGAATGGT CTTGCTTTTC GTTTGATTGA TAAAATTCAT GATGCAGGTC TAAAGGCTGG 7980 

TGTTGTCCTT AATCCTGAAA CACCTGTTTC TACAATCTTT CCCTACATTG ATTTACTTGA 8040 
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CAAAGCAACT ATTATGACTG TAGATCCAGG TTTTGCAGGA CAACGCTTTT TGGAGTCTAC 8100 

CTTGTATAAA ATCCAAGAAC TCCGTCAGCT TAGAGTTCAG AATGGTTATC ACTACATCAT 8160 

TGAGATGGAT GGTTCTTCGA GTCGTAAGAC TTTCAAACAA ATTGATGTGG CAGGACCAGA 8220 

TATTTATGTT ATAGGTCGCA GTGGATTATT TGGTTTGGAT GACGATATTG CCAAAGGCTG 8280 

GGATATCTGT TCTAGAGATT ACGAAGAAAT GACCGGAAAA ACAATGCCAA TCAAATAATG 8340 

GTTTGAGAAG AAATTTATTA GTTAGGAGGA ATATATGTCA CTACAATCAG TTAACGCCAT 8400 

TCGTTTTCTT GGCGTAGATG CTATTAACAA ATCTAATTCT GGTCACCCGG GAATTGTCAT 8460 

GGGTGCTGCG CCAATGGCTT ATAGCCTATT TACAAAGCAC CTTAGAATTA CACCTGAGCA 8520 

GCCAAACTGG ATTAACCGAG ATCGCTTTAT CTTGTCTGCG GGTCATGGAT CAATGCTACT 8580 

GTATGCTCTC TTGCATTTAA CAGGGTATAA GGATGTATCC ATGGACGAGA TTAAAAATTT 8640 

CCGGCAATGG GGATCTAAGA CACCTGGTCA TCCTGAAGTG ACGCATACGT CTGGTGTGGA 8700 

TGCGACATCT GGTCCGCTTG GTCAGGGGAT TTCTACTGCC GTTGGTTTCG CCCAAGCAGA 8760 

GCGTTTTTTA GCTGCTAAGT ACAACAAAGA TGGTTTCCCT ATTTTTGACC ATTATACTTA 8820 

TGTTATCGCT GGAGACGGTG ACTTCATGGA AGGAGTGTCT GCGGAGGCGG CTTCTTATGC 8880 

AGGTCATCAA GCTTTAGATA AGCTTATCGT CCTCTACGAC TCCAACGACA TCTGCTTGGA 8940 

TGGTGAGACC AAAGATACTT TCTCTGAAAA TGTTCGCGTC CGTTACGATG CTTATGGTTG 9000 

GCATACAGTT CTGGTAGAAG ATGGAACAGA TTTAGCAGCA ATTTCTACAG CAATTGAGAC 9060 

GGCCAAGTTT TCTGGTAAAC CGAGTTTGAT TGAAGTGAAA ACGGTAATTG GTTACGGCTC 9120 

ACCCAATAAA AGTGGTACAA ATGCTGTTCA TGGTGCACGA CTAGGAGCAG AAGAAACAGG 9180 

AGCAACTCGT AAGTTTTTGG GATGGGATTA CGATCCATTT GAAGTACCAG AGGAAGTATA 9240 

TTCTGATTTC AAGACAAATG TAGCGGATCG TGGTCAGGAG GCATACGATG CTTGGGCTAG 9300 

TTTGGTGTCT GATTACAAGG TTGCTTATCC CGAAGTTGCT AGTGAGATTG ACGCTATTGT 9360 

AGCTGGAAAA TCCCCTGTAA CCATTACTGA AAAAGACTTC CCTGTCTATG AGAATGGCTT 9420 

CTCTCAAGCA ACTCGTAATT CGTCCCAAGA TGCTATTAAT ACAGCAGCAG TTTTACCAAC 9480 

CTTCTTAGGT GGATCGGCAG ACTTAGCTCA CTCTAACATG ACCTACATCA AGGCAGATGG 9540 

CTTACAAGAT AAATATAATC CATTAAACCG CAATATTCAG TTTGGGGTAC GTGAATTTGC 9600 

CATGGGAACA ATCCTCAATG GAATGGCTCT TCATGGTGGT TTACGAGTTT ATGGCGGAAC 9660 

CTTCTTTGTT TTCTCTGACT ACGTCAAAGC TGCTATTCGG CTATCAGCCA TTCAGGAGTT 9720 

GCCTGTAACT TATGTCTTTA CCCATGATTC AATTGCCGTT GGTGAAGATG GTCCAACTCA 9780 

TGAACCAGTT GAACATTTGG CAGGTTTACG CTCAATGCCA AACTTGACTG TTATCCGTCC 9840 
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AGCGGATGCC 


CGTGAAACTC 


AAGCGGCTTG 


GCATCATGCC 


TTGACCAGTA 


CCACCACTCC 


9900 


AACTGTCATT 


GTCTTAACCC 


GTCAAAACTT 


GGTAGTTGAA 


GAAGGGACAG 


ACTTTGGTAA 


9960 


GGTCGCTAAA 


GGAGCCTACG 


TCGTGTATGA 


TACCCCGGGA 


TTTGATACTA 


TTATCATTGC 


10020 


TACAGGATCT 


GAGGTCAATC 


TAGCTATCAA 


AGCTGCTAAG 


GAATTGGTTT 


TACAAGGTGG 


10080 


TAAAGTACGT 


GTGGTATCTA 


TGCCCTCAAC 


CGAACTATTT 


GATGCTCAAG 


ATGCTACCTA 


10140 


CAAGGAAGAC 


ATTTTACCAT 


CTAAGACTCG 


TCGTCGTGTG 


GCCATTGAAA 


TGGCAGCGAC 


10200 


CCAAAGTTGG 


TACAAGTATG 


TTGGTTTGGA 


TGGCGCGGTC 


ATCGGTATTG 


ACATCTTCGG 


10260 


TGCGTCTGCC 


CCAGCTCAGA 


CTGTGATTGA 


TAATTATGGA 


TTTACGGTAG 


AGAATATCGT 


10320 


TGCTCAAGTT 


AAGTCCCTAT 


AGAAACCAAT 


TACAATGAAG 


ATACAGCTGT 


TGTCAGACTA 


10380 


GCAGATGTAG 


TGATAGACAC 


TAATCAGATG 


ATTGGTTATT 


TAAAAACTGT 


AATGAAAATG 


10440 


TAATAATTTA 


TCTACGAAAG 


TTATAGTAGA 


TAGTATACAC 


AATAGAGTAT 


ACCCTGAAAC 


10500 


GGTTGCGAAG 


TACGCTAATC 


ACTTTGCTAC 


TGATCTAGAT 


AGTTTCTTTA 


ATCAATAAAC 


10560 


ACAGCATCCA 


CAGATTGACT 


TAGGATATTG 


TAAGTTTTTT 


GAAAGCTAGA 


GAGAAGGTCT 


10620 


CTAAAATTAA AAAACGCATA 


GTATAGGATG 


TTGAAATGAT 


GAACTGCACC 


CCAAAAGTTA 


10680 


GACAGAAAAA 


AATCTAACTT 


TTGGGGTGTT 


TTTATTATGA 


AATTAACTTA 


TGATGATAAA 


10740 


GTTCAGTTCT 


ATGAACTTAG 


AAAACAAGGA 


TATATCTTAG 


AGAAGCTTTC 


AAATAAATTT 


10800 


GGGATAAATA ATTCTAATCT 


TAGGTACATG 


ATTAAATTGA 


TTGATCGTTA 


CGGAATAGAG 


10860 


TTCGTCAAAA 


AAGGGAAAAA 


TCGTTACTAT 


TCTCCTGATT 


TAAAACAAGA 


AATGATTCAT 


10920 


AAAGTCTGAC 


ATGAAGGCTG 


GACTAAAGAT 


AGAGTTTCTC 


TTGAATACGG 


TCTCCCAAGT 


10980 


CGTACGATAC 


TTCTTAACTG 


GCTAGCACAA 


TACAGGAAAA 


ACGGGTATAC 


TATTGTTGAG 


11040 


AAAACAAAAG 


GGAGAGTACC 


TGAGAGCGGA 


GAATGCCATC 


CTAAAAAAGT 


TAAGAGAACT 


11100 


CCGATTGAAG 


GAGGAAAAAG 


AGAAATAAGA 


AAGACAGAAA 


TTGTTCAAGA 


ATTAATGACT 


11160 


GAGTTTTCGT TAGATCTTCT 


TCTAAAAGCC 


ATTAAACTAG 


CTCGTTGGAC 


CTACTACTAT 


11220 


CACTTGAAAC 


AGCTAGATAA 


ACCAGATAAG 


GACCAAGAGC 


TTAAAGCTGA AATTCAATCC 


11280 


ATCTTTATCG AACACAAGGG 


AGATTATGCT 


TATCGCCGGG 


TTCATTTAGA 


ACTAAGAAAT 


11340 


CGTGCTTATC 


TGGTAAATCA 


TAAAAGAGTT 


CAAGGCTTGA 


TGAAAGTACT 


CAATTTACAA 


11400 


GCTAGAATGC 


GACAGnAACG 


AAAATATTCT 


TCTCATAAAG 


GAG 




11443 



(2) INFORMATION FOR SEQ ID NO: 50: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5338 base pairs 



WO 98/18931 



PCT/US97/19588 



452 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

CCAATTACAT TATATTATCA AAATCGTCGA AACTGGCTCC ATGAATGAGG CAGCCAAGCA 60 

ACTCTTTATC ACTCAGCCAA GTCTCTCCAA TGCAGTGCGA GATTTGGAAA ATGAAATGGG 120 

CATTGAGATC TTTATCCGCA ATCCCAAGGG AATCACCTTG ACCCGTGATG GCATGGAGTT 180 

TCTCTCTTAT GCCCGTCAGG TTGTCGAGCA GACCCAGCTT CTGGAGGAAC GCTATAAAAA 240 

TCCTGTCGCC CACCGCGAAC TCTTTAGCGT TTCGTCTCAA CACTATGCCT TTGTGGTCAA 300 

TGCCTTTGTC TCTTTGCTCA AGAAAAGCGA TATGGAGAAA TACGAACTCT TCCTTCGTGA 360' 

AACTCGGACT TGGGAGATTA TCGACGACGT CAAGAACTTC CGCAGTGAGG TCGGGGTCCT 420 

CTTCTTAAAC AGTTACAACC GTGATGTTTT AACCAAGATG CTGGATGACA ATCACCTGCT 4 80 

AGCCCACCAT CTCTTCACAG CGCAACCGCA TATCTTTGTC AGCAAGACCA ACCCTCTGGC 540 

AAAGAAAGAC AAGGTGAAAC TGTCTGATTT GGAGAATTTC CCTTACCTCA GCTATGACCA 600 

AGGGACGCAC AACTCCTTCT ACTTTTCAGA AGAGATTCTT TCTCAAGAAC ACCACAAGAA 660 

ATCCATTGTG GTCAGTGACC GTGCCACCCT CTTTAATCTC TTGATTGGTT TGGATGGTTA 720 

TACCATTGCG ACAGGGATTT TGAACAGCAA CCTAAACGGA GACAATATCG TTTCTATCCC 780 

ACTGGATATT GATGACCCGA TCGAGCTGGT CTATATCCAG CATGAGAAAA CCAGCCTATC 840 

TAAGATGGGC GAACGCTTTA TAGACTATCT CCTAGAAGAA GTTCAGTTTG ATAGTTGAGA 900 

AATGATAAGA ACCAATATGT AGGCTAGCAA CAACCTGCAC ATTGGTTCTT TTTACTTATA 960 

ATTAAAAGTT TCCCCTGCCA ACTTATCAGC TAGCTTGGGA AAGAGAGTAT AAAACTTATG 1020 

GGCTAGGTTC AACAAAATCG GGAGATTGAG TTCTCGTTTG TTTTTTCCTA TAATCTTGAC 1080 

AATCTTTTTA GCCACTGCAT CTGGTTCTAG CAGGAAGCGA TCAACCGATT TAAGATAAGT 1140 

TCCATCTGGG TCGGCTTGGT CGAAAAATCC TGTACGGATT GGTCCTGGAT TGACTGTTGT 1200 

CACATAGACT CCATAGGGCA TAAGTTCGAG TCGCAGAGCA TTTGAAAAAC CAATAGCCGC 1260 

AAACTTGGTC GCTGAGTAAA GACTAGACTT GCCAGTAGCT ATTAGACCTG CCATGCTGAC 1320 

GATGTTGATG ATATGCCCTT TGCTGCTTTC CTTCATACGA GCCGCAAGGT GACGAGACAG 1380 

ATT CATC AGG GCAAAGGTAT TGACCTCAAA CATCTGGTGA ATATCTTTAT CAGCAATCTG 1440 

GTCAAATCCC TCAAAAATCC CGTAACCAGC GTTGTTAATC AAGACATCAA TCTTGCCATA 1500 

GCGGAGATAA AGATCAGTTA CCAGAGCTTC TAGGGCTGAA TCGTCGGTAA TATCAATTTC 1560 
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AATCAATTCT GCATGGGAAT AATTTCCGTA GAGTTGGGCT AATTTTTCCT TATTTCTACC 1620 

AAGCAAGATG AGTTGGTCAT TGGGCAGGAG TTTGACCATT TCTTGAGCTA GACCACCGCT 1680 

AGCTCCGGTA ATGAGAATAG TAGGCATACT TATCCTTTCT GTGACTGCTA GATTTCCACT 1740 

TCTTCCAAGT CTTTGACCAC ATGGACATTT TCAAAAATTG TGGCAGCGTC TTTCTTGAGT 1800 

TTGCTAATAT CTTTTGAGAG GAAACGGGCA CTGATATGGT TGAGTAGGAG GCGTTTGGCA 1860 

CCTGCTTCTA CCGCTACTTG TGCAGCTTGC ATATTAGTTG AGTGACCATG GTTACGAGCA 1920 

ATTTTTTCAT CACCCTTGCC ATAAGTGGAC TCATGAACTA GGACATCTGC ATTGACAGCC 1980 

AGACGCACAC TGGCACCCGT TTTTCGAGTG TCTCCTAAAA TAGTGATAAT CTTACCTGGA 2040 

CGTGGCGCTG AGATATAGTC TGCTGCCTTG ATTTCAGTTC CGTCTTCCAA AACAAGATCC 2100 

TGGCCGTTTT TGATTTTACC AAAAAGCGGG CCGAACGGAA CACCAGCAGC CTTGAGTTTT 2160 

TCAGCATCCA GCGTCCCTTC TAGATCCTTT TGCATGACAC GATAGCCAAC ACAGAAAATA 2220 

GTGTGGTCCA GCTCCTCTGC ATACACAGTG AATTTATCGG TTTCAAGAAT TTTACCCAGA 2280 

GAATCTTGGT CAAACTCATG GAAATGAATG CGGTAGGGCA GACGAGAACC TGACACACGA 2340 

AGGCTGGTTA AGACAAATGA CTTGATTCCT TGAGGTCCGT AGATTTCCAA ATCTGTCTGC 2400 

TCTTCATTGG CCTGAAAGGC ACGGCTAGAA AGGAAACCTG GCAAACCAAA AATGTGGTCT 2460 

CCATGCAGAT GGGTAATAAA GATTTTGCTG ACCTTACGTG GTCGAATTGT GGTTTCCAGA 2520 

ATGCGATTTT GCGTACCTTC TCCACAGTCA AAGAGCCAAA CTTCGTTAAT CTCATCCAAA 2580 

AGTTTCAGGG CGAGACTTGA AACGTTGCGG GCTTTAGAGG GCTGACCAGC CCCCGTTCCT 2640 

AAAAATTGAA TATCCATTCG ATACTTTCTA ATTAATCAAT ATATAACATG GCTGTGCGGT 2700 

TTTCCGATCG GAAATAGCGT TTGCCAGAAA AAGCAGCAGC TTCTTGCAAT AAATCCTCTT 2760 

GGCTGTAGCC TTTGAGACGT TTTCGACCAT CAGCCAATCT TTCCAAATCA GTCAAAGCTG 2820 

TGAGACTTTC TAGGCTGATA ACTTCCTCGT CCTCGACAGG CTTCATGTAA ATCTTACCAG 2880 

ACTCTTCAAA GACTAATTGA TGGGGGAAAA TTTGCGCAAT TTCAAAGAGC AAGTCATCCG 2940 

AGATTTTCTC CTCATTTTCA AAGAAAATCC GACCAAGGCC GTCACTCTCA TAACAAAAAC 3000 

CAAAGGATTT ACCAGACAGA TTAAGCCGAA TAAAAGGCTT ATTTTCTAGG GTGAAACTTG 3060 

GCTCAGTATT GTAAAGATTC AGTTCCTGAC TGAGTTCTGC AAAATAATCC GTCGCAGCCT 3120 

GAGGACTCTT TTTCTGATAG AGTTCTGCAA AGTAGGCATT AACAACACTT GGCGGAGGTG 3180 

TAATAAGTGT TAACTGCTCC TGATCTGTTT TACCAGCTAG AAGCTGATCC AGATAGACCT 3240 

TGTCCAGACT TGTATAACCT CCATACTTTA GAGCCAAAGT TTTAATATCA GTCATAAAAT 3300 
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TCTTCTAACC TCCATTTATT TTTCTCGGAA ATGTAGCCTG TAATCACTTC GCCGTCTTCC 3360 

TGATAATCAC GTTCTTCCAG AATTGCAACA CTCTCTAAAT CATGAATCTT GTAGGACTTT 3420 

GAAAAAGGCA CTCGCAGGGT AAATGCTTCA AAAATTTCCT TAATCTTATC TAGCAATAAT 3480 

GCTTGCAAGT TTTCACGACT GTCCTCAGAC TTGGCAGAAA TGAGGGTATA TGGCGTTTGG 3540 

GTAGGCGTGA AATCCTCCAC CAAATCCGCT TTATTATAAA GCGTCAAGTG AGGAATATCT 3600 

TCCATGTCCA GGTCTTTCAT GATGGAGAGA ACCGTTTTTT CATGCTCCTC GTGGTAAGGA 3660 

TTGCTAGCAT CGATAACATG AACCAGAAGG TCCACATGCT TGCTTTCTTC CAAGGTTGAC 3720 

TTGAAACTGG ACACCAACTC TGTCGGCAAA TCTTGGATAA AGCCAACGGT ATCTGTCAAA 3780 

GTTACTTGGA GATTGCCTCC CAGATGAATA CTCTTGGTTG TCGCATCCAG AGTCGCAAAG 3840 

AGCTCATCTG CTTCATACTG GGTCTTACTG GTCAAGATGT TCATGATAGT TGATTTCCCA 3900 

GCATTAGTAT AACCAATCAA ACCAATCTTA AAAGTGCTAG ACTCCAAACG TTTTTCTCTG 3960 

ACAGTCGCAC GATTTTTCTC AACCACCTTG AGCTGGCGCT CGATATCCGT GATTTGATTG 4020 

CGAACGCTAC GACGGTTCAG CTCCAGCTGG CTTTCACCAG GACCACGGGA ACCAATTCCC 4080 

CCTgCCTGAC GGCTGAGCAT AATCCCCTGA CCAACCAAGC GAGGCAAAAG GTATTTGAGT 4140 

TGGGCTAGGT GGACTTGGAG CTTCCCTTCA TGGCTTCGAG CCCGCATGGC AAAGATATCC 4200 

AAAATCAACT GCATACGGTC AATGACCTTA ACACCGAGAA CTTCCTCTAG ATTGACATTC 4260 

TGCCTTGGGG TCAGACGATT GTTGACGATG ACAGTAGTGA TTTCTTCTGC ATCCACCATA 4 320 

AGCGCAATCT CTTCCAACTT ACCAGAGCCG ACGAAGGTCT TGGAATCATA TTTTTCACGT 4380 

TTTTGTCTGT AGCTATCTAC AACGACTGCC CCTGCCGTTT TCGCTAAACT AGCCAATTCT 4440 

TCCATGGAGA GGTCAAAACT GTCCATACCC TGCAATTCCA CACCAATCAG CAGGACTCGC 4500 

TCCTCTTTTT TCTCCGTTTC AATCATCTAA AAACTCCTCT ATCTGGCTTA AAATGCGGTC 4560 

TTGTACACCA GATTCTCCAA TCTGATAAAA GGTGACCTGC ATGCGATTAC GGAACCAGGT 4620 

CAGCTGACGC TTGGCAAAAC GACGAGTCGC CTGTTTAAGA CTCTCACTAG CTTCCTCCAA 4680 

GGTCTGCTCT CCACGGAAAT AAGGAAAGAG TTCCTTATAG CCAATTCCTT TAGCAGCCTG 4740 

TACATTAGGG GAATGGTCAA ACAGCCACTT GGCCTCATCC AAAAGCCCAG CCTCAAACAT 4800 

CAAATCCACT CGGTGGTTGA TACGCTCATA AAGTTGACTA CGTTCATCAT CCAAGCAGAT 4860 

AATCAGCGGT TCATACAAGG TCTCTTGATT TTCCAAATCC TGACCAAAAT GGGCAATTTC 4920 

TAAGGCACGC ATAGCACGAC GACGATTAAA CTGGGGAATC TCAAGGCCTG CTTGATCCAC 4980 

CAAATGGGCT AATTCCTCAT CTGAATATGG CTCCAAACTA GCTCGATAAG CTAAAATCTC 5040 

CTCATGAGGA GTCTCCCCAC CTAGGTGGTA ACCTTCTAGC AAGCTCTGGA TATAAAGTCC 5100 
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AGTCCCACCG GCGATAATGG CTAGCTTGCC ACGGTTGTGA ATACCCTCAA 



TAGTCATCTT 



5160 



AGCTTCTGAA ACAAAATCAA AAGCCGAGTA AGACTCGGTT ATCTCTCTAA 



CATCGATTAA 



5220 



ATGATGAGGA ACAGCTGCCT GCTCTTCTGG ACTAGCCTTG GCCGTCCCAA 



TATCAAGTCC 



5280 



TCGATAGACT TGCTGGCTAT CTCCACTAAC CACTTCGCCA TTAAAACGCT 



TTGCGGGG 



5338 



i2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 19446 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

CGGAAACCCA TCTAGTCTCC ATCGTTTGGG AGACCAAGCA ACACGAATCT TAGATGCTTC 60 

TCGCCAACAG ATTGCAGATT TAATCGGTAA GAAAAGCGAT GAAATCTTCT TTACCTCGGG 120 

TGGAACAGAA GGGGATAACT GGCTTATCAA GGGTGTGGCC TTTGAAAAAG CTCAGTTTGG 180 

CAAGCACATC ATTGTTTCAG CCATTGAACA TCCAGCAGTC AAAGAGTCAG CCCTCTGGTT 240 

GAAAAGTCAA GGATTTGAAG TGGATTTTGC TCCAGTTGAT AAGAAAGGCT TGGTCGATGT 300 

TGAGGCGTTA CAGGTTTGAT ACGGCATGAT ACAATCCTCG TTTCCATCAT GGCTGTGAAC 360 

AATGAAATCG GCTCTATCCA ACCTATTGAG GCTATTTCAG AATTCTTGGC AGACAAGCCG 420 

ACTATTTCCT TCCACGTTGA TGCGGTTCAG GCGCTTGCCA AAATTCCGAC ' TGAAAAGTAT 480 

CTGACAGAAC GGGTGGATTG CGCGACTTTC TCTAGTCACA AGTTCCACGG GGTTCGAGGT 540 

GTTGGCTTTG TCTATATCAA ATCTGGCAAG AAGATTACAC CTCTTCTTAC AGGTGGTGGC 600 

CAGGAGCGAG ATTATCGTTC GACAACTGAA AATGTGGCAG GGATTGCAGC GACAGCCAAG 660 

GCCCTCCGTT TGTCTATGGA AAAGCTAGAT ATCTTTAGGA GCAAGACTGG GCAGATGAAG 720 

GCAGTGATTC GCCAAGCTCT TCTGAACTAT CCGGATATTT TTGTCTTTTC AGATGAGGAA 780 

AACTTTGCAC CTCATATTCT GACTTTTGGA ATCAAAGGTG TTCGAGGTGA AGTCATCGTT 840 

CACGCCTTTG AAGACTATGA TATTTTCATC TCAACAACCT CAGCTTGTTC ATCTAAGGCA 900 

GGAAAACCAG CCGGTACCTT GATTGCCATG GGAGTGGACA AAGATAAGGC CAAGTCAGCT 960 

GTGCGTCTTA GCCTAGACTT GGAAAATGAT ATGAGTCAGG TCGAGCAGTT TTTGACCAAG 1020 

TTAAAATTGA TTTACAATCA AACTAGAAAA GTAAGATAGG AGCATTCATG CAGTATTCAG 1080 

AAATTATGAT TCGCTACGGA GAGTTGTCAA CCAAGGGTAA AAACCGTATG CGTTTCATCA 1140 
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ATAAACTTCG TAATAATATT TCGGACGTTT TGTCTATCTA TACCCAAGTT AAGGTAACAG 1200 

CAGATCGCGA CCGTGCCCAC GCTTACCTCA ATGGAGCTGA TTACACAGCA GTTGCAGAAT 1260 

CTCTCAAACA AGTTTTTGGA ATTCAAAACT TTTCTCCTGT TTATAAGGTT GAAAAATCTG 1320 

TAGAAGTTTT GAAGTCTTCT GTCCAAGAGA TTATGCGGGA CATCTACAAG GAAGGTATGA 1380 

CCTTTAAGAT TTCTAGCAAG CGTAGCGACC ACAACTTTGA ACTTGATAGT CGTGAACTCA 1440 

ACCAAACACT TGGAGGGGCT GTATTCGAAG CCATTCCAAA TGTGCAAGTT CAAATGAAAA 1500 

GTCCTGACAT CAATCTTCAG GTGGAGATTC GTGAAGAAGC AGCCTATCTT TCTTATGAAA 1560 

CCATTCGTGG GGCTGGTGGT TTGCCAGTTG GAACTTCAGG TAAAGGGATG CTCATGTTGT 1620 

CAGGAGGGAT TGACTCACCT GTAGCAGGTT ATCTTGCTCT TAAGCGTGGG GTGGATATCG 1680 

AGGCAGTTCA CTTTGCTAGT CCACCATATA CTAGTCCTGG TGCCCTCAAG AAAGCGCAGG 1740 

ACTTGACCCG TAAATTGACC AAGTTTGGCG GAAATATCCA GTTTATAGAG GTGCCTTTCA 1800 

CAGAGATTCA AGAGGAAATC AAAGCCAAAG CGCCAGAAGC TTATTTGATG ACTCTAACTC 1860 

GTCGCTTTAT GATGCGGATT ACTGACCGTA TTCGTGAGGT ACGAAATGGT TTGGTTATCA 1920 

TCAATGGGGA AAGTCTAGGT CAAGTAGCCA GCCAAACCCT TGAAAGTATG AAGGCTATCA 1980 

ATGCTGTTAC CAACACTCCC ATCATTCGTC CTGTGGTTAC CATGGACAAG TTGGAAATCA 2040 

TTGACATCGC CCAGGAAATC GATACCTTTG ACATTTCAAT CCAACCGTTT GAAGACTGTT 2100 

GTACCATTTT TGCACCAGAT CGTCCAAAAA CAAATCCTAA AATTAAGAAT GCGGAGCAGT 2160 

ACGAAGCGCG TATGGATGTT GAAGGCTTGG TTGAGCGAGC AGTGGCTGGA ATCATGATTA 2220 

CTGAAATCAC ACCTCAAGCC GAAAAAGATG AAGTTGATGA CTTGATTGAC AATCTGCTCT 22 80 

AATTCAGAAA ATCCAAAAGA ATAGCGAAAA TCAGTAAAAA AAGTTAGTTT TTTCTCTAAA 2340 

AACAGGTAAA AAACTAACTT TTTTTATTTT TATGATATAA TGATATAAAA TTTTGAATAT 2400 

AGAGAGTTTT CTGACAATGA ATCAATCCTA CTTTTATCTA AAAATGAAAG AACACAAACT 2460 

CAAGGTTCCT TATACAGGTA AGGAGCGCCG TGTACGTATT CTTCTTCCTA AAGATTATGA 2520 

GAAAGATACA GACCGTTCCT ATCCTGTTGT ATACTTTCAT GACGGGCAAA ATGTTTTTAA 2580 

TAGCAAAGAG TCTTTCATTG GACATTCATG GAAGATTATC CCAGCTATCA AACGAAATCC 2640 

GGATATCAGT CGCATGATTG TCGTTGCTAT TGACAATGAT GGTATGGGGC GGATGAATGA 2700 

GTATGCGGCT TGGAAGTTCC AAGAATCTCC TATCCCAGGG CAGCAGTTTG GTGGTAAGGG 2760 

TGTGGAGTAT GCTGAGTTTG TCATGGAGGT GGTCAAGCCT TTTATCGATG AGACCTATCG 2820 

TACAAAAGCA GACTGCCAGC ATACGGCTAT GATTGGTTCC TCACTAGGAG GCAATATTAC 2880 

CCAGTTTATC GGTTTGGAAT ACCAAGACCA AATTGGTTGC TTGGGCGTTT TTTCATCTGC 2940 
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AAACTGGCTC CACCAAGAAG CCTTTAACCG CTATTTCGAG TGCCAGAAAC TATCGCCTGA 3000 

CCAGCGCATC TTCATCTATG TAGGAACAGA AGAAGCAGAT GATACAGACA AGACCTTGAT 3060 

GGATGGCAAT ATCAAACAAG CCTATATCGA CTCGTCGCTT TGCTATTACC ATGATTTGAT 3120 

AGCAGGGGGA GTACATCTGG ATAATCTTGT GCTAAAAGTT CAGTCTGGTG CCATCCATAG 3180 

TGAAATCCCT TGGTCAGAAA ATCTACCAGA TTGTCTGAGA TTTTTTGCAG AAAAATGGTA 3240 

AGTTAAGAAA GGAAAAAACG AAATGCATAT TGAACATCTT AGCCACTGGA GTGGTCATCT 3300 

TAACCGTGAA ATGTACCTTA ACCGTTATGG ACATGGTGGG ATTCCAGTTG TGGTCTTTGC 3360 

TTCATCAGGT GGTAGTCACA ACGAATACTA TGATTTTGGC ATGATTGATG CCTGTGCTTC 3420 

CTTTATCGAG GAAGGCCTTG TCCAGTTCTT TACCCTATCT AGTTTGGATA GTGAGAGCTG 3480 

GTTGGCTACT TGGAAAAATG CTCATGACCA AGCGGAAATG CACCGTGCCT ACGAACGTTA 3540 

TGTGATTGAG GAGGCCATTC TTTTATCAAG CACAAGACAG GTTGGTTTGA TGGCATGATG 3600 

ACGACAGGTT GCTCTATGGG AGCCTATCAT GCACTCAATT TCTTCCTCCA GCATCCAGAT 3660 

GTCTTTACCA AAGTGATTGC TCTCAGTGGT GTTTACGACG CACGTTTCTT TGTCGGTGAT 3720 

TACTACAACG ATGATGCTAT TTACCAAAAC TCGCCAGTAG ATTATATTTG GAACCAAAAC 3780 

GACGGCTGGT TTATTGACCG TTACCGTCAG GCAGAGATTG TGCTGTGTAC GGGGCTTGGA 3840 

GCCTGGGAAC AAGATGGTTT GCCATCCTTT TACAAGCTCA AAGAAGCCTT TGACAAGAAA 3900 

CAAATTCCAG CCTGGTTTGC TGAATGGGGA CATGATGTCG CCCATGACTG GGAATGGTGG 3960 

CGTAAACAAA TGCCTTATTT CCTCGGTAAT CTCTATTTAT AAAAGGAGTT ACCTATGAAT 4020 

TACCTTGTTA TTTCTCCCTA CTATCCACAA AACTTTCAAC AGTTTACCAT CGAACTAGCT 4080 

AATAAAGGCA TCACAGTCTT GGGAATTGGT CAAGAGTCTT ACGAGCAATT GGATGAGCCC 4140 

TTGCGCAATA GCTTGACCGA GTATTTTCGT GTTGATAATC TTGAGAACAT AGATGAAGTC 4200 

AAACGTGCAG TTGCTTTTCT CTTTTATAAA CATGGTCCAA TTGGCCGCAT CGAGTCTCAC 42 60 

AATGAATACT GGCTTGAGCT AGACGCAACA CTCAGAGAAC AATTCAATGT TTTTGGTGCC 4320 
AAACCAGAGG ATCTCAAAAA GACGAAATAT AAGTCTGAAA TGAAGAAACT TTTCAAAAAA . 4380 

GCAGGTGTTC CTGTGGTACC TGGAGCTGTT ATCAAGACGG AAGCAGATGT TGATCAAGCA 4440 

GTGAAAGAAA TCGGTCTTCC AATGATTGCC AAACCTGATA ATGGAGTGGG AGCAGCCGCA 4500 

ACCTTTAAAC TTGAGACAGA AGACGATATC AATCACTTCA AGCAAGAATG GGACCATTCA 4560 

ACCCTTTATT TCTTTGAAAA ATTTGTCACT TCCAGCGAAA TCTGTACCTT TGACGGGCTC 4620 

GTGGACAAGG ATGGAAAGAT TGTCTTCTCA ACAACCTTTG ACTACGCCTA TACACCGCTT 4680 
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GACCTCATGA TTTATAAGAT GGACAATTCT TATTATGTGC TCAAGGATAT GGATCCTAAA 4740 

CTGCGCAAGT ATGGGGAAGC AATTGTCAAA GAATTTGGTA TGAAAGAACG GTTTTTCCAT 4800 

ATTGAGTTCT TCCGTGAGGG GGACGATTAT ATTACCATCG AGTACAATAA CCGCCCTGCA 4860 

GGTGGTTTTA CCATTGATGT TTATAACTTT GCTCATTCCT TGGACCTTTA TCGTGGCTAT 4920 

GCAGCTATTG TCGCAGGAGA GGAGTTCCCG GCGTCAGACT TTGAAACTCA GTATTGTTTG 4980 

GCTACTTCTC GCCGTGCAAA TGCTCACTAT GTTTATTCAG AAGAGGATTT GCTTGCCAAA 5040 

TATAGCCAGC AGTTCAAGGT TAAAAAAGTC ATGCCAGCTG CCTTCGCGGA ACTTCAAGGA 5100 

GATTACCTGT ATATGCTGAC CACTCCGAGT CGACAAGAAA TGGAGCAGAT GATTGCAGAT 5160 

TTCGGACAAC GTCAAGAATA AGAACTATCG GATTAAGGAA ATTAACTCCC TTAATCCTTT 5220 

TGTTTTGTCT GATAAAAAAT AAGAGCATCC CAACAAGGTA GCTATCATAA AACTTGTTCG 5280 

ATAACTATTT GAAGCAGGAT TAGGTGGTCA GAAATTAAAT TTTAATATTT CAATTGAGTC 5340 

ATAGTATTGT GTTTGCGTAT CCTTAAATCA GCTAAAAGGA TCCATGACGA CACCTATACG 5400 

ATATAGTTTT CAAGATACCA AACAAGTCTA TTAATATTCA ATGAAAATCA AAGAGCAAAC 5460 

TAGGAAGCTA GCCGCAGGTT TCTCAAAACA CTGTTTTGAG GTTGTGGATA GAACTGACAG 5520 

AGTCAGTATC ATATACTACG GCAAGGTGAA GCTGACGTGG TTTGAAGAGA TTTTCGAAGA 5580 

GTATAAAATA TTCAGGTGAC GCATAGATAT AGTTAATTGA AGCTTTGTTT GAAATCTGAT 5640 

AAAATAATGA TATTACTAAG TTTTAAAAAC TAAAGAAAAG GGAAGATATG ATTACAGGCG 5700 

AATTAAAAAA TAAAATCGAT CAGCTGTGGG AAATTCTTTG GACAGAAGGA AACGCAAATC 5760 

CTTTAACAAA TATTGAACAG TTGACTTATC TCTTATTTAT GAAAGATTTG GATAGTGTCG 5820 

AGCTTGGACG TGAAAGTGAT GCTGAATTTC TAGGGATTCC TTATGAGGGA GTTTTTCCAA 5880 

AAGATAAACC TGAATACCGT TGGTCAACTT TTAAAAATAT AGGAGATGCT CAGGAAGTTT 5940 

ATCGTTTAAT GACTCAGGAG ATTTTTCCGT TTATTAAAAA TCTCAAGGGG GATACAGATG 6000 

ATACAGCCTT TTCACGATAT ATGCGAGAAG CTATTTTTCA AATAAATAAA CCTGCTACGC 6060 

TTCAAAAGGC AATTTCTATC TTAGATGTTT TTCCAACTAG GGGATTAGAT GTAGATTTTG 6120 

ATAATGACAA ACAAAGTATT ACTGATATCG GAGATATCTA TGAATATCTG TTATCAAAAT 6180 

TGTCGACCGC AGGTAAAAAT GGACAGTTCC GTACACCTCG TCACATCATC GATATGATGG 6240 

TTGAGTTGAT GCAACCGACT ATCAAAGATA TCATCTCAGA TCCCGCTATG GGTTCTGCTG 6300 

GCTTCTTAGT ATCTGCTAGC CGTTACTTAA AGCGTAAGAA AGATGAATGG GAAACCAATA 6360 

CAGATAATAT CAATCATTTT CATAATCAGA TGTTTCATGG AAATGATACG GATACGACTA 6420 

TGTTGAGACT TGGGGCGATG AACATGATGC TACATGGAGT AGAAAATCCA CAAATCAGTT 64 80 
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ACCTTGACTC GCTGTCTCAA GATAATGAAG AAGCCGATAA ATATACTTTG GTTTTAGCAA 6540 

ATCCTCCTTT TAAGGGCTCA CTTGACTACA ATTCAACCTC TAATGACCTT CTTGCAACCG 6600 

TAAAAACCAA AAAAACAGAA TTACTCTTTC TTTCTCTTTT CTTGCGAACT TTAAAACCAG 6660 

GTGGACGAGC AGCAGTTATC GTACCTGATG GTGTCCTTTT TGGTTCGTCT AAAGCTCATA 6720 

AAGGAATTCG TCAGGAAATT GTAGAGAATC ATAAGCTTGA TGCTGTAATC TCAATGCCTA 6780 

GTGGTGTGTT CAAGCCTTAT GCTGGAGTTT CAACTGCCAT TCTCATCTTT ACAAAAACTG 6840 

GTAATGGTGG TACTGACAAA GTCTGGTTTT ACGATATGAA AGCGGATGGT TTAAGTTTGG 6900 

ATGATAAGCG ACAACCGATT AGCGACAATG ATATTCCAGA TATTATCGAA CGCTTTCATC 6960 

ATCTTGAAAA AGAAGCAGAA CGTCAGAGAA CGGATCAATC TTTCTTTGTT CCAGTTGCTG 7020 

AGATAAAGGA AAATGATTAT GATTTGTCTA TCAATAAATA TAAAGAGATT GAGTATGAAA 7080 

AAGTTGAGTA TGAACCAACA GAAGTCATAT TAAAGAAAAT CAATGATTTA GAAAAAGAAA 7140 

TTCAAGCTGG CTTGGCTGAA TTGGAAAAAT TACTCAAGTA GGGAGGTGGC TGTATGAAAA 7200 

AAGTGAAGTT GGGGGAAGTC TTATCTCTAA AAAAAGGCAA GAAAGCCACT GTACTTGCTG 7260 

AACAAACAAC TCTAAGCCAA CGTTATATTC AAATAGATGA TTTAAGAAAT AATAATAATT 7320 

TAAAATTCAC TGAAAGTTTA AATATGACTG AAGCACTCCC AGATGATATT CTGATAGCAT 7380 

GGGATGGAGC TAATGCAGGA ACAGTTGGTT ATGGATTATC GGGAGCTGTT GGTAGTACAA 7440 

TTACGGTCTT AAAAAAGAAT GAGCGATACA AAGAAAAAAT TATATCAGAT TACTTGGGAG 7500 

TCTTTTTGGA AAGTAAATCG CAGTATTTAC GAGATCATTC AACAGGTGCA ACAATTCCTC 7560 

ATTTAAACAA GAATATATTA CTTGATTTAC AATTAGAATT GCTAGGTATC GAAGAACAAG 7620 

AGAACATTAT CTGTATTCTT AATACGATTA AAAGGCTTAT TACTAAAAGA AAATTTCAGT 7680 

TAGATGAACT AAACTTGCTC GTCAAATCCC GATTTAACGA GATGTTTGGG GAAAATAAAA 7740 

TATTTGAAAG CATTGATAAC TTATTTGATA TTATAGATGG TGATAGGGGC AAAAATTATC 7800 

CTAAATCAGA TGAGTTGTTT AGTGAGGAGT ACTGTTTATT TTTAAATACA AAGAATGTTA 7860 

CTAAAAACGG ATTTTCATTC GATACAAAGC AATTTATCAC TAAAACAAAG GATAAATTAC 7920 

TTCGAAAAGG CAAACTTGAG CGTTATGATA TAGTCTTGAC AACAAGAGGT ACTGTTGGAA 7980 

ATGTAGCGTA CTACGATGAA TTAATAAAAT ATAAACATTT ACGTATAAAT TCAGGTATGG 8040 

TAATATTACG TCCCAAGACA CCAAATCTAA ATCAGAAATT TATTATCCAT GTTTTAAGGA 8100 

ATAATAATTA TAGTCGAGTG ATATCAGGAA GTGCTCAGCC TCAGTTACCA ATTACAAAAT 8160 

TAAAAAAAAT ACTTCTCCCC CTCCCCCCAC TAGCCCTCCA AAATGAGTTC GCAGACTTTG 8220 
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TAGTCCAGGT CGACAAATCA CAATTTGCTT GTGAGATAGC TATAAAAGTG TGGAGAAATA 8280 

GCTTGAAATT TAGTATAATA TAGCTAAACT ATTTGTTTAA AGTGAGAAAA AAATGGGAAA 8340 

TTTTAGCTTT CTTTTAAAAA ATGACGAATA TGAATCTTTT TCAAAACCTT GCATTGAAGC 8400 

TGAGAATATG ATTGCTACAT CAACTGTGGC TACTGCCTTT ATGGCGCGTC GTGCTTTAGA 8460 

GCAGGCTGTC CATTGGATAT ATAGTCACGA TTCATATTTA GAAGCTCCCT ATCGTGCTAC 8520 

TCTATCTTCT TTAGTATGGG ATGATGATTT TAGGGATATC GTAGATTCTG AACTCCACAA 8580 

GCAGATAGTT CTGTTGATTC GGTGGGGAAA CCATGCTGCT CATGGTGGTG AAATTAAGGA 8640 

ACGAGAAGCG ATTTTAGCTT TGCATCATTT GTATCAGTTT GTTAATTTTA TCGATTATTG 8700 

TTACAGCAAT GAGTTTGTGG AGCGTTATTT TGATGAGAAG TGCTTACCAC TTTCAGCAAA 8760 

CATCAAATAC CGAGAAACTC CACAATCTAT GATAAAGTTA CAAGACAGTT TACCAGAACT 8820 

GCCTGATTTT CATGAACAGA TGGCTGCTCA GTCCGTAGAA GTTCAAGAGA CTTATACTGA 8880 

AAAACGTGAG ACTGCAGCGC AACGGCAAGA TGTGCCTTTC CATATTGATC AATTATCTGA 8940 

GGCAGAGACA AGAAAGCTCT TTATTGATAT CGATCTCCGT TTAGCAGGAT GGATATTTGA 9000 

AGAAAACTGT CGTGTTGAGA TAGCCGTTGA TGGTCTCAAG CACGGTTCAG GAATTGGTTA 9060 

CTGTGACTAT GTACTTTATG GTAAAAATGG GAAAATTTTA GCGATTGTGG AGGCTAAAAA 9120 

AGCCTCTGTC AATCCAGAAG TAGGGGAAGT ACAGGTCAAA GAATATGCTG AAGCTTTGGA 9180 

GAAACATATC GGCTATCAGC CAATTTGCTT TATTACAAAT GGGTTGAAGC ACTATATACT 9240 

TGATGGTCCG AACCGCCGCC AGATTGCAGG CTTTTACTCT CAAGAAGAAT TGCAATTAGT 9300 

GATGGATAGA CGTCATCTTC AAAAACCGCT TGAGGATATT TCTAGTAAAA TTAGGGACGA 9360 

TATTTCCGGG CGTCACTACC AAAAACATGC CATTGCAAGC GTTTGTGAAG CTTTCTCTGA 9420 

TCATCGTAGA CAGGCACTTT TGGTTATGGC AACTGGGGCG GGGAAAACTC GTACAGCAGT 9480 

TTCTCTAGTT GATATCTTAT CACGTCATAA CTGGGTAAAA AACGTTCTCT TCTTAGCCGA 9540 

TAGAACTTCC TTGGTTAAGC AAGCATATGA TTCGTTTAGA AAATTACTCC CAGATCTTTC 9600 

CGTTTGTAAC TTCTTAGAAG ATAAAGAAGG AGCTCAATCA AGTCGCATGG TCTTTTCAAC 9660 

TTATCCGACC ATGATTGGAG CGATTAGTGG TCAAGAAGAA GTAAATCAAC GCCCTTTCAC 9720 

TGTTGGGCAT TTTGACCTTA TCATAATTGA CGAATCTCAC CGTTCTATTT ATCAGAAATA 9780 

CAAGTCCATT TTTGATTATT TTGATGCAAG AATTGTAGGC TTAACAGCTA CTCCGCGTCA 9840 

AGATTTAGAT AAAAACACCT ATGGATTCTT TAATTTGGAG AATGGGGTTC CAACATATGC 9900 

ATATGATTTG GAAGAGGCTG TTAAAGACGG ATATTTAGTA GCCTATCATT CTATCGAAAC 9960 

CAAACTGAAA CTACCTACGG ATGGTCTACA TTATGATGAT TTGTCCGAAG AAGAAAAGGA 10020 
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ATTAGGAAGA ATGTCATCGT TTTCCTAGAA TACAGTATCA GTTGTTAAGT GGTTGATAAA 11820 

TTTCAAAGTA GATACTTGTA CCACGATGTT TGTTGATCGA GTTATTAACA AAAGAGCTAC 11880 

TTTGATTTTA AAGAAATAGA AAACAAAAAG CCGAGCAAGA ATTCAATTGC AGGAGAAAAT 11940 

GAAATAATAC TCAATGAAAA TCAAAGAGCA AACTAGGAAA CTAGCTGCAG GCTGCTCAAA 12000 

ACACTGTTTT GAGGTTGCAG ATGGAAGCTG ACGCGGATTG AAGAGATTTT CGAAGAGTAT 12060 

AAATCTTCCT AGGATAAAGC AAAACGCATA GTATCAAGGG TTTTCAACAC TTGATACTAT 12120 

GCGTTTTCTG ATGTTAAAGA CTTTCTACCA GGTTTTTTAA AAGCATAATT GTTAGTTGTA 12180 

GTCATTTATT ATTCTTCAAA GAAAAATGGT GGGGCGAATT TTTTCAGTTC TTCAAAGCAC 12240 

TTTTGAGCAG TATCTGCATC TTCACAGATG ATAAGACAGA CATCATTACC ACAAAGGGTA 12300 

GCGATAGCGT CAGGGAAGCT CAAAGTATCA ATGATAGAAC CAAAGGATTG AGCCAGTCCA 12360 

GGAAGGGTTT TTAGTAGGAC TTGGTGTTGA ACTGGGCGCA TCCAGACAAG GGCGTCTTCC 12420 

ATGTAGAGTT CGAGACGTTT TTCCCATTTT GAGATGGAAC CATTGTTAAG AACATAATAA 12480 

GCGCTATCTT CTTCGCGGAC TTTTGATAGG TTCATATTTT TGATGTCGCG TGAGAGGGTT 12540 

GCCTGGGTTA CTTGAATGTC GTTCTCAGCA AGAAGGGCTT GCAACTCAGC CTGTGTATGA 12600 

ATCTTGTTTT TTGTGATAAG AGCGCGTATA AGTTGGTGGC GGTGTTCTGA TTTATTCATA 12660 

ATAATGTAAC TCCTTTTAGC AAGGTAAGGT AAGCATGGAC TGAGCGAGGT CGACAGTCAA 12720 

GTGGTAGTCT GTATTGTCAC GGATGGTGAT TTCAAAGTCA GTAGTATAGA GGACTAAACG 12780 

GAGAGTGTCT CCTTCTTTTA GCTTGTAAAT AGTTGGCTGC AGTTCAAATT GAACGTCCAT 12840 

CCATTCATCT GCAGTAATAT CCTCTACTAA CAGTAAATCA TTTCTATTTT GTAAATTAAG 12900 

GTAACCTTTT GTCACGACTC GTTGTGCCTC TGGTCTAAAT GGCAATTCAC AGAGATTTTC 12960 

CAACATGTGA TAGCGACCGT TGTCAATGGT TCTAGCACTT AAAATAGCTG GATAAGGTTG 13020 

TAGGTATTTC TTTTGCCCAA ATTCTAGCAG TTGGGCAGAT AAGAGCCCCT TGTTTGTACT 13080 

GGATTTGATA CGAAGATTGA GCTGAGCGCG ACCGTTTAGG TGGAGATCTT TAGTCACAGG 13140 

AAGGTTAATA GTAATCTGAT TGGCTTTCCC TTGATAGAGC TCTGTATTGA AGGTTTGGTA 13200 

TGTCTTACCA TAGCGCTCAA AATCCTTATC TGGGTACTGG TTTTGAATAG CTTGCTCTTC 13260 

TTGACCAAGT GAGAAGGTTT CACAGTTTTC TTGCCCACCG AAGTTATCAA GTGATAACCA 13320 

AGTCTGTGGA GCTGTATTGT CCTGCCAGAT AACAGTAGGA AGTTGAAAGT CTGTTTCCTG 13380 

TCCTAGTAAT TTCTTGGTCA ATAAGGCATT TATGGACTCA CGGAAGTCAA TTGATTGCCA 13440 

ATTGTTCATG TAAACATGGG CACCATTATG GAAAAAGAGA TGCTTGTGTA TATGAGTAGG 13500 

AAGAGCATGG AACATCTGGT AAACATGAAG TGGTTTGACA TTCCAATCCT GAGAACCATG 13560 
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AGTAAAGACA ACCTCTGCCT TTACTTTATG GGCATTGAGC AGATAATTGC GGTCATGCCA 13620 

AAACTGATTG TAGTCCCCAG TTTTTCGGTC TAGCTGAGCT TTCACTTTTT CTAAGTCAGC 13680 

TTGGTGAGCT TCATTGCCAC GGATATAGTC GCCAGCTAAG AGATTACGAG AATAGGTTAA 13740 

CTCAGCAAGG GAGTCAAAGT CCTCACCTGG ATAACCACCT GGGCTAGTCA CCAGACCGTT 13800 

TTCACGGTAG TAGTTGTACC ATGATGAAAT TCCTGCCTCG GCAATGATAA CTTCTAAACC 13860 

ATCGACTCCT GTAGTCGCAA GACCATTGGA CATGGTACCT AGATAGGAAA GTCCTGTTGT 13920 

AGCAACTTTT CCGTTTGACC AATCAGCCTT GACTTGACGC TGGCGCGTGT GATCAGTAAA 13980 

GGCACGGCAA CGACCGTTAA GCCAATCGAT GACATTTTTA TAAGCCTCGA TTTGCTGGTA 14040 

GTCTCCATTA GTCATGAAAC CTGTCGAGTC TTTGGTACCA ACACCTGAGA CATAGAGATT 14100 

GGCAAAGCCT CTCGGAAGGA AGTAGTCGTT TAGTGTATAG CTAGAGTTGA TGTGAGTTAG 14160 

CTTTTCCTCA GCCTCTGCTA TAAGCTCAGC TTTACCTTGG GGTTGGACGA GATTTAGTTG 14220 

AGGTTTCTCT AGCTCAATCT TGTGAGGAAG CTTAACCTCA AGCTCGCCCT CCATCTTGTA 14280 

GAGAGCCTTG TCACTAGCCT TGTCATTGGT TCCCTGATGA TAAGGGCTGG CTGTCATGAT 14340 

GGCAGGGATT TTTCCATCAA AACGAGGGCG AATAATGCTA ACCTTTACTA GGTCTGATAG 14400 

CCCTTTTTGG TCAGTATCGA CACGAGACTC AACGTAAACG ACTTCACGAA TGACATCCTG 14460 

GTTAGAAAAA GTAGCCAAAC TCTTGCCGTT AAAGTAGTGG TAGTCATTAT CCTCCGGAAT 14520 

AAGACCATCA CTAACAAGTT GGTCGATAAG AGTATTTCCT TTTTTGGTGC GAGTATTGAG 14580 

TAACTGATAG AGATTTTCAA TCAAGTCACC ATATATAATG GGAAATCCAG TTTCTTTACG 14640 

AAAAACGTCA CTATCTTCGA AGTCAACCAA ATAAGAAAAG CCTAAAAGTT GAAAAGCAAC 14700 

AGTATAAAAA ATATCTGCTG TCAGTTCATC TTCTGATTGA AAAAATGTCA GCAGGTCTGT 14760 

TTTTTTATCA GCTGCTAGGA TAGAAAGTGG GTAGTTGGTG TCTTGATAAG TGAAAAAGAA 14820 

ACGACGTAAA AAGGTTTCAA GTGAGTCTTT GTGATTGGCT GTATTTTGTA AATCAAAGCC 14880 

ACATTTTTTT AGTTCAGATA AGACATTTTC TTTTGGAAAA TTGATATAAC TATATTGATT 14940 

AAAACGCATA GAACCTCCAT ATAGAATGAC AGTTAAGGTT ATTATATCAA AAAAAAAGCA 15000 

GAAAGGGAAT TGTTAACTTC AAAAGGAAAT AATCCAATAA AAATGAATAA AGTACTAAAT 15060 

TCAATATAGA GAACAGAGTA ACAATAAGAA TAAATAGATA GGGTATAAAA GTTCTAGGAG 15120 

ATTTATATTA TATGCTTTCT ATTTTTATAT ACAATATAGT ATAAATATAA AAATGATGAC 15180 

AAAAATACAA ATGAATAGAA AATAAATTAG TAAGCTGATG AAATTTTTCT CAAGAGAAGC 15240 

CATTTATAGG TGAAAATGGT ATAATATAGT GAGAAGGATA GAGGAGAAGT GTAAATTGAT 15300 
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CGCACAACTA GATACAAAAA CAGTCTATAG TTTTATGGAA AGCGTCATTT CGATCGAAAA 15360 

GTATGTGAGA GCAGCTAAAG AATACGGCTA CACTCATTTG GCTATGATGG ATATTGACAA 15420 

TCTTTATGGC GCTTTCGACT TTCTAGAGAT TACAAAAAAA TACGGCATTC ATCCTTTGCT 15480 

AGGGCTTGAA ATGACAGTGT TTGTAGATGA TCAGGGAGTG AATTTGCGCT TTTTAGCTCT 15540 

ATCTAGTGTG GGCTATCAGC AGTTGATGAA GCTTTCGACA GCCAAGATGC AGGGGGAGAA 15600 

AACTTGGTCA GTCCTGTCCC AGTACCTGGA GGATATCGCG GTCATTGTGC CTTATTTTGA 15660 

TAGAGTTGAG TCGTTAGAAC TAGGCTGTGA TTACTATATA GGGGTTTATC CAGAAACACT 15720 

AGCAAGCGAA TTTCATCATC CTATCTTACC TCTTTATCGG GTCAACGCTT TTGAAAGCAG 15780 

GGATAGAGAA GTTCTTCAAG TTTTAACAGC GATTAAAGAA AATCTACCGC TCAGAGAAGT 15840 

TCCCTTGCGT TCGAGACAAG ATGTCTTTAT ATCAGCAAGT TCTTTAGAGA AACTATTCCA 15900 

AGAGCGTTTT CCGCAAGCTT TGGACAATTT AGAAAAGCTT ATTTCAGGCA TTTCTTACGA 15960 
CTTGGATACT AGTCTGAAAC TGCCTCGTTT TAATCCAGCT AGACCAGCAG TAGAGGAGTT . 16020 

GAGAGAGCGT GCTGAACTGG GGCTTGTTCA GAAGGGGTTG ACTAGTAAAG AATATCAAGA 16080 

TAGACTAGAC CAAGAATTGT CTGTTATTCA TGATATGGGC TTTGATGATT ATTTCTTGGT 16140 

TGTTTGGGAT TTGTTGCGTT TTGGACAATC GAATGGCTAT TATATGGGAA TGGGAAGGGG 16200 

TTCTGCAGTA GGCAGTTTGG TTTCTTATGC CTTAGACATC ACGGGGATTG ACCCAGTAGA 16260 

GAAAAATCTG ATTTTTGAAC GCTTTCTTAA TCGTGAACGC TATACCATGC CTGATATTGA 16320 

TATTGATATC CCAGATATTT ATCGTCCAGA TTTTATCAGA TATGTTGGTA ATAAATATGG 163 80 

TAGTAAACAT GCGGCACAAA TCGTTACTTT TTCAACCTTT GGAGCCAAGC AAGCTCTTCG 16440 

AGATGTCTTG AAACGCTTTG GTGTGCCAGA GTATGAATTA TCTGCAATTA CTAAGAAAAT 16500 

CAGTTTTCGT GACAATCTTA AGTCGGCCTA TGAGGGAAAT CTCCAGTTTC GTCAGCAAAT 16560 

CAATAGTAAG TTAGAATACC AAAAAGCTTT TGAGATTGCT TGCAAGATAG AGGGCTATCC 16620 

AAGGCAAACC TCTGTCCATG CGGCTGGTGT TGTAATTAGT GACCAAGATT TAACCAACTA 16680 

CATTCCTCTA AAGTATGGTG ATGAAATTCC ACTGACTCAG TATGATGCTC ATGGAGTTGA 16740 

GGCTAGCGGA CTTTTGAAGA TGGACTTTCT GGGACTACGA AATTTGACCT TTGTCCAGAA 16800 

GATGCAAGAG TTGCTTGCTG AAACAGAAGG TATTCATCTG AAAATTGAAG AAATCGATTT 16860 

AGAAGACAAA GAAACGTTAG CTTTATTTGC CTCTGGTAAT ACAAAAGGTA TCTTTCAATT 16920 

TGAGCAACCA GGTGCCATTC GTCTGCTTAA GCGTGTGCAA CCAGTCTGTT TTGAAGATGT 16980 

CGTCGCGACT ACTTCTCTAA ATCGACCGGG TGCTAGTGAC TATATCAATA ATTTTGTGGC 17040 

AAGAAAGCAT GGGCAGGAAG AAGTGACTGT TCTGGATCCA GTACTGGAGG ATATTTTGGC 17100 
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TCCAACCTAC GGCATAATGC TCTATCAGGA GCAGGTTATG 
CGGATTTAGT CTTGGGAAAG CCGATATTTT GCGTCGGGCT 
TGCCATGCAT GAGATGAGGG CTTCCTTTAT TCAAGGTTCA 
GGAAAAAGCA GAGCAGGTCT TTGATGTTAT GGAGAAGTTT 
GTCACACGCC TATGCCTACT CAGCCTTGGC CTTCCAGTTG 
TCCAGCCATT TTTTATCAGG TCATGTTAAA TTCTTCCAAC 
ACTTGAAGCA GGTTTTGAAG TAGCCTCTCT ATCCATCAAC 
AATTGCCAAC AAGGCCATCT ATCTAGGTTT GAAATCCATT 
AGCTCTCTGG ATTATTGAAA ATAGACCTTA TTCTAACATT 
ACCTGAGAAT TATCTGAAAC TTCCTCTGCT AGAACCTTTG 
TTCATTTGAA AAAAATCGTC AAAAAGTATT TAATAACTTA 
GAAAGAGTTG GGAAGTTTGT TTGGAGATGC TATTTATAGT 
GACGGAACAA GAAAAATTTT ATATGGAACA AGAGCTTTTA 
TCCACTACAA GCTATTGCAA GTAAGGCTAT TTACCCGATT 
AGAAAATAGC TATGCTATTA TCTTGGTTGA AGTTCAGAAA 
AAAGGGTGAA AATATGGCCT TCTTACAGGC AGATGATAGT 
TCTCTTTTCA GACTTATATC GTCAGGTTGG ACAGGAAATA 
TGTAAAAGGA AAAATACAAT CACGTGATGG CCGTCTGCAA 
AGAAGCAGTT GCTGAACGCT TTTGGATACA GGTGAAAAAT 
TTCACGCATT TTAGAACAAT TTAAAGGCCC AATCCCAGTC 
ACAGAAAACC ATCGTTTCTC CCCATCATTT TGTAGCTAAA 
ATTGAATGAA ATCGTTATGA AAACGATTTA TCGCTAAAAA 
TCAACGTAAA TGTGGTATAA TCAGTAAGAA TGTTAAAAGA 
GAAACGTATT GCTGTTTTGA CTAGTGGTGG AGACGCCCCT 
TGCAGTTGTT CGTCAAGCAA TTTCAGAAGG AATGGAAGTT 
TGCTGGTATG GTTGCCGGTG AAATTCATCC CCTAGATGCA 
TTCTCGTGGT GGTACTTTCC TTCACTCAGC TCGTTACCCA 
GCAACTTAAA GGGATTGAGC AATTGAAAAA ACACGGAATT 
TGGTGACGGA TCTTACCACG GCGCTATGCG TTTGACTGAA 



CAGGTTGCCC AGCGACTTGC 17160 

ATGGGGAAAA AGGATGCCTC 17220 

TTAGAAGCTG GTCATACTGT 17280 

GCAGGTTATG GTTTTAACAG 17340 

GCTTATTTCA AAACGCATTA 17400 

AGTGATTACT TAATAGATGC 17460 

ACCATTCCCT ATCACGATAA 17520 

AAAGGAGTCA GTAATGATTT 17580 

GAAGATTTTA TAGCTAAATT 17640 

GTAAAAGTTG GTCTTTTCGA 17700 

GCTAATCTAT TTGAATTTGT 17760 

TGGCAGGAAT CGGAAGATTG 17820 

GGGATAGGTG TCAGCAAACA 17880 

ACCCCAATCG GAAATTTGTC 17940 

ATAAAAGTGA TTCGTACCAA 18000 

AAGAAAAAAT TGGATGTCAC 18060 

AAAGAGGGAG CCTTCTACTA 18120 

ATGATTGCAC AAGAAATAAG 18180 

CATGAATCGG ATCAAGAAAT 18240 

ATCATCCGGT ATGAAGAGGA 18300 

TCCAATGAAT TAGAGGAGAA 18360 

TACGGAAAAT AGAAGAATTT 18420 

AAAAGGAGCA TAACCAATAT 3.8480 

GGTATGAACG CTGCCATCCG 18540 

TTTGGTATCT ATGACGGATA 18600 

GCTTCAGTAG GGGACATCAT 18660 

GAGTTCGCTC AACTTGAAGG 18720 

GAAGGTGTAG TTGTTATCGG 18780 

CATGGCTTCC CAGCTATTGG 18840 
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TCTTCCAGGT ACAATCGATA ACGATATCGT TGGTACTGAC TTTACAATCG GTTTTGACAC 18900 

AGCGGTTACT ACTGCCATGG ACGCTATCGA TAAGATTCGT GATACATCAT CAAGTCACCG 18960 

TCGTACTTTT GTAATCGAAG TTATGGGACG TAACGCTGGT GATATCGCTC TTTGGGCTGG 19020 

TATTGCAACT GGTGCTGATG AAATCATCAT . CCCTGAAGCA GGCTTCAAGA TGGAAGATAT 19080 

CGTAGCAAGC ATCAAAGCTG GTTATGAATG TGGTAAAAAA CACAATATTA TCGTCTTAGC 19140 

TGAAGGTGTG ATGTCAGCGG CTGAATTTGG TCAAAAACTT AAAGAAGCTG GAGATACAAG 19200 

CGACCTTCGT GTAACAGAAC TTGGACATAT TCAACGTGGT GGTTCTCCAA CTGCGCGTGA 19260 

CCGTGTTTTG GCGTCACGTA TGGGTGCACA TGCTGTTAAA CTTCTTAAAG AAGGTATCGG 19320 

TGGTGTTGCG GTTGGTATTC GTAACGAAAA AATGGTTGAA AATCCAATTC TTGGTACTGC 19380 

AGAAGAAGGG GCATTGTTTA GCCTTACTGC AGAAGGTAAG ATTGTGGTTA ACAACCCAGC 19440 

TACAAA 19446 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 16593 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 52: 



TCGTAAATAT 


GCTCTGTTTT 


TGGATTTTGT 


TTCTTAATCT 


GTTTGGCAAG 


TGCCTTCATC 


60 


ATAGAAATAG 


GACCACACAT 


ATAGACGGTT 


GCATGTTCGG 


GCACTTCTTT 


TTGTTCAAAA 


120 


TTAAGATAGC 


CGTCTTTCGT 


ACTGTCGATT 


AGATGGAGTT 


CAAAATTAGG 


ATTTTTCTGA 


180 


GCATAGTTAC 


GGAGTAAATC 


TAGGTAGACT 


GCATTTTCAT 


CTCCACGGAA 


GCTATAGTAG 


240 


AAGTGAACCT 


GTTTATCTAA 


AATAGGATGT 


TCACGGATGT 


AAGAGATGAA 


GGGGGTGATC 


300 


CCAATACCTC 


CAGCAATCCA 


AACCTGATTT 


TCTCGTCCTT 


CTTCTATGAT 


CATGTGTCCG 


360 


TAAGCTCTGT 


CTAGGGTTAC 


TTTGCTGCCG 


GCTTGAAGAT 


TATCATAGAT 


ATTCTTGGTA 


420 


TGGTCGCCTG 


AAGTTTTAAC 


AGTAAAGTAA 


AGAGTTTGAC 


CATGACCTCC 


TGAGATAGAA 


480 


AAGGGATGCG 


GAGCACTTTC 


AAAGCCTTCT 


TGGAAAATCT 


TTAGAAAGGC 


AAATTGTCCT 


540 


GATTGATAGT 


TGAAAGGTCT 


GCTAAGATGG 


ATTTGAATTT 


CTCTAGTATC 


GTGATTTAAG 


600 


CGTTTGAGAT 


GGGTAATTTT 


CCCTAGATAG 


GGGAAGGAAA 


TCTTTTGATA 


TAGAAAAATG 


660 


ATATAAAAAC 


CAGCTAGTAA 


GCCTAAAAGG 


GCATAGCTAC 


CAACAAGAAA 


ACTTAGAAGA 


720 


TTAAATGTAA 


GGAGACGATT 


GCCCATTATC 


ATGTAGATGT 


GAAAGAGTCC 


TAAAATATAG 


780 
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GCTAGGTAAA 


CCAGGCGGTG 


AATCCATCGC 


CAAGCTTCGT 


ATTGGATGTA 


TTTGCCTAAA 


840 


TAGGCGACAA 


GGATGATGCT 


GGCAAAGATA 


TAGATGGCAA 


GATTGCCAAA 


CTGAGCAGCT 


900 


AAGCGAGAGC 


CCCACAAACC 


GCCCATACTA 


AAGTTATGAA 


AGATTAGTAG 


GATGATTGAG 


960 


AGAAAGGCTG 


TGAATTTGTG 


GACGGTGTAG 


ACCTTCTCCA 


AACTGTGAAA 


CCAGCTTTCT 


1020 


AGTAGTGGGA GACGAGTGGC TAGGATAAAA 


GTCAGAGATA 


GGCTTGTTAA 


AGCTAGTCCT 


1080 


GGAATCATGA 


ATTGGGGAGA AGTGTTCATC 


CAAGTCAAAA 


GAGTCAAGAT 


AAAACTAGCT 


1140 


ATGATAAAGA 


GTAGTCCTTT 


GACTGATTTC 


ATAGAAAATT 


CCATTTCATT 


TAGATTTCGA 


1200 


TTTGTTGTAA 


ATAAATTTGT 


TACATTTTAT 


CATAGAAAAT 


GTATGGTGTC 


AAATTGAGGT 


1260 


CTATAAATAT 


CTACTCTCAT 


CAAAAAACTC 


TCCAATTGAA 


CTGGAGAGTG 


GCTGTTTATA 


1320 


CTCAATGAAA 


ATCAAAGAGC 


AAACTAGGAA 


GCTAGCCGCA 


AGTTGCTCAA 


AACACTGTTT 


1380 


TGAGGTTGCA 


GATAGAGCTG 


ACGTGGTTTG 


AAGAGATTTT 


CGAAGAGTGT 


TATTCTGCAG 


1440 


CTTGTTGCCA 


ACGTTTGGCT 


AGCATATGAG 


ACAGGCTAGA 


AATTGCTAGG 


TTAAAGCTGA 


1500 


AGTAGATGAG 


GGCAATCAGG 


ATGTAAAGAC 


TGAAGACCTG 


CTCTGGTTCG 


AAATAACGGC 


1560 


CCATGAGAAT 


TTGGCTGGCT 


CCAAAGAGTT 


CTTGTAGGGC 


GATAACAGAG 


TAGAGGAGAC 


1620 


TGGTATCCTT 


AATCACGGTA 


ACAAACTGAG 


AAATGATGGC 


TGGTAGCATT 


TTGCGGATGG 


1680 


CTTGTGGGAG 


AATGATGTAG 


TAGAGGATTT 


GGGCTGAGGT 


GAAGCCTTGT 


GACATTCCTG 


1740 


CTTCGTACTG 


TCCCTTGTCT 


ACGGCATTGA 


GACCGCCTCG 


AATAATCTCA 


GCCAAGGCTG 


1800 


CTGATGTAAA 


GAGAGTAAAG 


GCTGTAATAC 


CTGCTGGTGT 


GGATTTCATT 


TTGAACACCA 


1860 


AAAAGATAGT 


AAAAATCCAG 


AGAAGGTTGG 


GAACGTTGCG 


CACAAACTCG 


ATATAAATAC 


1920 


TGGAAATAAT 


GCGTAAGACA 


GGATTTTTGC 


CATTTCTCGT 


GACAGCTAGC 


ACCGTACCGA 


1980 


TGATAGTAGA 


GAGGATGATG 


GCAATCAGAG 


AAATATAGAG 


GGTCAAGCCA 


AATCCTTTAA 


2040 


AGATAAAGAC 


TAGGTTATCT 


GGGGTTAAAA 


CTTCTAAAAT 


AGATTCCATA 


GTAACCTCCT 


2100 


AAAGTGAATA 


GGCTTTTTTG 


TTGGCTTGCT 


CCATCTTGCG 


ACCAAACTGG 


GCAACAGGGA 


2160 


AGCATAGAGC 


AAAGTAGAGA 


AGAGCAGCAC 


CTAAAAAGGC 


TGGTATATAG 


TTTCCGTTGA 


2220 


GAGCCGACCA AGACTTAGTC 


ACAAACATCA 


AGTCTACTCC 


AGAGATGATA 


GCTACAGTAG 


2280 


AGGTGTTCTT 


GATGAGGTTA 


ACAATTTGGT 


TGGTCAATGG 


AGGGAGAATG 


ATGCGGAAGG 


2340 


CCTGAGGCAA 


GATAATCAAG 


CGCATGGCAC 


TGATATAGGT 


AAAACCTTGC 


GACAAGGCGG 


2400 


CCTCCATCTG 


ACCACTAGGA 


ATAGACTGAA 


TCCCTGAACG 


AATAACCTCA 


GCGATATAAG 


2460 


CGCCGTGATA 


GAGTCCCACG 


CAGAGAACGG 


CTGTCCAATA 


AATTGGAATC 


ATGATGATAT 


2520 
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GGTCACTGAT AAGAGGTAGG CCATAAAAAA CAATAACAAA CTGCACCAAG AGGGGAGTAT 2580 

TTTGGTAAAA TTCAACAAAG ATGCGAGCTA AAATGCGTAA AATTGGACGT TTACTGGTTG 2640 

ACATGGCACC AAAGAAGATG CCCAAAACCA TAGCGAGGAT AAAGGAACCA ACCGCTAGGG 2700 

CAAGGGTGAA GAGGAAACCA TTGAAAAATT GTCCAAAATC CTGAAAATAG GCTGTCCAAG 2760 

ATGATAAATC TGTCATGGGG TGTCCTCCTT AATCTGCAGT ATGGCTAGAT GGTTTGAGCT 2820 

TGTAACGGTC ATAAAGTTTC TGCAAACTAC CATCCTTGCT CCATTTAGTA ACCAAGTTAT 2880 

CAAGATAGTC GTTGAGCTCT GTATTTGATT TCTTGGTAAC AATACCGTAG TCAGATGGCT 2940 

TGAAACTATC ATCTAGTAGT GCTGTCCGTT TACTAGTGTA GCCAGATAGA ATAGAGCGGT 3000 

CAACGGAAAA GGTATCGATA CGATGAGCGT GCAGGGAAGT AATCAATTCT GGGTAGGAAC 3060 

CAAGTTCGAC GAATTTAAAC TTCAGACCTT TCTTTTTACC CAGTTCAGTA ATCAGGCGTT 3120 

GGGTGATAGA ACCTTGGGCG ACTCCGATGG TTTTGCCGTT TAGGTCCTCA ATCTTTTTGA 3180 

TTTTGGCAGA TTTATTGACC AAAAATCCAG AAGCGTCTGT GTAGTAGGGA CTGGTAAAGT 3240 

TGTAGAGTTT TTTGCGTTCG TCCGTGATGG TAAAGGTCGC GATATCCATA TCGACCTGTT 3300 

CATTGTCTAG AAGGGGGCCG CGGGTTTGTG CTGTAACCGG CACATAGCGA ATCTTGACCT 3360 

TGAGTTCATC AGCTACCATC TTGGCCAAGT CGGTTTCGAT ACCAGAATAA GTACCGGTCT 3420 

TGGGATCTTT GTAACCAAAA TTGGGAACGT CTTGTTTGAC ACCGACAACC AGTTCGCCTC 3480 

TTTTTTGAAT GTCTGCGATA CTTGTATCAG CCTGGACTGG TTTGGCAGCA GCAAGGCCGA 3540 

AAAGGCTAAT CAATAATGCT GATAAAAAGA ATTTTTTTTC ATAGGCGCCT CCTTATTTGA 3600 

CTTTGTCACT TTCGTGGTTG ATAATTTTGC TGAGGAATTG TTGGGCACGA GGTTCGCTTG 3660 

GATTGTCAAA AAAGTTATCG ACATCTGTCG TATCTACTAA AACTTCTCCG TCGGCCATAA 3720 

AGATAATGCG GTCCGCAACC TCTCGAGCAA AGCCCATTTC GTGGGTAACG ATGATCATGT 3780 

TCATCCCATC ATGCGCCAGT TTCTGCATAA CTGCTAGAAC ATCTCCGATA GTCTCAGGAT 3840 

CAAGAGCAGA TGTTGGTTCA TCAAAGAGGA GGAGTTCCGG ATGCATAGCA AGACCACGAG 3900 

CGATGGCGAT CCGCTGTTTT TGTCCACCAG ATAGCATGGC GGGATAGGAA TCTTTCTTGT 3960 

CCCACATATT TACAAATTCC AGATATTTTT GGGCGGTTTT TTCAGCTTCT TTTTTATCAA 4020 

TTCCTAGAAC TTCAATGGGT GCAAGCGTTA CGTTTTCTAA CACAGCTTTG TGTGGATAAA 4080 

GGTTAAAATG TTGAAAAACC ATGCCGACTT CCTTGCGAAG AGGTACCAAA TCTTTCTGGC 4140 

TGGCACCAGC AACTTGGTGC CCATTGACTA GGAGACTTCC TTTGTCAACA GTCTCTAAAC 4200 

CATTGATCGT ACGGATAAGA GTGGACTTCC CAGAGCCAGA AGGTCCAAGC AGGACAACAA 4260 

CTTGTCCTTT TTCAAAACGG AGATTGATGT TGCGGAATGC GTGGTAGTCT CCGTAATATT 4320 
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TTTCGACGTT TTTAAATTCT ACTAAAGCCA TGAGAGATCT 
ACACGGTTCT ACAATAAAAG AATGTTCTTG TCAAATCATA 
TGAAATAAGA ACAGGAAAAA TCGATCGGGA CAGTCAAATC 
AAGTAGAGGT GTACTATTCT AGTTTCAATA TACTATAAAA 
GATAGAGAAA ACGTCTAAAT CATGTTATAA TGAAGCAATA 
TGTCTTTTTG ATAACACCTA CTTATGAATG GCAGTTTGCC 
TTTTACAAAG ATAGCCAAGA AGGCTGGACT GGGTCCTGAG 
GAGAGGGATT CAGAACCAAG AAAGTCTGAA GAAGTTTTTA 
ACATGATGCT TATCTGCTCC ATGATATGGA CAAGGCAGTG 
TGAAGAAGGG GAAAATATTC TTGTTTATGG AGACTATGAT 
TTCTATTGTG AAGGAAAGTT TGGAACAACT TGGTGCTGAG 
TCGTTTTACC GATGGCTATG GCCCTAATGC TAGTGTTTAT 
AGGGATTTCC TTGATTGTGA CGGTGGACAA TGGGGTTGCT 
GGCTCAGTCT ATGGGAGTAG ATGTCATTGT GACAGACCAT 
GCCAGATGCT TATGCTATTG TCCATCCTGA ACATCCAGAT 
TTTGGCTGGT TGTGGAGTTG CTTTCAAGTT GGCTTGTGCC 
GGAATTGCTT GATTTGGTCG CTATTGGAAC TATTGCAGAT 
AAATCGTATC TTAGTTCAAT ATGGTCTGGA AATGTTGGGT 
GCAAGAAATG CTGGACATGG CTGGGATTGC TGCCAACGAA 
TTTCCAGATT GCTCCTCGTT TGAATGCCTT GGGTCGCTTG 
TGATTTGTTG ACTGGATTTG ATGATGAGGA AGCGCATGAG 
GAAAAACGAA GAGCGCAAGG AAATCGTTCA GTCTATCTAT 
GGATCCTGAG AAGAAGGTTC AGGTCTTGGC CAAGGAAGGC 
AATCGTGGCT GGTCGTTTAT TGGAAGAATT GGGACAGACA 
AGACGGTCGT GCCAAGGGCA GTGCTCGTAG TGTGGAAGCG 
GGATCCCCAT CGAGACCTCT TCATCGCCTT TGGAGGTCAT 
GCTGGAAGTT GAGCAACTCT CAGATTTATC TCAGGTTTTG 
AGGTGCAGAT GCTGGTGGCA AGAATAAGTT AAACCTAGAT 
ACTTAGCTTG GAAACGGTCA AAAGTTTTGA ACGTTTAGCT 



CTATTGTGTT ATATTTTATA 4380 

TCTGAAAAAA TTCACTATAG 4440 

GATTTCTAAC AATATTTTAG 4500 

TGTTATAAAA AAGCAATCTG 4560 

GAATTCTTAG AAAGAGTGGA 4620 

CTGCAGGTAG AAGATGCGGA 4680 

GTGGCTCGGT TATTGTTTGA 4740 

GAACCTTCCT TGGAGGACTT 4800 

GAGCGGATTC GTCAGGCTAT 4860 

GCGGATGGCA TGACTTCGGC 4 920 

TGCCGAGTTT ACCTGCCAAA 4980 

AAATACTTTA TCGAGCAAGA 5040 

GGTCATGAGG CTATTGCATT 5100 

CATTCCATGC CTGAAACCCT 5160 

GCGGATTATC CTTTTAAATA 5220 

CTGTTAGAAG AAGTGCAAGT 5280 

ATGGTGAGTC TGACGGATGA 5340 

CATACCCAGC GCATTGGTCT 5400 

GTAACAGAAG AAACGGTTGG 5460 

GATGATCCCA ATCCTGCCAT 5520 

ATTGCCCTTA TGATTCACCA 5580 

GAAGAAGCCA AGACCATCGT 5640 

TGGAATCCTG GGGTTCTAGG 5700 

GTCATTGTTC TTAATATAGA 5760 

GTCGATATTT TTGAAGCTCT 5820 

GCAGGTGCAG CGGGTATGAC 5880 

GAAGATTATG TTCGTGAAAA 5940 

GAAGAGTTGG ATTTGGAGGC 6000 

CCTTTTGGAA TGGATAATCA 6060 
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GAAACCTATT TTTTATATCA AGAATTTTCA GGTCGAAAGT GCTCGTACTA TGGGGGCAGG 6120 

TAATGCCCAT CTAAAGCTGA AAATTTCCAA GGGTGAGGCG AGTTTTGAAG TGGTAGCCTT 6180 

TGGTCAAGGC AGATGGGCGA CAGAGTTTTC TCAAACCAAG AATCTAGAGT TAGCGGTTAA 6240 

ATTGTCTGTC AACCAATGGA ATGGCCAAAC TGCCCTCCAG TTGATGATGG TGGATGCGCG 6300 

AGTGGAAGGT GTTCAACTTT TTAACATTCG TGGAAAAAAT GCAGTCTTGC CAGAAGGTGT 6360 

TCCAGTCTTG GATTTTCCTG GAGAACTGCC AAATCTTGCG GCTAGTGAAG CTGTTGTCGT 6420 

AAAAAACATT CCAGAGGATA TTACTCAGCT GAAGACCATT TTTCAGGAAC AGCATTTCTC 6480 

TGCTGTCTAT TTCAAAAATG ATATTGACAA GGCTTATTAT CTGACAGGTT ATGGGACTAG 6540 

AGATCAGTTT GCCAAATTGT ACAAGACTAT TTACCAGTTC CCAGAGTTTG ATATTCGCTA 6600 

CAAGCTGAAA GATTTGGCTG CATATCTTAA TATTCAACAA ATCTTGCTGG TCAAGATGAT 6660 

TCAAGTATTT GAAGAACTAG GCTTTGTGAC GATAAAAGAT GGTGTGATGA CAGTCAATAA 6720 

AGAGGCGCCA AAGCGGGAGA TAGGAGAAAG TCAAATTTAC CAAAATCTCA AACAAACCGT 6780 

TAAAGACCAA GAAATGATGG CGCTGGGTAC GGTGCAAGAA ATTTATGATT TTTTGATGGA 6840 

AAAAGAGTAG AAGTTAGGAA AGAGTTGGGA AATCAACTCT TTTTTGAAAA CAGACCTTCA 6900 

TTTTGAAAAT CATCAAAAAA ATGGTATAAT GGTAGGAAAA GATTCGGCTG AAAGTATCAG 6960 

AACTTTTAGA ATAAGAGGGT AGAATTGCCC TATAATCAAG ATAAACTAAG ATTTTGGAGG 7020 

AAAAATGAGT AATATCAGTT TAACAACACT TGGTGGTGTG CGTGAGAATG GAAAAAATAT 7080 

GTACATTGCT GAAATTGGAG AGTCCATTTT TGTTTTGAAT GTAGGGTTAA AATATCCTGA 7140 

AAATGAACAA TTAGGGGTCG ATGTGGTGAT TCCAAACATG GATT ACCTTT TTGAAAATAG 7200 

CGACCGTATT GCTGGGGTTT TCTTGACCCA CGGGCATGCG GATGCCATTG GTGCTCTACC 7260 

GTATCTCTTG GCAGAGGCTA AAGTTCCTGT ATTTGGGTCT GAGTTGACCA TTGAGTTGGC 7320 

AAAGCTCTTT GTCAAAGGAA ATGATGCCGT TAAGAAATTT AATGATTTCC ATGTCATTGA 7380 

TGAGAATACG GAGATTGATT TTGGTGGGAC AGTGGTTTCC TTCTTCCCTA CGACTTACTC 7440 

CGTTCCAGAG AGTCTGGGAA TTGTCTTGAA GACATCGGAA GGAAGCATCG TTTATACAGG 7500 

TGACTTCAAA TTTGACCAAA CGGCTAGTGA ATCTTATGCA ACTGATTTTG CTCGTTTGGC 7560 

AGAGATTGGT CGTGACGGCG TCCTGGCTCT CCTCAGTGAT TCGGCCAATG CAGACAGCAA 7620 

TATTCAGGTG GCTAGTGAAA GTGAAGTTAG GGATGAAATT ACCCAAACTA TTGCTGACTG 7680 

GGAAGGTCGT ATCATCGTTG CAGCTGTTTC CAGTAATCTT TCTCGTATTC AGCAGATTTT 7740 

TGACGCTGCG GATAAAACAG GTCGACGTAT CGTCTTGACA GGATTTGATA TTGAAAATAT 7800 

CGTCCGCACA GCGATTCGTC TTAAGAAGTT GTCTTTAGCC AACGAAATTC TTTTGATTAA 7860 
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GCCTAAAGAT ATGTCTCGCT TTGAAGACCA TGAGTTGATT ATTCTTGAGA CAGGTCGTAT 7920 

GGGTGAGCCT ATCAATGGAC TTCGTAAGAT GTCGATTGGT CGCCATCGTT ATGTAGAAAT 7980 

CAAGGATGGG GACCTAGTCT ATATTGCTAC GGCTCCGTCT ATTGCTAAAG AAGCCTTTGT 8040 

TGCGCGTGTG GAAAATATGA TTTATCAGGC AGGTGGGGTT GTCAAATTGA TTACCCAAAG 8100 

TTTACATGTA TCAGGGCACG GAAATGTGCG TGATTTGCAG CTGATGATCA ATCTTTTGCA 8160 

ACCTAAGTAC CTCTTCCCTG TCCAAGGGGA GTATCGTGAG TTGGATGCTC ACGCTAAGGC 8220 

TGCCATGGCA GTTGGGATGT TGCCAGAACG CATCTTCATT CCTAAAAAGG GGACGACCAT 8280 

GGCTTACGAG AATGGAGACT TTGTTCCAGC TGGATCGGTT TCAGCAGGAG ATATCTTGAT 8340 

TGATGGGAAT GCCATTGGTG ATGTTGGAAA TGTTGTTCTT CGTGACCGTA AGGTCTTGTC 8400 

AGAGGATGGA ATTTTCATCG TGGCTATTAC AGTCAACCGT CGTGAGAAGA AAATTGTGGC 8460 

TAGGGCTCGT GTTCACACGC GTGGATTTGT TTATCTCAAG AAGAGTCGCG ATATTCTCCG 8520 

TGAAAGTTCA GAATTGATTA ACCAAACGGT AGAAGAGTAT CTTCAAGGAG ATGACTTTGA 8580 

CTGGGCAGAT CTCAAAGGTA AGGTTCGTGA CAATCTGACC AAGTACCTCT TTGATCAAAC 8640 

CAAGCGTCGC CCAGCCATTT TACCAGTAGT CATGGAAGCA AAATAATCGT TGAAATAAAC 8700 

AGAGAGAAAG TCGAGTTTCG GCTTTTTCTT ATAGAAAAAT AGAAGGAGAA AATCATGGCA 8760 

GTGATGAAAA TCGAGTATTA CTCACAAGTA TTGGATATGG AGTGGGGGGT GAATGTCCTC 8820 

TACCCTGATG CCAATCGAGT GGAAGAACCA GAGTGTGAAG ATATTCCCGT CTTGTACCTT 8880 

TTGCACGGGA TGTCTGGAAA TCATAATAGT TGGCTTAAGC GGACCAATGT AGAACGCTTG 8940 

CTTCGAGGAA CTAATCTCAT CGTTGTTATG CCCAATACCA GCAATGGTTG -GTACACCGAT 9000 

ACCCAGTATG GTTTTGACTA CTACACGGCT CTAGCAGAGG AATTGCCACA GGTTCTGAAA 9060 

CGCTTCTTCC CTAATATGAC GAGCAAGCGT GAAAAGACCT TTATCGCTGG TCTTTCTATG 9120 
GGAGGCTACG GCTGCTTCAA ACTGGCTCTT ACGACAAATC GTTTTTCTCA TGCAGCTAGT . 9180 

TTTTCAGGTG CCCTCAGCTT TCAAAACTTT TCTCCTGAAA GTCAAAATCT GGGAAGTCCA 9240 

GCCTACTGGA GAGGTGTTTT TGGAGAGATT AGAGACTGGA CAACTAGTCC CTATTCTCTT 9300 

GAAAGTCTGG CTAAAAAATC GGATAAAAAG ACCAAACTTT GGGCGTGGTG TGGCGAACAG 9360 

GATTTCTTGT ACGAAGCCAA TAATCTCGCA GTGAAAAATC TCAAAAAACT AGGTTTTGAT 9420 

GTGACCTATA GCCATAGCGC TGGAACTCAC GAGTGGTACT ACTGGGAAAA ACAATTGGAA 9480 

GTTTTTTTAA CAACCCTACC AATTGATTTC AAATTAGAAG AGAGACTGAC TTAGTTTGAA 9540 

CTTCAGCATA GGGGGAGTAG AACTAAAATA AAATATGTTT TCACTAGACT TTTCAAACGm 9600 
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AAGTAGTAGA ATAGTAATAA AATACTGGAG GAAAGAGAGT AGGAAATGTA CCGTTATCAA 9660 

ATTGGCATTC CCACATTAGA ATATGATCAG TTTGTCAAAG AACATGAATT AGCCAATGTA 9720 

TTACAAAGTA GTGCTTGGGA GGAAGTTAAG TCTAATTGGC AACATGAGAA GTTTGGTGTT 9780 

TACAGGGAAG AAAAATTACT GGCGACAGCT AGTATTTTGA TTAGAACTCT TCCGCTAGGC 9840 

TATAAAATGT TTTACATCCC AAGAGGACCT ATATTGGATT ATGGGGATAA AGAACTCTTG 9900 

AATTTTGCCA TTCAGTCTAT TAAGTCCTAT GCTCGCAGTA AGAGAGCGGT TTTTGTGACT 9960 

TTTGACCCAA GTATTTGCCT ATCTCAAAGT TTAATCAATC AGGAAAAGAC AGAATTTCCT 10020 

GAAAATCTGG CTATTATTGA TAGTTTGCAA CAAATGGGAG TAAGGTGGTC AGGAAAAACG 10080 

GAGGAAATGG GAGACACCAT TCAACCTCGT ATTCAGGCGA AAATATACAA GGAAAATTTT 10140 

GAAGAAGATA AACTTTCCAA GTCAACAAAA CAGGCTATTC GAACAGCACG AAACAAAGGG 10200 

CTTGAGATTC AATATGGTGG ACTGGAACTA TTAGATTCAT TTTCGGAGTT GATGAAAAAA 10260 

ACTGAGAAGC GAAAAGAGAT TCATTTGAGG AATGAAGCCT ATTATAAAAA ATTGTTAGAT 10320 

AATTTTAAGG ACAAGGCCTA TATCACCTTG GCCACCTTGG ATGTTTCTAA ACGTTCGCAA 103 80 

GAGTTAGAAG AACAGTTAGC GAAAAATAGA GCCTTGGAAG AGACCTTTAC TGAGTCGACT 10440 

CGAACTTCAA AAGTAGAAGC GCAGAAGAAG GAAAAAGAAC GTTTGTTAGA GGAATTGACC 10500 

TTCTTGCAGG AATATATAGA TGTAGGTCAA GCGAGAGTTC CTTTAGCGGC TACTTTGAGT 10560 

TTGGAATTTG GTACTACCTC TGTCAATATA TATGCTGGTA TGGATGATGA TTTTAAACGT 10620 

TACAATGCAC CAATTTTAAC TTGGTATGAA ACGGCTCGCT ATGCCTTTGA ACGAGGTATG 10680 

ATCTGGCAAA ATTTAGGTGG TGTTGAAAAC TCTCTCAATG GTGGACTTTA TCATTTTAAG 10740 

GAAAAATTTA ATCCAACGAT TGAAGAATAC TTGGGTGAAT TTACAATGCC CACTCATCCT 10800 

CTCTATCCTC TGTTAAGACT TGCTCTTGAT TTCCGTAAAA CATTAAGAAA AAAACATAGA 10860 

AAGTAAGTAT ATGGCACTAA CAACACTCAC GAAAGAAGAG TTTCAGACTT ATTCTGATCA 10920 

GGTTTCTTCT CGTTCCTTTA TGCAATCTGT CCAGATGGGG GATTTGCTAG AAAAAAGAGG 10980 

GGCTCGAATT GTTTATCTTG CTTTGAAACA AGAAGGAGAA ATTCAAGTTG CAGCTCTGGT 11040 

TTATAGCCTG CCCATGCTGG GTGGTCTGCA TATGGAACTC AATTCGGGGC CGATTTATAC 11100 

CCAACAAGAT GCTCTTCCAG TTTTTTATGC AGAGTTAAAA GAATATGCCA AGCAAAATGG 11160 

TGTATTAGAG TTGCTTGTAA AACCCTATGA AACTTATCAA ACTTTTGATA GCCAAGGTAA 11220 

TCCAATAGAT GCTGAGAAAA AAAGTATTAT TCAAGATTTG ACTGATTTAG GTTATCAATT 11280 

TGATGGCTTA ACAACAGGTT ACCCAGGTGG AGAACCAGAT TGGTTATACT ATAAAGATTT 11340 

AACTGAATTA ACTGAAAAGA GTTTGCTTAA AAGTTTTAGC AAAAAGGGTA AACCCTTGGT 11400 
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GAAAAAGGCT 


GAAACCTTTG 


GCATTCGGTT 


GAAAAAGTTA 


AAACGTGAAG 


AACTATCGAT 


11460 


TTTTAAGAAT 


ATAACAAAAG 


AAACCTCTGA 


ACGTAGAGAA 


TATAGTGATA 


AAAGTTTAGA 


11520 


ATATTATGAG 


CATTTTTATG 


ATACTTTTGG 


AGAACAAGCG 


GAGTTTCTCA 


TAGCAAGCTT 


11580 


AAATTTTTCG 


GACTATATGA 


GCAAATTGCA 


AGGTGAACAA 


AGTAAACTAG 


AAGAAAACTT 


11640 


GGACAAGTTG 


CGACTTGATT 


TGAGTAAAAA 


TCCTCATTCT 


GAGAAAAAAC 


AAAATCAACT 


11700 


GAGAGAATAT 


TCTAGTCAAT 


TTGAAACGTT 


TGAAGTTCGA 


AAAGCAGAAG 


CGCGAGACTT 


11760 


GATTGAAAAA 


TATGGAGAAG 


AAGATATTGT 


TTTAGCTGGG 


AGTTTATTTG 


TTTATATGCC 


11820 


TCAGGAAACG 


ACTTATCTCT 


TTAGTGGTTC 


CTACACTGAG 


TTTAATAAGT 


TCTATGCCCC 


11880 


TGCACTGCTT 


CAAAAATATG 


TTATGTTGGA 


AAGCATAAAA 


CGTGGAATAC 


CTAAATACAA 


11940 


CTTCCTAGGC 


ATTCAAGGGA 


TTTTTGATGG 


AAGTGATGGT 


GTTTTGCGTT 


TTAAACAGAA 


12000 


TTTTAATGGC 


TATATTGTAC 


GCAAAGCAGG 


TACTTTCCGT 


TACCATCCAT 


CGCCTTTAAA 


12060 


ATACAAAGCT 


ATCCAGTTAC 


TCAAAAAAAT 


AGTAGGACGT 


TAAGATGAAA 


AAGTCAGTAT 


12120 


TTAGATTTCT 


TTTAGCTTCT 


TTTAGTAAAA 


TAATTCTTAT 


TTGCTAGAAA 


GGTGGAGAGA 


12180 


CATGCGCTGG 


CTTTTTCGTT 


TGATAGGGGC 


TTTCTTTTCT 


TTTGTGTGGC 


GTTTGTTTTG 


12240 


GCGTCTGGTT 


TGGATAGTTG 


TGCTCTTATG 


TGTGCTTGCT 


TTCGGACTTC 


TCTGGTATCT 


12300 


GAACGGAGAT 


TTTCAAGGAG CGCTAAAGCA AGCAGAACGG TCAGTAAAAA TTGGTCAACA 


12360 


AAGTATTGAC 


CAATGGGAGA 


AAACAGGGCA 


ACTGCCTAAG 


TTAAGCCAGA 


CAGATAGTCA 


12420 


CCAGCATTCT 


GAAGGAAGGT 


GGGCACAGGC 


CTCTGCTCGT 


ATTTACCTGG 


ATCCGCAGAT 


12480 


GGATTCACGC 


TTTCAAGAGG 


CTTATTTAGA 


AGCAATCCAG 


AACTGGAATC 


AAACTGGTGC 


12540 


TTTTAACTTT 


GAACTCGTGA 


CTGAGTCTAG 


TAAGGCGGAT 


ATTACGGCTA 


CGGAGATGAA 


12600 


CGACGGAGGC 


ACTCCTGTGG 


CAGGAGAGGC 


GGAAAGTCAA 


ACTAATCTCT 


TAACAGGGCA 


12660 


ATTCTTGTCC 


GTAACGGTGC 


GGTTGAATCA TTATTATTTG 


TCCAATCCAT 


ACTATGGCTA 


12720 


CTCCTATGAA 


CGCCTTGTCC 


ATACGGCAGA 


ACATGAGTTA 


GGTCATGCGA 


TTGGCTTGGA 


12780 


CCATACAGAT 


GAGAAGTCTG 


TCATGCAACC 


AGCAGGTTCC 


TTTTATGGTA 


TCCAGGAAGA 


12840 


GGATGTTGCA 


AACCTCCGAA 


AAATATATGA 


GACTAGTGAG 


TAGGGTACTA 


TCTTTCCCTA 


12900 


CTTTTTTTGC 


TATAATGGAA 


CTATGAACAA 


CTTGATTAAA 


TCAAAACTAG 


AGCTCTTGCC 


12960 


GACCAGCCCT 


GGTTGCTACA 


TTCATAAGGA 


TAAAAATGGC 


ACCATTATCT 


ATGTAGGAAA 


13020 


GGCTAAAAAT 


CTGCGTAATC 


GAGTACGGTC 


CTATTTTCGT 


GGAAGTCATG 


ATACCAAGAC 


13080 


AGAGGCTCTG 


GTGTCTGAAA 


TTGTGGATTT 


TGAATTTATT 


GTTACGGAGT 


CTAATATTGA 


13140 
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GGCACTTCTC CTAGAAATCA ACCTGATCAA GGAAAACAAG CCCAAGTACA ATATCATGCT 13200 

CAAGGATGAC AAGTCCTATC CTTTCATCAA AATCACCAAT GAGCGCTATC CACGCTTGAT 13260 

TATCACTCGT CAGGTCAAAA AGGACGGAGG TCTTTATTTT GGACCCTATC CCGATGTGGG 13320 

GGCAGCCAAT GAAATCAAGC GGTTGCTGGA TCGGATATTC CCTTTTCGTA AGTGTACCAA 13380 

CCCGCCCTCT AAGGTCTGTT TTTATTACCA TATCGGCCAG TGTATGGCCC ACACCATCTG 13440 

TAAGAAGGAT GAGGCTTATT TCAAGTCTAT GGCCCAGGAG GTGTCTGATT TTCTGAAAGG 13500 

TCAGGATGAC AAAATCATCG ATGATCTCAA GAGTAAAATG GCAGTAGCAG CACAAAGTAT 13560 

GGAGTTTGAA CGTGCGGCGG AATACCGTGA CCTGATTCAG GCTATTGGAA CGCTTCGAAC 13620 

CAAGCAACGG GTCATGGCGA AAGATTTGCA AAATCGCGAT GTCTTTGGCT ACTATGTGGA 13680 

TAAGGGCTGG ATGTGTGTGC AGGTTTTCTT TGTCCGTCAG GtAAGCTCAT CGAGCGCGAT 13740 

GTCAATCTCT TCCCCTACTT CAATGATCCA GATGAGGATT TTTTGACCTA TGTAGGACAA 1380O 

TTCTATCAAG AAAAATCTCA TCTAGTTCCC AATGAGGTAC TGATTCCGCA GATATTGACG 13860 

AAGAAGCTGT CAAGGCTTTG GTGGATTCCA AGATTCTTAA GCCTCAACGT GGAGAGAAAA 13920 

AACAACTGGT CAATCTAGCC ATAAAAAATG CTCGTGTTAG TCTAGAGCAG AAGTTCAATC 13980 

TGCTAGAAAA ATCTGTCGAA AAGACTCAAG GAGCTATTGA AAATCTAGGG CGTTTGCTCC 14040 

AAATCCCGAC CCCAGTACGT ATCGAGTCCT TCGATAACTC TAATATCATG GGAACTAGCC 14100 

CTGTTTCGGC TATGGTGGTC TTTGTCAACG GTAAACCGAG TAAGAAGGAT TACCGTAAGT 14160 

ACAAGATAAA AACGGTTGTT GGACCAGACG ACTATGCCAG CATGAGAGAG GTCATTCGCA 14220 

GACGCTATGG TCGAGTACAG CGTGAGGCTT TGACTCCTCC AGATTTGATT GTGATTGATG 14280 

GGGGGCAAGG TCAAGTCAAT ATCGCTAAGC AGGTTATCCA AGAGGAACTG GGCTTGGATA 14340 

TTCCAATTGC TGGGCTGCAA AAGAATGATA AGCACCAAAC CCATGAATTG CTCTTTGGAG 14400 

ATCCGCTTGA GGTGGTGGAT TTGTCTCGCA ATTCTCAGGA ATTTTTCCTC CTCCAACGCA 14460 

TCCAAGATGA GGTGCACCGC TTTGCTATCA CTTTCCACCG CCAACTGCGC TCCAAAAATT 14520 

CTTTCTCATC TCAATTGGAT GGGATTGACG GTCTGGGACC TAAACGCAAG CAGAATCTTA 14580 

TGAAGCATTT CAAGTCTTTG ACCAAAATCA AGGAAGCCAG TGTGGATGAG ATTGTCGAAG 14640 

TTGGGGTACC TAGAGTCGTT GCAGAGGCTG TGCAAAGAAA GTTGAACCCG CAGGGAGAAG 14700 

CCTTGCCTCA AGTAGCAGAA GAAAGAGTAG ATTACCAAAC GGAAGGAAAC CACAATGAAC 14760 

CATAAAATCG CAATTTTATC AGATGTTCAT GGCAATGCGA CGGCGCTAGA AGCAGTGATT 14820 

GCAGATGCTA AAAATCAAGG GGCCAGTGAA TATTGGCTTC TGGGAGATAT TTTTCTTCCT 14880 

GGTCCAGGCG CAAATGACTT AGTCGCCCTG CTAAAGGACC TTCCTATCAC AGCAAGTGTT 14 940 
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CGAGGCAATT GGGATGATCG TGTCCTTGAG GCTTTAGATG GGCAATATGG CTTAGAAGAC 15000 

CCACAGGAAG TTCAGCTCTT GCGTATGACA CAGTATTTGA TGGAGCGAAT GGATCCTGCA 15060 

ACGATTGTCT GGCTACGAAG CTTGCCTTTG CTGGAAAAGA AAGAAATTGA CGGATTGCGC 15120 

TTTTCTATCT CTCATAATTT ACCTGACAAA AACTATGGTG GTGACTTGCT AGTTGAGAAT 15180 

GATACAGAGA AATTTGACCA ACTGCTAGAT GCGGAAACGG ACGTGGCAGT TTATGGTCAT 15240 

GTTCACAAGC AGTTGCTTCG TTATGGAAGT CAAGGGCAAC AAATCATCAA TCCAGGGTCG 15300 

ATTGGCATGC CCTATTTTAA TTGGGAGGCG TTAAAAAATC ACCGTTCCCA GTATGCCGTG 15360 

ATAGAAGTTG AAGATGGGGA ATTACTCAAT ATCCAATTTC GTAAAGTTGC TTATGATTAC 15420 

GAAGCTGAGT TAGAATTGGC CAAGTCCAAG GGGCTTCCCT TTATCGAAAT GTATGAAGAA 15480 

CTGCGTCGTG ACGATAACTA TCAGGGGCAC AATCTGGAAT TATTAGCCAG CTTAATAGAA 15540 

AAGCATGGGT ATGTAGAGGA TGTGAAGAAT TTTTTTGATT TTTTGTAAGA GTTTCCTAAA 15600 

ATAGCCAATG CAAACTAAAA AAGCGATTTG CTGGTCCAAT CGCTTTTAGT ATATCTTATA 15660 

CTCAATGAAA ATCAAAGAGC AAACTAGGAA GCTAGCCGTA GGTTGCTCAA AGCACAGCTT 15720 

TGAGGTTGCA GATAAAGCTG ACGTGGTTTG AAGAGATTTT CGAAGAGTGT TATTGTAACT 15780 

GAGATTGATC TGGGAGGTAA GAACCACCTA GATAGGTATT GCTGAGTTTT TCAAGGGTTC 15840 

CGTCTTGATA GAGTTCTTTG AGCGCTTTAT CAAATTGCTC TTTAAACTCT TTTTGGTGGC 15900 

TTGAGAAAAT GATATAATTG CTGGGGCTAT CTGCAGAAGG TAAATCAACG ACTGAGAGGT 15960 

CTAAACCACG GTCCTTGATA ATCTTTTGAA CGGATACCTT GTCAAAAACT AGGAAATCAA 16020 

ACTCTCCGTT AGCAAGGTCT AGGATTCGTT TACCAATATC CTCACCAGAA AAATTAATTG 16080 

TAGCGGGATT ATCAGTGTGT TTCTGATTCC AGTTATTGAT GAATTGAGCG TTAGAAGTTC 16140 

CGGTATCCTC TTGTGTTGTT TTACCAGCGA TCTGGTCAAG AGAAGTCAAA GGATTTTTCT 16200 

TGTTGCTGAC AAGGACGAGG GGATTGTTGG AAATTGGAAG CGAGTAAAGG TATTTTTCAG 16260 

CACGCTCTTT TGTGTAACTC AAGTTATTGG CCGCAGCCTG ATAGTGACCA GAATCAAGTC 16320 

CTGGGAAGAT GCTCTCCCAG GCGGTTCTTT GGAATTGAAT CTCGTAGTCG CTGAGTTTTT 16380 

CATCTACTGC CTTTAAAACT TCGATATCAA AGCCTGTCAG ATTGCCCTTG TCTTCGTAGT 16440 

CAAATGGTGG CACGTCGCCA GCTGTAGCAA GGACGATTGT CTTTTGAGCG CTAGTCTCTT 16500 

TGGGTGTAGC TTGATTCTCA CAGGCAACCA AAAATGGTAG GATAGCTAGT AATAGGCTAA 16560 

ATTTTTTCAT ACTGTCTCCA TTCAAATGTA AAG 16593 
(2) INFORMATION FOR SEQ ID NO: 53: 



WO 98/18931 



PCT/US97/19588 



476 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3510 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

GGGATATCCT TATATCCTTG TTCCTGGAAC CATTGTGGGA ATTGCTCAAC AGTTTTTTCA 60 

CCTTGAATTC CTGGTGCAAT GACAGTAAGA ATTTCGAAAT CACGATCTGG TTTCGCCGCT 120 

AGTTCCATCA ACTCTGGCAT ACTTTTCTTG CATGGACCAC ACCATGAAGC CCAAAACTTC 180 

AAGTAAACCT TTTTACCCTT AAAATCAGAT AACTTAACTT CTTTGCCATC CATGGATTGC 240 

AATGTGAAGT CTGGAGCATC TTTTCCAACA GCAATTTGTT GTACAGTCGT TTGTTGTTTT 300 

GGCTGTTGTG CTGCTTGAGT CTTTTTAGTT TCTTCCTCAC CACAGGCCAT CAATACAACT 360 

AATGACAAGA GACTTAAGCC AGCAAACATT ACTTTTTTCA TTTGTCCTCC TTTATTCAAA 420 

AATTCCAGCT AGAACATTTA CTTGTCCTAA TAGTAACAAA ATTCCCATTA AAACAATGAG 480 

GAAACCACCA ATTTTCTTTA GTAGCATCAT ATGACGCTTG ATTTTACTAA AATATGGCAT 540 

GACTAGACCT GAAGCTAGTG CCAATACCAA GAAAGGAAGG GCCATGCCaG AGTGTAAATG 600 

AGAGTATAAA TCGCTCCTTG CCAAGCGCCA TTGCCTCCAG AAGCCGCAAG TGCTAAAACA 660 

GAACTTAAAA CTGGACCAAT ACAAGGTGTC CAACCAAAGC TAAAGGTAAT ACCAAGTAAA 720 

AAAGCTGACC AATAACGATT AGAATCTGAT TTTTTAAAGG TAAAACTTTT TTGAACTTCT 780 

AATTTCTTCA AATGAAAAAT TTCCATCTGG TGAAGACCCA AAATGATAAT AATAGCTCCC 840 

ATGCCATATC , GAAACCAATT TGCATAGAGA ATATGACCAA AGTAACCAGC ACCAAAGCCT 900 

AGAATAAAGA AAATGAGAGA GATACCAGCG ATAAAGCAAA GTGTTCGAAT CAAGCCTGAC 960 

CAGAGAACCT TTCTCCCAAA CAAAGAAAAG CTTTTTGCAC TTTCTTGATC ATCCAATAAA 1020 

ATCCCAGCAT AGACTGGCAG AAGAGGAAAA ATACAAGGAG AAAAAAAGGA TAAAACACCT 1080 
GCTAGAAAAA CAGAGATTAA AAATACTATC GTTTCCAATA AAGAACCAAC TTTCTTAATA . 1140 

ATTCTAATCC TATTTTACTA TATTCAATTT TATTTGTAAG CTTTCTGCTA CGCAAAATCG 1200 

TATCGGGCAC TATTGGACCA ATCTTTTCTT TTGCTAGTCA AGGCGGATCT TATCCCCCAA 1260 

AATAGCCAAA AAGCAACGAC AAGGATTACT CATCGCTGCT TTTGTGAACG AAAATGTCTT 1320 

TTAGGTCTGA CATTTCATAA ATCATGTTTT ACTTGAGTTT GTCAAGGATT GCTTTAAGCT 1380 

CCTCTACTAG TTTAGTTTCT GTCTCTGCTG AGCCATTTTC TTCTTTCACG AAATCAAGGG 1440 

TTTCTTGGAG AAGGTTTTGG GCTTTGGCAA GGACTTTTTT ATCCGCTTTT TCTGCATCTA 1500 
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GCTGTCCTAG AACCTTGATC AATTCCGTGC TTAATTGCTG GATTTCTGAC TCTTTCTTAC 1560 

GGCGAATCAG CCAGAAGGCA ATCACGCCTA GGAGGGCAAG TAGACTGACC ACAATCACTC 1620 

CTGCCGGAAC TGAGTTTGTT TCAGTCATCT TATCTGAATC CTTACTATCT TCCGTTCCTT 1680 

GTTTTGCATC CTTCTTGTCC TGTGCAGGCT TGCTGTCGCT AGCATTTGCT TTCACATCTT 1740 

TGAGAGAGTC CAAGGCAGCC CAGCCTTCAC AGACTCTACT GCAGTATGCA GACCTTACTC 1800 

TGTCAAGGCA CTATCTTCCG GAGCTTTTTG AGCATCTAGG AGGACAGCCT TGGTTGCATC 1860 

GATTTTCGGA TCAGATACTG TTGCCAAAGC TTTCAAGCGT TGGTCTAACT CTTGACTCAA 1920 

GGCACGAAGT TCAGACTTGT CAACTTGCTC TTGAGCTTGT GTGCTCGTTG AGCTAGCCGA 1980 

AGCGCTTGCT ACCACTCTAG GATCTTGAGT CGGAGCTGAG CTTGGAGCTG GGACAGGGCT 2040 

TGCAGGTTGA CTAGGAACAG TTATGGTATA TTGAAACTAG AATAGTACAT ATGGACTTCT 2100 

AAAACATTGT TAGAATTCGA TTTTACTGTC CTGATCGATT TGTCCTATTC TTATTTCATT 2160 

TTACTATAAT AACCGATGGT GTGGTTAATG TTGGTAAGAG AAACTTCTGA AACCAAGCTT 2220 

CAAAAAAGTC GCTCGTCATC GTCTCTTCGT AAGTCATTGG AGCGATTAAT TCACCATTTG 2280 

TTAGACCTGC AACCAAAGAA ATCCTCTGAT ATCTTCTTCC AGATACTTTG CCTCTTATTA 2340 

ACTGACCTTT TAATGAGCGA CCATATTCTC GATAAAAATA AGTATCGAAT CCTGTTTCGT 2400 

CAATCTAAAC AGGTGCTAGG TGCTTTAAAC TATTAAAATT CTTAAGAAAT AAGGCTACTT 2460 

TTTCTGGGTC TTGTTCATAG TAGGTGTGGT TCTTTTTTTC GAGTGTAGCC CATAGCTTTG 2520 

AGCGCATAGT GGATGGTAGT TGGATGACAG CCAAAkTCAG AAGCTATTTC AGTCAAATAA 2580 

GCrTCTGGAT TGTCAGTAAG ATAGTTTTTA AGTCTATCTC TATCAACTTT TCTTGGTTTT 2640 

GTTCCTTTTA CTTGGTGGTT TAGCTCTCCT GTTTTCTCTT TTAGCTTTAA CCAGCCATAA 2700 

ATGGTATTAC GTGAGATTTG GAAAACGTGT GATGCTTCTG TTATACTACC TATTCGCTCA 2760 

CAATAAGAGA GAACTTTTTT ACGAAAATCT ATTGAATATG CCATAAGAAG ATTATACCAC 2820 

ATTGTGTACT ATTTTTGGTT CATTTCACTA TAACACAAAA TAGATTATTA TTACATAACA 2880 

AAAAAGAGGT CTAAACCTCT TAACTCAATT ACTCCGCCAG TAGGACTCGA ACCTACGACA 2940 

TCATGATTAA CAGTCATGCG CTACTACCAA CTGAGCTATG GCGGATTAAA GCTAAGCGAC 3000 

TTCCCTATCT CACAGGGGGC AACCCCCAAC TACTTCCGGC GTTCTAGGGC TTAACTTCTG 3060 

TGTTCGGCAT GGGTACAGGT GTATCTCCTA GGCTATCGTC ACTTAACTCT GAGTAATACC 3120 

TACTCAAAAT TGAATATCTA TTCAATTTAA GAAAACCGTT CGCTTTCATA TTCTCAGTTA 3180 

CTTTGGATAA GTCCTCGAGC TATTAGTATT AGTCCGCTAC ATGTGTCGCC ACACTTCCAC 3240 
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TTCTAACCTA TCTACCTGAT CATCTCTCAG GGCTCTTACT GATATATAAT CATGGGAAAT 3300 

CTCATCTTGA GGTGGJcTtCA CACTTAGATG CTTTCAGCGT TTATCCCTTC CCTACATAGC 3360 

TACCCAGCGA TGCCTTTGGC AAGACAACTG GTACACCAGC GGTAAGTCCA CTCTGGTCCT 3420 

CTCGTACTAG GAGCAGATCC TCTCAAATTT CCTACGCCCG CGACGGATAG GGACCGAACT 3480 

GTCTCACGAC GTTCTGAACC CAGCTCGCGT 3510 
(2) INFORMATION FOR SEQ ID NO: 54: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20986 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 



CGGAGAAAAA 


CATGGCTAAG 


TCAAACTTTG 


AAAAAGTAGA 


ATCAGTTGTT 


GGCTGGGTTC 


60 


GTGATAAGAA 


AATCACAGGC 


TACCGTATCT 


CTAAAGAAAC 


GAATGCGCGT 


GAAATGTCTA 


120 


TCATTGCTCT 


GGCGCAGGGT 


CGTGCAAAAG 


TAAAAAATAT 


TTCATTTGAA 


ACAGCCCTAG 


180 


GCCTAATTGA 


TTTCTATGAA 


AAAAATTATG 


AAAAATTTGA 


AGATTAATCT 


TTGGATAACG 


240 


GCGGATTCTT 


GACCTTCAAG 


TAGTAGAGAT 


AGAGAATCTG 


CCTTTTCATT 


TTGAGGACAG 


300 


CAAAAAGACT 


GCACGGTTGA 


TGCAGCCTTT 


TCTTTTTATT 


TGAGATAGCG 


TTGAAGGAAC 


360 


TCTTTTGTTC 


GGTCTTCTTT 


AGGATTGGTG 


AAGAGGTCTT 


CTGGTTTACC 


TTCTTCAGCG 


420 


ATCACGCCCT 


TATCCATAAA 


GATAACACGG 


TGAGAGACAT 


CACGGGCAAA 


TTCCATTTCA 


480 


TGGGTTACGA 


CAATCATGGT 


CAAGCCTTCC 


TGAGCCAGGT 


CCTGCATGAT 


TTTGAGGACT 


540 


TCTCCAACCA 


TTTCTGGATC 


GAGAGCTGAT 


GTTGGTTCAT 


CAAAGAGAAT 


AGCGTCCGGA 


600 


TTCATGGAGA 


GGGCACGAGC 


GATGGCCACA 


CGTTGTTTTT 


GACCACCTGA 


GAGTTGTTTT 


660 


GGTTTGGCTT 


GCCAGTAGCG 


TTCTCCCATG 


CCGACCTTTT 


CCAGGTTTTC 


TTTGGCAATC 


720 


TTTTCAGCTT 


CTGTGCGTTC 


GCGTTTTAGG 


ACAGTTGTCT 


GAGCGACGAT 


TGTGTTTTCA 


780 


AGAACATTGA 


GATTTTCAAA 


GAGGTTAAAG 


GATTGGAAAA 


CCATCCCCAA 


CTTTTCACGG 


840 


TATTGCGTGA 


GGTCATAGCC 


TTTTTCGAGG 


ACGTTTTGTC 


CATGATAAAG 


GATTTGTCCA 


900 


TCAGTTGGTG 


TTTCAAGTAG 


GTTAATGGAG 


CGTAGGAAGG 


TCGATTTTCC 


GCTTCCAGAG 


960 


CTTCCGATGA 


TAGAGATGAC 


CTCTCCCTTG 


TGGACAGTGA 


GTGAAATGTC 


TTTTAGCACT 


1020 


TCGTTTTGTC 


CATAGGATTT 


TTTGAGGTGT 


TTAATTTCAA 


GGATTGCTTG 


TGTCATTATT 


1080 


TCAAATCCTC 


CGTTTGCATT 


TGGTTAGCAC 


CTGTAGTGTA 


GGTATCCATG 


TCCATTCTGC 


1140 
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GCTCGATAAA GCGTAGGATA 


CGTGTTACGG 


TGAAGGTGAG 


GACAAAGTAA 


ATCACGGCGA 


1200 


TGATTGTAAA TGTCTGGAAG 


TATTGATAGG 


TTTGTGTTGC 


CACGGTATTT 


CCTGAGAAAT 


1260 


AAAGTTCGAC AACAGAGATA 


ACGTTCAATA 


CAGATGTATC 


TTTGATATTG 


ATGACAAATT 


1320 


CATTACCAGT TGCAGGTAGG 


ATGTTACGGA 


CTACCTGAGG 


TAGGACAATC 


TTACGCATGG 


1380 


TCTGGTTATG GGTCATACCA 


AGAGCAGTCG 


CAGCTTCAAA 


TTGTCCCTTG 


TCAACTGCTA 


1440 


GGATACCACC ACGGACGATT 


TCAGTCATGT 


AGGCACCGGT 


ATTGATTGAA 


ACGATGAAGA 


1500 


TAGCAGCCAG TGTACGGTCA 


AGGTTGATCC 


CGAAAGCTTG 


GGCAGTTCCA 


TAGTAGATAA 


1560 


CCATCGATTG AACAATCATT 


GGCGTACCAC 


GGAAAATTTC 


AATGTAGACA 


TTGAGAACCC 


1620 


AGCCGACTAG TTTTTGTAGG 


CCGTAAATGA 


CTTTGTTTTC 


AGAGAGAGGA 


GCAGTACGGA 


1680 


AGACACCAAT GGCAAGTCCA 


ATAATGAGAC 


CTATGATGGT 


TCCGACGATA 


GAGATTAAAA 


1740 


GAGTGATACC AGCACCACGC 


AAGAGTTGTT 


GCCAGTTTTC 


AGAAAGAATT 


TTAGCAACTT 


1800 


GGCTAAAGAA ACTACTGCTA 


GTCTCTTCAG 


TTGTTGTAGC 


TTCGGCAGGT 


TGTTCCTTGA 


1860 


TCATACGATC CATCAAGGCA 


ACTTGGTCAT 


CTTTTGAAAT 


GGTTTCAATG 


CTGGCATTGA 


1920 


TTTGGCTAAT ACGATTGTCA 


TTTTTACGAA 


GCCCGATAGC 


GATAGCTGTA 


TCTTCTTCCC 


1980 


CAGTTTTGAA ACCAGGTTCT 


ACTTGAATCA 


TCTTGAACTT 


AGAGTTCGCA 


GCTTCAGCAG 


2040 


TCAGTGCTTC TGGACGTTCA 


GAAACATAAG 


CATCAATGAC 


ACCAGCCTCA 


AGAGCTTGTC 


2100 


GCATTTGAGC GAAGTCTCCC 


ATGGCTGTTT 


CTTTTTTAGC 


ACCTGGGATT 


TGTGCAATCA 


2160 


AGTTATAAAG GTAGACCCCT 


TGTTGAGAAG 


TGATTTTTGC 


ACCGTTAAAG 


TCATCCAAAG 


2220 


ATTTAGCACT TGCGTAGGCA 


GAATCTTTTT 


TGACAAGCAA 


AACTGGTTCG 


CTAGTATAGT 


2280 


AACTGCTCGA AAAGGCAATT 


TCTTGTTTGC 


GTTCTGCAGT 


TGGACTCATA 


CCTGCGATAA 


2340 


TCATGTCAAT CTTACCAGAA 


GTAAGGGCAG 


GGACTAGACC 


TTCCCACTTG 


GTTTTAACAA 


2400 


CCAAAGGTTC TTTACCTAAG 


TCCTTAGCGA 


TTTTCTTGGC 


GATTTGAACA 


TCGTATCCGT 


2460 


TGGCATACTG ATTGGTCCCA 


TCGATTTTGA 


CAGCTCCGTT 


GCTATCATCA 


TCCTGGGTCC 


2520 


AGTTAAAGGG AGCATATGCT 


GCTTCCATAC 


CGATGCGTAA 


ATATTCATCG 


GCTTGAGCAA 


2580 


CATTGACAAG TCCTAGCATC AGCAAGAGAC TTGTGAAAAT AGATAAGTAy ATGTGGCTCA 


2640 


TGATTTCTCC TATTCTGATC 


TATTAAAAAA 


TAACTGTCTC 


CTATTTTATC 


GAAAAATGCG 


2700 


TAATTTTTCA ACATAAGTAA 


GTCTTTACTT 


ACGAAAAAAT 


GCTATAATGA 


TAAGAAAGAT 


2760 


AAAAAGGGGG CTTAGTTGAT 


GAAAAAAACT 


TTTTTCTTAC 


TGGTGTTAGG 


CTTGTTTTGC 


2820 


CTTCTTCCAC TCTCTGTTTT TGCCATTGAT TTCAAGATAA ACTCTTATCA AGGGGATTTG 


2880 
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TATATTCATG CAGACAATAC GGCAGAGTTT AGACAGAAGA TAGTTTACCA GTTTGAGGAG 2940 

GACTTTAAGG GCCAAATCGT GGGACTTGGA CGTGCTGGTA AGATGCCTAG CGGGTTTGAC 3000 

ATTGACCCTC ATCCAAAGAT TCAGGCCGCG AAAAACGGTG CAGAACTAGC AGATGTGACT 3060 

AGCGAAGTAA CAGAAGAAGC GGATGGTTAT ACTGTGAGAG TCTATAATCC AGGTCAGGAG 3120 

GGCGACATAG TTGAAGTTGA CCTCGTCTGG AACTTAAAAA ATTTACTTTT CCTTTATGAT 3180 

GATATCGCTG AATTAAATTG GCAACCTCTG ACAGATAGTT CAGAGTCTAT TGAAAAGTTT 3240 

GAATTTCATG TAAGGGGAGA CAAGGGGGCT GAAAAACTCT TTTTCCATAC AGGGAAACTT 3300 

TTTAGAGAGG GAACGATTGA AAAGAGTAAC CTTGATTATA CTATCCGTTT AGACAATCTT 3360 

CCGGCTAAGC GTGGAGTTGA GTTGCATGCC TATTGGCCTC GGACCGATTT TGCTAGCGCT 3420 

AGGGATCAGG GATTGAAAGG GAATCGTTTA GAAGAGTTTA ATAAGATAGA AGACTCGATT 3480 

GTTAGAGAAA AAGATCAGAG TAAACAACTC GTTACTTGGG TCCTCCCTTC GATCCTTTCC 3540 

ATCTCCTTGT TATTGAGTGT CTGCTTCTAT TTTATTTATA GAAGAAAGAC CACTCCTTCA 3600 

GTCAAATATG CCAAAAATCA TCGTCTCTAT GAACCACCAA TGGAATTAGA GCCTATGGTT 3660 

TTATCAGAAG CAGTCTACTC GACCTCCTTG GAGGAAGTGA GTCCCTTGGT CAAGGGAGCT 3720 

GGAAAATTCA CCTTTGATCA ACTTATTCAA GCTACCTTGC TAGATGTGAT AGACCGTGGG 3780 

AATGTCTCTA TCATTTCAGA AGGAGATGCA GTTGGTTTGA GGCTAGTAAA AGAAGATGGT 3840 

TTGTCAAGCT TTGAGAAAGA CTGCCTAAAT CTAGCTTTTT CAGGTAAAAA AGAAGAAACT 3900 

CTTTCCAATT TGTTTGCGGA TTACAAGGTA TCTGATAGTC TTTATCGTAG AGCCAAAGTT 3960 

TCTGATGAAA AACGGATTCA AGCAAGAGGG CTTCAACTCA AATCTTCTTT TGAAGAGGTA 4020 

TTGAACCAGA TGCAAGAAGG AGTGAGAAAA CGAGTTTCCT TCTGGGGGCT CCCAGATTAT 40 BO 

TATCGTCCTT TAACTGGTGG GGAAAAGGCC TTGCAAGTGG GTATGGGTGC CTTGACTATC 4140 

CTGCCCCTAT TTATCGGATT TGGTTTGTTC TTGTACAGTT TAGACGTTCA TGGCTATCTT 4200 

TACCTCCCTT TGCCAATACT TGGTTTTCTA GGGTTAGTTT TGTCTGTTTT CTATTATTGG 4260 

AAGCTTCGAC TAGATAATCG TGATGGTGTT CTAAATGAAG CGGGAGCTGA GGTCTACTAT 4320 

CTCTGGACCA GTTTTGAAAA TATGTTGCGT GAGATTGCAC GATTGGATCA GGCTGAACTG 4380 

GAAAGTATTG TGGTCTGGAA TCGCCTCTTG GTCTATGCGA CCTTATTTGG CTATGCGGAC 4440 

AAGGTTAGTC ATTTGATGAA GGTTCATCAG ATTCAAGTGG AAAATCCAGA TATCAATCTC 4500 

TATGTAGCTT ATGGCTGGCA CAGTACGTTT TATCATTCAA CAGCACAAAT GAGCCATTAT 4560 

GCTAGTGTCG CAAATACAGC AAGCACCTAC TCTGTATCTT CTGGAAGTGG AAGTTCTGGT 4620 

GGTGGCTTCT CTGGAGGCGG AGGTGGCGGC AGTATCGGTG CCTTTTAAAG AGAGCTACCA 4680 
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TAGACTGAAA AAGTATGATA TAATGGAAGA TAGAAAAAAG ACAAACTATA AGAAAAGTCA 4740 

ATAGTTTTAT CTAAACTATT TCTTATTTCA ATTTGATGAT TTGGCGATGA TTTTAGAGCA 4800 

CGGCAAAAAG CCCTTGAAAA AGTCCATTTT TTCAAAGGTA ATCCTGTGTT AATTTCAGAA 4860 

ATTACATCAC TTTTTGTTCG TCAAATGGCA GCTCTTTTTT AGGATATAAA ACAGGGTTCG 4920 

GATAAGTTTT TTTGCAAGGT GGATGATGGC TACATTGTAA TGTTTTCCTT ATTCTAACTT 4980 

AGTCTTAAGA TAGGCCTTAG AAGCAGGTGA AAAGCGAGGG CATGCTTTGG CAGCTTGTAT 5040 

GAGTGCCCAC CGCAGATGAG GGGAACCCCG TTTGACCATT CTTCCAGCTA AATCAATCTG 5100 

ACCTGACTGA TAAATAGAAG AATCCAGTCC AGCGAAAGCT TGTAATTGAG CAGGATTATC 5160 

AAAGGCATGA ATATTTCGAA TCTCGGCTAA AATGACCGCC CTAAACGATC CCCAATCCCA 5220 

GTAACCGTCG TGATGACCGA GTTGAACTCA GCCATCGAGT CATTGATACA TGTTTCCGCC 5280 

TTGTCAATGA GCCTCTTGTA ATGCTTGATG ATTTCGAATT CACGAGCAGG AGATGTTGTT 5340 

CCGATAGAAC GAGGTGCGAC TGAGAGGATA TCCTGAATTT TAGAAGCGGT CAATCGCTTA 5400 

ATTTCTATCA GCTTATCAAA TCCTGCCTCA ATCCTTTTCT GAGGATTAGG GTAGCGTGTC 5460 

AAGAGTTGGT AGGTATATTC TGAATGCTTT CCAACGATTT TATCCAACTC AGGAAAGATG 5520 

ATATCAAGAC AACGAGTGTA TTGTACTTTC CAATCAGACT GTTTTTCTTG AGACGATGAA 5580 

TATGTCTAGC CAGTATTTTT AGGTCTACTT GCCGATTATC GTGTTGAAAT TGTTCACGAT 5640 

TGGGGTCAGA AAGAAGTTTA AGAGCGATGC CATGAGCGTC TTTCTTATCC GTTTTAGTCT 5700 

TGCGAAGTGA TAATGATTTG GCAAATTCCT TGATGAGCAA AGGATTGTAG GTGTAAACTT 5760 

TATATCCTTG TTCATGCAGG AAGTTCAGTA GATTAAAGGC ATAATGTCCA GTATCTTCAA 5820 

GAGCGATGAG ACAGTCTTGG TTGATCTGTC GAATAGACAG ATCTAAGAGT TCAAAACCAG 5880 

CTTTATTATT TGAAAAAGTG AGTGGTTTAA GAACAGTTTT TCCTGGAACA TTCAAGGCTG 5940 

TAACATCGTG TTTATTTTTA GCGATATCAA TGCCTACATA AAGCATGGGA GTACCTCCAG 6000 

ATATAGTATT TCAAGTCTAC TTGGTTATCC ACGAATTTTT TGCCTTGTTA CCTTAGACGA 6060 

GATCAAACGT CTATGCGTTA TCAAACTCAT TACCAATTGA AACAAAAGCT GTGGTTAGAG 6120 

CCTTTCGGAA ATCGTCAAGC GATTGGAGGA AATGAACTAA TCCATAGTGG CTTATTCCAA 6180 

GTATACCACT TGGGCTTTGG CAGTAGCTAA CTGCGCTAAA TATAATATAG GGAGTAATCT 6240 

ATGTATCTTA TTGAAATTTT AAAATCTATC TTCTTCGGAA TTGTTGAAGG AATTACGGAA 6300 

TGGTTGCCGA TTTCCAGTAC AGGTCACTTG ATTTTAGCAG AGGAATTCAT CCAATACCAA 6360 

AATCAAAATG AAGCCTTTAT GTCCATGTTT AATGTCGTGA TTCAGCTTGG TGCTATTTTA 6420 
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GCAGTTATGG TGATTTATTT TAACAAGCTC AATCCTTTTA AACCGACCAA GGACAAACAG 6480 

GAAGTTCGTA AGACTTGGAG ACTATGGTTG AAGGTCTTGA TTGCTACTTT ACCTTTACTT 6540 

GGTGTCTTTA AATTTGATGA TTGGTTTGAT ACCCACTTCC ATAACATGGT TTCAGTTGCT 6600 

CTCATGTTGA TTATCTACGG GGTTGCCTTC ATCTATTTGG AAAAGCGCAA TAAAGCGCGT 6660 

GCTATCGAGC CAAGTGTAAC AGAGTTGGAC AAGCTTCCTT ATACGACCGC TTTCTATATC 6720 

GGACTCTTCC AAGTTCTTGC TCTTTTACCA GGGACTAGCC GTTCAGGTGC AACGATTGTC 6780 

GGTGGTTTGT TAAATGGAAC CAGTCGTTCA GTTGTGACAG AATTTACCTT CTATCTTGGG 6840 

ATTCCTGTTA TGTTTGGAGC TAGTGCCTTA AAGATTTTCA AATTTGTGAA AGCCGGAGAA 6900 

CTCTTGAGCT TTGGGCAATT GTTTTTGCTC TTGGTCGCGA TGGGAGTAGC TTTTGCGGTC 6960 

AGCATGGTGG CTATTCGCTT CTTGACCAGC TATGTGAAAA AACACGACTT CACCCTTTTT 7020 

GGTAAATACC GTATCGTGCT TGGTAGTGTT TTGCTACTTT ACAGTTTTGT CCGTTTATTT 7080 

GTATAAGAAA AACCTTGAAG GGGCAACTCT TCAAGGTTTT ATACTCTTCG AAAATCTCTT 714 0 

CAAACCGCGT CAGCTTTATC TGCAACCTCA AAACAGTGTT TTGAGCAGCn CTGCGGCTAG 7200 

CCTCCTAGTT TGCTCTTTGA TTTTCATTGA GCTTTAAAAT CCAGTCATGG TAATCCCCAA 7260 

TAGGCGGACA CCTCTTTCTT TCTTGCTTAA TTCTTCATAG AGTTGCAGGG CTATTTGGCT 7320 

TATCTGACTA GCATCTTGTG TTTTTTGAGC . AAGACTTTTT CGTTTGGTAA GAGTTGAAAA 7380 

GTCCTCGTAG CGGATTTTCA AAATGACAAT TTTTCCAGCT TTTTCTTGTT GATGTAGATT 7440 

GAGAGCGACT TTTTCTGATA GAAGAGTCAG CTCTTTTTTG ATATCTTCCT CAGCAAGGAG 7500 

AATCTTCCCG TAGGTTTTCT CCTTGCCGAT TGATTTACGG ATGCGATTGG ATTTGACTGG 7560 

AGAGTTGTGA ATGCCACGAG CCTTTCGATA CAGATCATAG CCTAGTCTAC CAAAACGGTC 7620 

TATTAGGGTT ACCTCAGGAA CTTCAAGTAA ATCAGCACCA GTAAAAACGC CCATTTGATG 7680 

AAGACGTTCT ACTGTCTTTT TTCCTACTCC ATGAAATTTG GAAATATCCA TTTGTTTGAG 7740 

AAAATCCTCA GCCTGTTCAG GTAGAATCAC TGTCAAACCA TGTGGTTTTT GATAATCACT 7800 

CGCCATTTTA GCTAAGAATT TGTTGTAAGA AACGCCTGCG GAAGCAGTTA GATGGAGTTC 7860 

TTGCCAGATA TCTTTTTGAA TGAGGCGAGC AATTTTGACC GCTGACTTGA TACCGAGTTT 7920 

ATTTTCTGTC ACATCCAAAT AGGCTTCGTC AATGCTCATG GGTTCAATCA AATCTGTATA 7980 

GCGCTTAAAA ATAGCTCGAA TCTGGAGTCC CACAGACTTG TATTTCTCAT AATTCCCTGA 8040 

GATAAAGACA GCCTGGGGAC AACGTTCATA AGCTTCCTTG GAACTCATGG CAGAATGGAC 8100 

ACCAAAAGCT CTTGCCTCAT AACTACAGGT AGAAACGACT CCCCGTCCAC CTGTTTGCCG 8160 

AGGGTCGCTT CCAATAATGA CAGGTTTTCC TCTGAGTTTA GGATTATCCC TGATTTCCAC 8220 
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TGCAGCAAAA AAGGCATCCA TGTCAATATG GATGATTTTT CTTGACAAAT CATTTAACAA 8280 

AGGAAAAATC AACATGCCTA GCACCTTTTT ATACTCTTCG AAAATCTCTT CAAACCACGT 8340 

CAGCTCTATm TGCAACCTCA AAACAGTGTT TTGAGCAATC TGCGGCTAGC TTCCTAGTTT 8400 

GCTTTTCGAT TTCCATTGAG TGTTACTGCT TATTyTCTTT TATTATACCC TTTTTTCTGA 8460 

AAAAAAGAAA AAAGGACTTT ATTTTTTCAA AAATATAATA CAGTTTGAAA TAAAATATAG 8520 

ACTGTTTTAG AAAAGAAAGT GTAAAAATAG GGAATTTTCA CTTGTTGAAA TCGGTTACTA 8580 

TATGGTATAC TTGTCTTATG AATGTAACAG ATGACTGTTA CTAGAAAAAA GAGGACATTA 8640 

ATATGGTTGT TAAGACAGTT GTTGAAGCAC AAGATATTTT TGACAAAGCT TGGGAAGGCT 8700 

TCAAAGGCGT AGATTGGAAA GAAAAAGCAA GTGTATCACG ATTTGTACAA GCTAACTACA 8760 

CACCTTATGA TGGAGACGAA AGCTTCCTTG CAGGACCAAC AGAGCGTTCA CTTCACATCA 8820 

AGAAAATTGT AGAAGAAACT AAAGCACACT ACGAAGAAAC TCGTTTCCCA ATGGACACTC 8880 

GTCCAACATC TATCGCTGAT ATCCCTGCTG GATTTATCGA CAAAGAAAAT GAAGTTATCT 8940 

TCGGTATCCA AAACGATGAA CTCTTCAAAT TGAACTTCAT GCCAAAAGGT GGTATCCGTA 9000 

TGGCTGAAAC TACTTTGAAA GAAAATGGAT ACGAACCAGA CCCAGCTGTT CACGAAATCT 9060 

TCACTAAATA TGTAACAACA GTTAACGACG GTATTTTCCG TGCCTACACT TCAAATATTC 9120 

GTCGCGCTCG TCACGCACAC ACTGTAACTG GTCTTCCAGA TGCATACTCA CGCGGACGTA 9180 

TCATCGGTGT TTACGCACGT CTTGCTCTTT ACGGTGCAGA CTACTTGATG CAAGAAAAAG 9240 

TAAATGACTG GAATGCAATC AAAGAAATCG ATGAAGAAAC AATCCGTCTT CGTGAAGAAG 9300 

TAAACCTTCA ATACCAAGCA TTGCAACAAG TTGTTCGCCT GGGTGACCTT TACGGGGTTG 93 60 

ATGTTCGCAA ACCAGCGATG AACGTGAAAG AAGCAATCCA ATGGGTTAAC ATTGCTTTCA 9420 

TGGCTGTCTG CCGTGTGATT AACGGTGCTG CTACATCTCT AGGTCGTGTA CCAATCGTAT 9480 

TGGACATCTT TGCAGAACGT GACCTTGCTC GTGGTACATT TACTGAATCA GAAATCCAAG 9540 

AATTCGTTGA TGATTTCGTT ATGAAACTTC GTACAGTTAA ATTTGCTCGT ACAAAAGCTT 9600 

ATGACCAATT GTACTCAGGT GACCCAACCT TTATCACAAC TTCTATGGCT GGTATGGGTA 9660 

ACGACGGTCG TCACCGTGTT ACTAAGATGG ACTACCGTTT CTTGAACACT CTTGACAACA 9720 

TCGGTAACTC ACCAGAACCA AACTTGACAG TTCTTTGGAC TGACAAATTG CCATACAACT 9780 

TCCGTCGCTA CTGTATGCAC ATGAGCCACA AACACTCTTC TATCCAATAC GAAGGTGTAA 9840 

CAACAATGGC TAAAGACGGA TATGGTGAAA TGAGCTGTAT CTCATGCTGT GTGTCTCCAC 9900 

TTGATCCAGA AAATGAAGAA CAACGCCACA ACATCCAGTA CTTCGGTGCT CGTGTAAACG 9960 
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TTCTTAAAGC 


CCTTCTTACT 


GGTTTGAATG 


GTGGTTACGA 


CGATGTTCAC 


AAAGACTACA 


10020 


AAGTATTTGA 


TATCGAACCA 


ATCCGTGACG 


AAGTTCTTGA 


ATTTGAATCA 


GTTAAAGCGA 


10080 


ACTTTGAAAA 


ATCTCTTGAC 


TGGTTGACTG 


ACACTTACGT 


AGATGCCTTG 


AACATCATCC 


10140 


ACTACATGAC 


TGATAGGTAC 


AACTACGAAG 


CTGTTCAAAT 


GGCCTTCTTG 


CCAACTAAAC 


10200 


AACGTGCCAA 


CATGGGATTC 


GGTATCTGTG 


GATTTGCTAA 


CACTGTTGAT 


ACATTGTCAG 


10260 


CTATCAAATA 


CGCTACAGTT 


AAACCAATCC 


GTGACGAAGA 


TGGCTACATC 


TACGATTACG 


10320 


AAACAATCGG 


TGACTACCCA 


CGCTGGGGTG 


AAGATGACCC 


ACGTTCAAAC 


GAATTGGCAG 


10380 


AATGGTTGAT 


CGAAGCTTAC 


ACAACTCGTC 


TACGTAGCCA 


CAAACTATAC 


AAAGACGCAG 


10440 


AAGCTACAGT 


ATCACTTTTG 


ACAATCACAT 


CTAACGTTGC 


TTACTCTAAA 


CAAACTGGTA 


10500 


ACTCACCAGT 


TCACAAAGGT 


GTATACCTCA 


ACGAAGATGG 


TTCTGTGAAC 


TTGTCTAAAC 


10560 


TTGAATTCTT 


CTCACCAGGT 


GCTAACCCAT 


CTAACAAAGC 


TAAAGGTGGT 


TGGTTGCAAA 


10620 


ACTTGAACTC 


ACTTTCTAGC 


CTTGACTTTA 


GTTATGCAGC 


TGACGGTATC 


TCATTGACTA 


10680 


CACAAGTATC 


ACCTCGCGCT 


CTTGGTAAGA 


CTCGTGATGA 


ACAAGTTGAT 


AACTTGGTAA 


10740 


CAATTCTTGA 


TGGTTACTTC 


GAAAACGGTG 


GACAACACGT 


TAACTTGAAC 


GTTATGGACT 


10800 


TGAACGATGT 


TTACGAAAAA 


ATCATGTCAG 


GCGAAGACGT 


TATCGTACGT 


ATCTCTGGAT 


10860 


ACTGTGTAAA 


CACTAAATAC 


CTCACTCCAG 


AACAAAAAAC 


TGAATTGACA 


CAACGTGTCT 


10920 


TCCACGAAGT 


TCTTTCAATG 


GATGACGCCT 


TGGATGCATT 


GAGCTAATCA 


AGTTCTTGAA 


10980 


TAATAAAAAG 


GAACCCTCGG 


TCAAACGACT 


GAGGGTTTTG 


TGCTTGGGAT 


AGTATGAGCA 


11040 


ATTCCTTCGG 


CGCAATATGC 


AATGTTTTTG 


GGCTCTTTGT 


CAACTGTAGT 


GGGTTGAAAA 


11100 


AAAGCTAAGC 


TTGAGAAAGG 


ACAAATTTCG 


TCCTTTCTTT 


TTTGATGTTC 


AGGGCGATAA 


11160 


AAATCCGTTT 


TTTGAAGTTT 


TCAAAGTTCC 


GAAAACCAAA 


GGCATTGCGC 


TTGATGTCTT 


11220 


TGATGAGTTT 


GTTAGTGGCC 


TCAAGTTTAG 


CGTTAGAATA 


AGGCAATTCA 


ATGGCGTTAG 


1 1 99ft 


TGATGTAGTT 


TTTATAGCAA 


ATAAATGTGC 


TCAAAGTGGT 


TTTAAAGGTG 


CGGTTGAGAT 


11340 


GAGGTAACGT 


GTCTTGAATT 


AAGCCCCAAA 


ACTGGTCAGT 


ATTCTTCTCT 


TGTAGATGAA 


11400 


ATAGGAGTAG 


TTGATACAGG 


TCATAGTAAT 


CTTTAAGTTC 


AGGTACTAGA 


GTAAAGATTT 


11460 


TCTTCAGACA 


CTCCCTAGGA 


GTTAAGGTCT 


CTCTGAAAGT 


TCTAGCATAG 


AAAGGCTTAA 


11520 


GAGAGAGTTT 


CCGACTATCT 


TTTAGGATAA 


ATTTCCAGTA 


ATATTTAAGA 


GCTCTGTATT 


11580 


CCAGAGATTT 


ATCATCAAAT 


TGCTTCATGA 


TGTTGATTCT 


AGTCTGATTA 


AGAGCCCTGC 


11640 


TCATGTGTTG 


GACAATGTGG 


AAACGATCGA 


GAACAATTTT AGCATTGGGA AATAATTTCT 


11700 


TAATGAGAGG 


GATATAACTT 


CCAGACATAT 


CAACAGTGAC 


GACTTTAACT 


TTTTTTCTAG 


11760 
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CTTCTTTCGA 


GTACTTGAAG 


AAATGATTTC 


GGATGGTTGT 


TTGACGTCTG 


TTATCAAGAA 


11820 


TGGTCATGAT 


TTTCTTAGTG 


TTGAAATCCT 


GAGCAATGAA 


AGCCAATTTC 


CCCTTCTGGT 


11880 


AGGAGAATTC 


ATCCCAGGAG 


AGGATTTCAG 


GCAAAGTGGT 


GTAATCCTCT 


TGGAAATGAA 


11940 


ATTGCTTGAG 


CTTACGATAG 


ACGGTAGAGG 


TAGAGGTAGA 


GGTAGAGATG 


GCTAATTTAG 


12000 


AAGCGATATG 


TGTAAGAGCC 


TCTCTGTTGA 


GTAGGAGTTG 


GGCAATTTTC 


TGTCTCACCA 


12060 


TTTCCGAGAT 


TTGGCAATTT 


TTCTGAACGA 


GAGTTGTTTC 


AGCTACAGTG 


ACTTTCCGAC 


12120 


AGGACTTGCA 


TTGAAATCGT 


CTCTTTTTCA 


AATGAATGAG 


GCTAGGGAAA 


CCACCAATCT 


12180 


CGATAAAAGG 


GATTTTAGAA 


GGCTTTTGGA 


AGTCGTATTT 


GATTTGTTTT 


CCTTTACAGT 


12240 


GTTTACATTT 


AGGTGGGTGA 


TAATCAAGTG 


TAGCGAAGAC 


TTCGATATGG 


GTATCGTGCT 


12300 


GAATGGCTTT 


ATTTAAGGTG 


ATGTTTTTGT 


CTTTTATTCC 


GATGAGTAAT 


GTGGTATGAT 


12360 


TGATGTGTTC 


CATAAGATAC 


TTTCTAATGA 


GTTGTTTAGG 


CGCTTTTCAT 


TATAAGTCTT 


12420 


ATGGGACTTT 


TTTGATACTC 


AAAAAGCCCT 


ATAATCTCCA 


CAGTGGGATT 


TACCCACTAC 


12480 


AGAAATTATA 


GAGCCAGAAA AAACACTTTT 


GTTCACTAGC 


AGAAACTAGA 


GAGCAGAAGT 


12540 


GTTTTTCTGT 


TCAGATTTAC 


CCAAAACTGG 


GAAATATGGG 


GATAAGAATA 


GAGATGGCTT 


12600 


AGGAAGCCCC 




TAGACAGTAC 


GATGAACTTA 


TAACAAATAG 


TGAGCCTTTT 


12660 


TAGCAATCAT 


TGCGACCCGT 


TTGTCAAAAG 


CCTCTTTTCG 


GATATCTACA 


ATTGTCTGAT 


12720 


AGATGAGACG 


CTGTTGGCTA 


ACATGCAAAT 


CTAAGGCAAT 


CGTCAAAAAG 


TGATGTTTCC 


12780 


CTTTGGGATA 


CTGCTTTTTA 


ACGTAAGGCA 


GGTATTCTTT 


CGtTGTAATA 


ATAATCAATG 


12840 


GCTCTGTCAA 


ATGCTCCTCT 


GAAGGAGGAG 


GACTAATTAG 


AATATTGTAT 


CCTGTAACAG 


12900 


AGGCAACTTT 


GTCAGTAAAA 


TTCCGTAAAA 


TAATGGACTT 


TATTAAGTTT 


ACATCTGCTT 


12960 


GATTATTTAA 


AATGATAAAA 


ATCGGGATAG 


CAGGTAGTGA 


GGAAAAGATG 


GTTTCTGTCA 


13020 


AGTAGAGTGA 


GAAAAGGTAC 


AGCCGATGCT 


GGTCGATAAC 


TCCTTCAATC 


TTCTGCTCAG 


13080 


TCATCCACTC 


TTGAACAATT 


GCTTTCGAAA 


TATGATACAG 


TGGCTTGTCG 


CTTTCAATCC 


13140 


CATAATGTTC 


GTAATAATTA 


TAATAGGGAA 


CTAGATTTTG 


TAAACCAAAC 


AAAAACGTTC 


13200 


TTGTTAAGAA 


AGTCAGTGCT 


GTTAAAAAAG 


AAAGAGAATT 


CGAAATGTCA 


TTTCCTAAGA 


13260 


TATTCTTGAA 


CTTGGATAGT 


AGATGCTTTC 


CTCTTGTATG 


CTGAAGAATC 


AGTTGAATAG 


13320 


TATGAGTCTT 


TTTTTCTTGA 


TTCCATTTGT 


CCTTGGAAAA 


CGAAGAATTA 


GCAGAACAAT 


13380 


AAACCAAAAA 


GATATAATCC 


AGTTCTTCCT 


GAGTAAAAGT 


CATGTTGGCA 


TGTGGCTCTA 


13440 


AGTAAGTTTG 


GCAATGTTCC 


ATCAAAATCG 


GATACATAAA 


GAGGTTTTTT 


AATTTTTCAA 


13500 
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ACTCTTTGGA CTCAGGGAAC TCAAGTGGAA ATTCCCGACG TTTCCAAGTG AGTGCCACTA 13560 

GTATGCTAAA ATGAACATAC TCGTCAGGTG TGATTTCTAA CAGTTCATGA CTGAGTTGAG 13620 

AATTAGACTG CACAATCATA TGTGTGACCC AATCCATACT TCCATCATTC AAATCATAAA 13680 

TCTCAATACC AAAATGAAAC TGGAGGAGTG CAATTAAAAA ACGAATGCGA TATTCAGGAC . 13740 

CAACTACTTG ATTTTTCACA AGGTCCAAAC CTACTGAACG TAGTAACAAG CCACACTTTT 13800 

GTCGTACGCG GTAGCCTGTT GCGATGGAAA TATACTCTTT TTGTGTAAAT TCGTTAAAGC 13860 

TTTGATTACC TTGTAGTAGA AAGAAGCGGA GTATTTTTAA AATAGTTGAT TGGTTATAAA 13920 

GCTGATGGAA GTAATAATTC GTTTGATGAG AATGGTGTTC GATTAATTGA ACTTGTTGCG 13980 

TATCTAAATT AAATGTCAAC TCTTCCTCGA ATGTTTCTTG TAATTCCTGC AAAATGCTTA 14040 

GGAGACTTTT AGATTGTAAT GAAGTTAAAG TAGACAGTTC ATCTAGTTCA ATAGACCGAA 14100 

TATCCAATAA TATATTTAAA ATGGTAATTT TATCTGTAAT TCTTTTTTCA ATGTATTTGT 14160 

TTAGCATAGT TACCGAATCT TAGTTGCATA TAGATAATTT TAATTATTAT AATACAAAAG 14220 

AAACTAATTG TCTTGTCAAA AAGGTTGTGG AATTTCCGAC TTTATTGATA AAACAGCATG 14280 

TAATAAAAGG CATTTTAAAG ATAGTAATGA GTATTGGTGG AGTTTTATGG CTTATTTTTT 14340 

TTATTAGAAA ATATTTTTTT ATCAAATATT GTCGTTCTAT AAAAAAATAT GTGATAAAAA 14400 

TATCTATTGT GATGGAAGTT GTTTTAATTT ATACTAGGAT AGTTAATAGT AATACTATAC 14460 

TATACTATAT TGTATACAAG TGTGTCATTG CCAGGTTGAG AAGATAGCTA TAACGCACTT 14520 

TTATACGCTT TTGCTACGTT TGTTAGTGAA CGGATTAACT CAGTGAGATA AATTTTATCA 14580 

GAACATAAGT AATCCGTTTC TTCGTGTATA CAGATTGAAA GTACCTATGA ATCATAGAAG 14640 

GATTAACTTG TTCTATGAAT AATGCTTAAC AGGGAGACAC ACATGAAAAA AGTAAGAAAG 14700 

ATATTTCAGA AGGCAGTTGC AGGACTGTGC TGTATATCTC AGTTGACAGC TTTTTCTTCG 14760 

ATAGTTGCTT TAGCAGAAAC GCCTGAAACC AGTCCAGCGA TAGGAAAAGT AGTGATTAAG 14820 

GAGACAGGCG AAGGAGGAGC GCTTCTAGGA GATGCCGTCT TTGAGTTGAA AAACAATACG 14880 

GATGGCACAA CTGTTTCGCA AAGGACAGAG GCGCAAACAG GAGAAGCGAT ATTTTCAAAC 14940 

ATAAAACCTG GGACATACAC CTTGACAGAA GCCCAACCTC CAGTTGGTTA TAAACCCTCT 15000 

ACTAAACAAT GGACTGTTGA AGTTGAGAAG AATGGTCGGA CGACTGTCCA AGGTGAACAG 15060 

GTAGAAAATC GAGAAGAGGC TCTATCTGAC CAGTATCCAC AAACAGGGAC TTA'TCCAGAT 15120 

GTTCAAACAC CTTATCAGAT TATTAAGGTA GATGGTTCGG AAAAAAACGG ACAGCACAAG 15180 

GCGTTGAATC CGAATCCATA TGAACGTGTG ATTCCAGAAG GTACACTTTC AAAGAGAATT 15240 

TATCAAGTGA ATAATTTGGA TGATAACCAA TATGGAATCG AATTGACGGT TAGTGGGAAA 15300 
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ACAGTGTATG AACAAAAAGA TAAGTCTGTG CCGCTGGATG TCGTTATCTT GCTCGATAAC 15360 

TCAAATAGTA TGAGTAACAT TCGAAACAAG AATGCTCGAC GTGCGGAAAG AGCTGGTGAG 15420 

GCGACACGTT CTCTTATTGA TAAAATTACA TCTGATTCAG AAAATAGGGT AGCGCTTGTG 154 BO 

ACTTATGCTT CCACTATCTT TGATGGGACC GAGTTTACAG TAGAAAAAGG GGTAGCAGAT 15540 

AAAAACGGAA AGCGATTGAA TGATTCTCTT TTTTGGAATT ATGATCAGAC GAGTTTTACA 15600 

ACCAATACCA AAGATTATAG TTATTTAAAG CTGACTAATG ATAAGAATGA CATTGTAGAA 15660 

TTAAAAAATA AGGTACCTAC CGAGGCAGAA GACCATGATG GAAATAGATT ' GATGTACCAA 15720 

TTCGGTGCCA CTTTTACTCA GAAAGCTTTG ATGAAGGCAG ATGAGATTTT GACACAACAA 15780 

GCGAGACAAA ATAGTCAAAA AGTCATTTTC CATATTACGG ATGGTGTCCC AACTATGTCG 15840 
TATCCGATTA ATTTTAATCA TGCTACGTTT GCTCCATCAT ATCAAAATCA ACTAAATGCA ' 15900 

TTTTTTAGTA AATCTCCTAA TAAAGATGGA ATACTATTAA GTGATTTTAT TACGCAAGCA 15960 

ACTAGTGGAG AACATACAAT TGTACGCGGA GATGGGCAAA GTTACCAGAT GTTTACAGAT 16020 

AAGACAGTTT ATGAAAAAGG TGCTCCTGCA GCTTTCCCAG TTAAACCTGA AAAATATTCT 16080 

GAAATGAAGG CGGCTGGTTA TGCAGTTATA GGCGATCCAA TTAATGGTGG ATATATTTGG 16140 

CTTAATTGGA GAGAGAGTAT TCTGGCTTAT CCGTTTAATT CTAATACTGC TAAAATTACC 16200 

AATCATGGTG ACCCTACAAG ATGGTACTAT AACGGGAATA TTGCTCCTGA TGGGTATGAT 16260 

GTCTTTACGG TAGGTATTGG TATTAACGGA GATCCTGGTA CGGATGAAGC AACGGCTACT 16320 

AGTTTTATGC AAAGTATTTC TAGTAAACCT GAAAACTATA CCAATGTTAC TGACACGACA 16380 

AAAATATTGG AACAGTTGAA TCGTTATTTC CACACCATCG TAACTGAAAA GAAATCAATT 16440 

GAGAATGGTA CGATTACAGA TCCGATGGGT GAGTTAATTG ATTTGCAATT GGGCACAGAT 16500 

GGAAGATTTG ATCCAGCAGA TTACACTTTA ACTGCAAACG ATGGTAGTCG CTTGGAGAAT 16560 

GGACAAGCTG TAGGTGGTCC ACAAAATGAT GGTGGTTTGT TAAAAAATGC AAAAGTGCTC 16620 

TATGATACGA CTGAGAAAAG GATTCGTGTA ACAGGTCTGT ACCTTGGAAC GGATGAAAAA 16680 

GTTACGTTGA CCTACAATGT TCGTTTGAAT GATGAGTTTG TAAGCAATAA ATTTTATGAT ,16740 

ACCAATGGTC GAACAACCTT ACATCCTAAG GAAGTAGAAC AGAACACAGT GCGCGACTTC 16800 

CCGATTCCTA AGATTCGTGA TGTGCGGAAG TATCCAGAAA TCACAATTTC AAAAGAGAAA 16860 

AAACTTGGTG ACATTGAGTT TATTAAGGTC AATAAAAATG ATAAAAAACC ACTGAGAGGT 16920 

GCGGTCTTTA GTCTTCAAAA ACAACATCCG GATTATCCAG ATATTTATGG AGCTATTGAT 16980 

CAAAATGGCA CTTATCAAAA TGTGAGAACA GGTGAAGATG GTAAGTTGAC CTTTAAAAAT 17040 
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CTGTCAGATG 


GGAAATATCG 


ATTATTTGAA 


AATTCTGAAC 


CAGCTGGTTA 


TAAACCCGTT 


17100 


CAAAATAAGC 


CTATCGTTGC 


CTTCCAAATA 


GTAAATGGAG 


AAGTCAGAGA 


TGTGACTTCA 


17160 


ATCGTTCCAC 


AAGATATACC 


AGCGGGTTAC 


GAGTTTACGA 


ATGATAAGCA 


CTATATTACC 


17220 


AATGAACCTA 


TTCCTCCAAA 


GAGAGAATAT 


CCTCGAACTG 


GTGGTATCGG 


AATGTTGCCA 


17280 


TTCTATCTGA 


TAGGTTGCAT 


GATGATGGGA 


GGAGTTCTAT 


TATACACACG 


GAAACATCCG 1 


17340 


TAAAGTGTAG 


AAATGATAAT 


ATCTATGTTC 


TGAACGATAC 


TTTTAAGAAG 


TAGCACTCAA 


17400 


GAAGAGATTT 


AAGTTTACTT 


GGTGAAACCT 


GTTTTATTCG 


TAAGTAAACT 


ATCATTGAAA 


17460 


GGGGAGATGT 


TTTCGAAAAC 


TTGCACAGAA 


AAAGGATTAT 


TATTGTCATG 


TGTAATTCAT 


17520 


TACATTGCTC 


ACAGTTGATT 


TTAAGAGATA 


TGAATAAGGA 


GAAATCATGA 


AATCAATCAA 


17580 


CAAATTTTTA 


ACAATGCTTG 


CTGCCTTATT 


ACTGACAGCG 


AGTAGCCTGT 


TTTCAGCTGC 


17640 


AACAGTTTTT 


GCGGCTGGGA 


CGACAACAAC 


ATCTGTTACC 


GTTCATAAAC 


TATTGGCAAC 


17700 


AGATGGGGAT 


ATGGATAAAA 


TTGCAAATGA 


GTTAGAAACA 


GGTAACTATG 


CTGGTAATAA 


17760 


AGTGGGTGTT 


CTACCTGCAA 


ATGCAAAAGA 


AATTGCCGGT 


GTTATGTTCG 


TTTGGACAAA 


17820 


TACTAATAAT 


GAAATTATTG 


ATGAAAATGG 


CCAAACTCTA 


GGAGTGAATA 


TTGATCCACA 


17880 


AACATTTAAA 


CTCTCAGGGG 


CAATGCCGGC 


AACTGCAATG 


AAAAAATTAA 


CAGAAGCTGA 


17940 


AGGAGCTAAA 


TTTAACACGG 


CAAATTTACC 


AGCTGCTAAG 


TATAAAATTT 


ATGAAATTCA 


18000 


CAGTTTATCA 


ACTTATGTCG 


GTGAAGATGG 


AGCAACCTTA 


ACAGGTTCTA 


AAGCAGTTCC 


18060 


AATTGAAATT 


GAATTACCAT 


TGAACGATGT 


TGTGGATGCG 


CATGTGTATC 


CAAAAAATAC 


18120 


AGAAGCAAAG 


CCAAAAATTG 


ATAAAGATTT 


CAAAGGTAAA 


GCAAATCCAG 


ATACAGCACG 


18180 


TGTAGATAAA 


GATACACCTG 


TGAACCACCA 


AGTTGGAGAT 


GTTGTAGAGT 


ACGAAATTGT 


18240 


TACAAAAATT 


CCAGCACTTG 


CTAATTATGC 


AACAGCAAAC 


TGGAGCGATA 


GAATGACTGA 


18300 


AGGTTTGGCA 


TTCAACAAAG 


GTACAGTGAA 


AGTAACTGTT 


GATGATGTTG 


CACTTGAAGC 


18360 


AGGTGATTAT 


GCTCTAACAG AAGTAGCAAC TGGTTTTGAT TTGAAATTAA CAGATGCTGG 


18420 


TTTAGCTAAA 


GTGAATGACC 


AAAACGCTGA 


AAAAACTGTG 


AAAATCACTT 


ATTCGGCAAC 


18480 


ATTGAATGAC 


AAAGCAATTG 


TAGAAGTACC 


AGAATCTAAT 


GATGTAACAT 


TTAACTATGG 


18540 


TAATAATCCA 


GATCACGGGA ATACTCCAAA 


GCCGAATAAG 


CCAAATGAAA 


ACGGCGATTT 


18600 


GACATTGACC 


AAGACATGGG 


TTGATGCTAC 


AGGTGCACCA 


ATTCCGGCTG 


GAGCTGAAGC 


18660 


AACGTTCGAT 


TTGGTTAATG 


CTCAGACTGG 


TAAAGTTGTA 


CAAACTGTAA 


CTTTGACAAC 


18720 


AGACAAAAAT 


ACAGTTACTG 


TTAACGGATT 


GGATAAAAAT 


ACAGAATATA 


AATTCGTTGA 


18780 


ACGTAGTATA 


AAAGGGTATT 


CAGCAGATTA 


TCAAGAAATC 


ACTACAGCTG 


GAGAAATTGC 


18840 
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TGTCAAGAAC TGGAAAGACG AAAATCCAAA ACCACTTGAT CCAACAGAGC CAAAAGTTGT 18900 

TACATATGGT AAAAAGTTTG TCAAAGTTAA TGATAAAGAT AATCGTTTAG CTGGGGCAGA 18960 

ATTTGTAATT GCAAATGCTG ATAATGCTGG TCAATATTTA GCACGTAAAG CAGATAAAGT 19020 

GAGTCAAGAA GAGAAGCAGT TGGTTGTTAC AACAAAGGAT GCTTTAGATA GAGCAGTTGC 19080 

TGCTTATAAC GCTCTTACTG CACAACAACA AACTCAGCAA GAAAAAGAGA AAGTTGACAA 19140 

AGCTCAAGCT GCTTATAATG CTGCTGTGAT TGCTGCCAAC AATGCATTTG AATGGGTGGC 19200 

AGATAAGGAC AATGAAAATG TTGTGAAATT AGTTTCTGAT GCACAAGGTC GCTTTGAAAT 19260 

TACAGGCCTT CTTGCAGGTA CATATTACTT AGAAGAAACA AAACAGCCTG CTGGTTATGC 19320 

ATTACTAACT AGCCGTCAGA AATTTGAAGT CACTGCAACT TCTTATTCAG CGACTGGACA 19380 

AGGCATTGAG TATACTGCTG GTTCAGGTAA AGATGACGCT ACAAAAGTAG TCAACAAAAA 19440 

AATCACTATC CCACAAACGG GTGGTATTGG TACAATTATC TTTGCTGTAG CGGGGGCTGC 19500 

GATTATGGGT ATTGCAGTGT ACGCATATGT TAAAAACAAC AAAGATGAGG ATCAACTTGC 19560 

TTAAGTAAGA GAGAAAGGAG CCATTGATGA CAATGCAGAA AATGCAGAAA ATGATTAGTC 19620 

GTATCTTCTT TGTTATGGCT CTGTGTTTTT CTCTTGTATG GGGTGCACAT GCAGTCCAAG 19680 

CGCAAGAAGA TCACACGTTG GTCTTGCAAT TGGAGAACTA TCAGGAGGTG GTTAGTCAAT 19740 

TGCCATCTCG TGATGGTCAT CGGTTGCAAG TATGGAAGTT GGATGATTCG TATTCCTATG 19800 

ATGATCGGGT GCAAATTGTA AGAGACTTGC ATTCGTGGGA TGAGAATAAA CTTTCTTCTT 19860 

TCAAAAAGAC TTCGTTTGAG ATGACCTTCC TTGAGAATCA GATTGAAGTA TCTCATATTC 19920 

CAAATGGTCT TTACTATGTT CGCTCTATTA TCCAGACGGA TGCGGTTTCT TATCCAGCTG 19980 

AATTTCTTTT TGAAATGACA GATCAAACGG TAGAGCCTTT GGTCATTGTA GCGAAAAAAA 20040 

CAGATACAAT GACAACAAAG GTGAAGCTGA TAAAGGTGGA TCAAGACCAC AATCGCTTGG 20100 

AGGGTGTCGG CTTTAAATTG GTATCAGTAG CAAGAGATGT TTCTGAAAAA GAGGTTCCCT 20160 

TGATTGGAGA ATACCGTTAC AGTTCTTCTG GTCAAGTAGG GAGAACTCTC TATACTGATA 20220 

AAAATGGAGA GATTTTTGTG ACAAATCTTC CTCTTGGGAA CTATCGTTTC AAGGAGGTGG 20280 

AGCCACTGGC AGGCTATGCT GTTACGACGC TGGATACGGA TGTCCAGCTG GTAGATCATC 20340 

AGCTGGTGAC GATTACGGTT GTCAATCAGA AATTACCACG TGGCAATGTT GACTTTATGA 20400 

AGGTGGATGG TCGGACCAAT ACCTCTCTTC AAGGGGCAAT GTTCAAAGTC ATGAAAGAAG 20460 

AAAGCGGACA CTATACTCCT GTTCTTCAAA ATGGTAAGGA AGTAGTTGTA ACATCAGGGA 20520 

AAGATGGTCG TTTCCGAGTG GAAGGTCTAG AGTATGGGAC ATACTATTTA TGGGAGCTCC 20580 
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AAGCTCCAAC TGGTTATGTT CAATTAACAT CGCCTGTTTC CTTTACAATC GGGAAAGATA 20640 

CTCGTAAGGA ACTGGTAACA GTGGTTAAAA ATAACAAGCG ACCACGGATT GATGTGCCAG 20700 

ATACAGGGGA AGAAACCCTT GTATATCTTG ATGCTTGTTG CCATTTTGTT GTTTGGTAGT 20760 

GGTTATTGTC TTACGAAAAA ACCAAATAAC TGATATTCAA TGTACATCAT TATGAATAGG 20820 

ATAGCAGGCT GAAGGGAAGA CCAGAGTACT CTGAGGTGAT GTTAATCAGG AATCATGGTG 20880 

ATGTGGCATG AATCATCAAT AACGGATATG AGGCTGGGCA GATTGTGCCA GCCTCATTGT 20940 

GGGTTATTGT TTGTAAAACG ATAGGACTGG TCTGGTAATC ATTTTA 20986 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21040 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

CCCAGCAAAA AGCCATCCGA AGATGACTTT TTTGCTATTT AATTTCTGTA TAAGTTACTT 60 

CCAAGCCACG CTTAACAGCT GGACGATTGG CAATTTTTTC TGCCCATTTT ACTAGATTTT 120 

GATAACTTGA GGCATCCAAG AATTTTGCAG AACCTTGGTA AAGATTTCCT TGAACTAACT 180 

GTCCATACCA AGACCAGATA GCAATATCTG CAATCGTATA GTCATTGCCT GCAATATAAG 240 

GTTTCTGAGC CAATTCCTTA TCCAATAAAT CCAACTGGCG TTTCACTTCC ATCGTAAAAC 300 

GGTTAATAGG ATATTCCAAT TTTTCAGGAG CATAATTGAA GAAATGTCCA AATCCCCCAC 360 

CTAGAAAAGG TGCTGCACCT GCTTGCCAGA ATAGCCAATT CAAAACTTCT ACCTTTTCCA 420 

CAGGATTACT TGGTAAAAAG GCTCCAAATT TCTCAGCAAG GTAAAGAAGA ATATGAGCAG 480 

ACTCAAAGAC TCTTACGTTT TCAGTACCTG ACTGGTCCAA TAAGGCTGGA ATCTTGGAAT 540 

TTGGATTGAG CTTCACAAAG TCTGATCCGA ATTGATCCCC ATCCATGATA GCAATCTTAT 600 

ACAAGTCGTA AGCCGCTTCC TTAAAACCAG CTTCTAGTAA TTCTTCCAAT AAGATAGTAA 660 

CCTTCACACC ATTTGGTGTT CCCAGTGAAT AAAGCTGAAA AGCTTGTTCT CCTTTTGGCA 720 

AGTTTTGTTC GAAACGGGCA CCTGCTGTTG GTCTGTTTAG CCCCGTAAAA GCTCCTTGAT 780 

TACTAGCTTC ATCCTGCCAT ACGGTCGGTA ATTGATATGC TGACATCCGA AACCTCCCTT 840 

AAATCGCATT CTTGTCAAAA CCGAGTTTGC GTTGAATAAA CTTAACGATT TCGACGATGA 900 

TAATCATTGA GAAGCTTCCA GCCATAACAA TTCCCCATTG TGACAAGTCT AGTTTGGTTA 960 

CGTGGAAGAT TCCTTCAAGC GGTTCTACAA CGATTGTTGC CATGAGAAGG ATAAAGGATA 1020 
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CCAAGATGGA CCAGTTAAAG GTCTTAGACT TGAATGGGCC AACTGTCAAG ATGGATTGGT 1080 

AGACAGACTT GACATTGTAG GCATGGAAGA GCTGAATCAA ACCAAGGGTT GCAAAGGCCA 1140 

TCGTTAGGGC ATCTGCATGA ATAGCATGAT TGTCACCCAC ATGAACTGGG TAAGCAATCG 1200 

CAAGGCCATA AACACTCATA ACAAGAGCTG CTTGGAGTAC ACCTTGATAA ATGATAGAAC 1260 

TCAAAACACC ACCTGAGAAG AAGCTTGCCT TGCGTCCACG TGGTTTATGA TTCATGACAC 1320 

CAGGTTCCGC AGGTTCAACA CCAAGAGCGA TAGCTGGGAA GGTATCCGTT ACCAAGTTGA 1380 

TCCACAAAAG ATGAACCGGC TGTAAGACAT CCCAACCAAA CAAGGTTGAT AGGAAGATGG 1440 

TTAATACTTC AGCAGTATTA GCAGAAAGTA GGTACTGAAT AGTCTTTTGA ATGTTTGAGA 1500 

AGACCTTACG TCCTTCTTCC ACTGCGACGA TAATAGTCGC AAAGTTATCA TCTGCAAGAA 1560 

TCATATCAGA AGCCCCCTTA GAAACCTCTG TACCAGTGAT TCCCATACCG ATACCGATAT 1620 

CGGCTGTTTT CAGAGCTGGC GCGTCATTGA CACCGTCACC TGTCATGGCA ACGACTTTAC 1680 

CTTGTTTTTG CCAAGCCTTG ACGATACGAA CCTTGTGTTC TGGAGACACA CGGGCATAAA 1740 

CAGAGTATTG ACCAACGACT TTTTCAAATT CTTCATCTGA CAGTTCATTG AGTTCAGCAC 1800 

CAGTTAAAAC GTGACCTTCT GTATCGTTTG CGTCAATGAT TCCCAAACGT TTGGCAATGG 1860 

CTTCCGCTGT GTCTTGGTGG TCACCTGTAA TCATAATTGG ACGGATTCCC GCTTCCTTAG 1920 

CCACACGAAC AGCCTCAGCG GCTTCAGGAC GTTCAGGGTC AATCATCCCA ATCAAACCAG 1980 

TAAAAATTAA ATCATTTTCA AGCTCTTCAG AAGTGAGATT TTCTGGAATA CTATCGATAA 2040 

TCTTATAAGC ACCTGCAAGG ACACGCAAGG CTTGATGAGC CATTTCAGAA TTGTTTGTAC 2100 

GAATGAGATT TGTAACCTTC TCATCAATCG GAGCAATATC CCCAGCCTTA TCACGAAGAA 2160 

GACAACGTTT TAAGAGTTGG TCTGGCGCAC CCTTGACTGC TACAAGGAAA CGACCATCTG 2220 

GCAATGGGTG AACTGTTGAC ATGAGCTTAC GGTCAGAGTC AAATGGCAAT TCAGCTACAC 2280 

GAGGATATTT CTCTAAGAAA CCTTTGACAT CATAGCCCTT GTCCAAGGCA TATTGGATAA 2340 

AGGCTGTTTC GGTTGGGTCA CCAATCAAGT TACCTTCCAC ATCGATTTTC GTATCATTGG 2400 

CCAAGACAAC TGAACGAAGT AGTGGCATTT CAAGACCTAG TTCAATATCA TCAGCTGAGT 2460 

CATGTAGAAC CGCATCGTAG AAGACTTTTT CGACTGTCAT CTTGTTCATA GTCAGCGTAC 2520 

CAGTCTTATC AGAAGCGATG ATTTCAGTTG AACCAAGTGT TTCAACTGCT GGCAACTTAC 2580 

GAACGATGGA ATGTCGTTTG GCCAAAACTT GAGTACCAAG AGAAAGAACG ATGGTAACGA 2640 

TAGCAGGAAG TCCTTCTGGA ATGGCTGCAA CGGCAAGGGC AACAGAAGTC AACAACTCAC 2700 

CAAGTGGATT TTTCCCTTGA ATGAAGACAC CCACTACAAA AGTAACAAGG GCAATGACCA 2760 
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AG AT AG CAT A GGTCAAGACC TTAGAAAGGT TGTTCAAATT TTGTTTGAGT GGTGTATCAG 2820 

TCTCATCCGC ATCTTGAAGC ATACCAGCAA TATGACCAAC TTCAGTGTAC ATACCTGTAT 2880 

TGACAACAAC ACCCATCCCA CGACCATAGG TTACGTTTGA GTTTTGGAAG GCCATGTTGA 2940 

CACGGTCACC AATACCAGCA TCTGTCGCAA GCTCGACTGA CAAGTCTTTT TCGACTGGTA 3000 

CAGATTCACC TGTCAAGGCT GCTTCTTCAA TTTTAAGAGA GTTGGCTTCT ATCAAACGTA 3060 

GGTCCGCTGG TACCACGTCA CCTGCTTCAA GGGCAACGAT ATCGCCTGGT ACCAATTCTT 3120 

TAGAGTCAAT CTCTGCCATG TGTCCATCAC GAAGAACGCG GGCAACTGGA CTAGACATGG 3180 

ATTTGAGGGC TTCAATAGCT TCTTCAGCTT TTCCTTCTTG GTAAACACCA AAGGCAGCGT 3240 

TGATGATAAC CACAGCTAGG ATGATAATGG CATCTGCGAT ATCTTCCCCA CCAGAAGTCA 3300 

CGACTGACAA GATTGCtGCC GCAACTAGGA TGATAATCAT CAAATCCTTA AATTGCTCGA 3360 

TGAATTTGAC CAAGATTGAT CGTTTCTCGC CTTCTTCGAG TTCATTGTGC CCAAATTCGG 3420 

CAAGGCGCTT TTCCGCCTCA CTTGATGACA AACCTTGCTC GGTCGCATCC ACAGCCTGCA 3480 

AGACCTCTTC AGGGCTCTGA GTATAAAACG CTTGGCGTTT TTGTTCTTTT GACATGTGTC 3540 

TCCTCCTTGA CATTGTGTGC AAAACAGACT CTCTTTCTGT CATAGCTTTT CACGACAAAC 3600 

AAAAAGAAAC CTGTTAATCA TAACAAGTCT CGCTGTTTAA GATAGGGCCG GAAAGCATAC 3660 

TTTTCAGCAT AAAATTCGGA ATGACGACAC TATCACAGGT TTCTGCCAGC TACTCCCTTG 3720 

AGTAGTACCA TTATACCAAA TTTTGGGGAG TTTTCAAAGA GTAAAAACTG CCTTATTTGA 3780 

ATTTTTCCTT GAAAACCAGT ATAATGGTAG AATGCTATGT GACTAGAAAG GAAGTTGAAT 3840 

GAAGCAATCT ATCTCAAATC TCAAGTTAGC TGAGCGTGGA GCCATTATCA GTATTTCGAC 3900 

CTATTTGATC TTGTCTGCAG CCAAATTAGC AGCTGGTCAT CTCCtTCATT CATCCAGTTT 3960 

GGTGGCCGAT GGTTTTAATA ACGTATCGGA CATCATTGGA AATGTGGCCC TCTTAATCGG 4020 

GATTCGGATG GCGCGCCACC TGCAGACCGT GACCACCGTT TTGGTCATTG GAAGATTGAA 4 080 

GATTTGGCAA GCTTGATCAC TTCTATCATC ATGTTCTATG TCGGTTTCGA TGTTCTAAGA 4140 

GATACCATTC AAAAGATTCT CAGTCGGGAA GAAACGGTCA TTGATCCTCT TGGTGCAACT 4200 
CTAGGAATCA TTTCTGCAGC GATTATGTTT GTGGTCTATC TCTACAATAC TCGCCTCAGT * 4260 

AAGAAATCCA ACTCCAATGC GCTGAAGGCA GCTGCTAAGG ACAATCTTTC TGACGCTGTT 4320 

ACCTCACTTG GAACCGCCAT TGCCATCCTA GCTAGTAGTT TCAATTATCC GATTGTGGAT 4380 

AAACTGGTTG CTATCATCAT CACTTTCTTT ATCTTGAAGA CTGCCTATGA TATCTTCATC 4440 

GAGTCTTCCT TTAGTCTTTC AGATGGCTTT GACGACCGCC TGCTCGAGGA CTACCAAAAG 4500 

GCTATCATGG AAATTCCCAA AATCAGCAAG GTCAAATCGC AAAGAGGTCG CACCTACGGT 4560 
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CATGAAATCG CGGATCAGGT CGAGTCTATG CTGGAGGAGC GTTTTGGCGT CTTTGATACC 4680 

GATGTCCATA TCGAACCAGC ACCTATCCCT GAGGATGAAA TTTTAGACAA TGTCTATAAA 4740 

AAATTGCTTA TGCGTGAACA ATTGATTGAC CAAGGAAACC AACTAGAAGA ACTCTTGACT 4800 

GATGATTTTG TCTATATTCG CCAAGATGGA GAGCAGATGG ATAAAGAGGC TTATAAGACC 4B60 

AAAAAAGAGT TAAATTCTGC TATCAAGGAC ATTCAAATTA CTTCCATCAG TCAAAAAACC 4920 

AAACTCATCT GCTATGAGTT AGATGGTATC ATCCATACCA GTATCTGGCG TCGCCACGAA 4980 

ACCTGGCAAA ATATCTTTCA TCAAGAAACC AAAAAAGAAT AGAGAAATCC TTTCATGAGA 5040 

CGGGATTTTT CTATTCTTTT ATACTCAATA AAAATCAAAG TGCAAATTAG GAAGCCGGTC 5100 

ACAGGCTGTA CTTGAGTCGG CAATGTGAAG CCGACATAGT TTGCACTTTG ATTTTCGAAT 5160 

AGTCTTAACT ATCAAATTCA CTGAGATACT CATAGCGTTC GTATTTTTCA AGGAGTGCTT 5220 

CATTTTTCTC ATCCAATTCT TTTTGGAGAG TAGCCAGCTT ACCAAAGTCA GAGCCGTTAG 5280 

CCTGCATTTC CTCTTCAATA GCAGCGATAC GTTTTTCCAA GGTTTCAATA TCACCTTCAA 5340 

TACTTGCCCA CTCCTGCTTT TCTTGGTAGG TCATGCGTTT CTTGTCTTCT CGAACCTTGA 5400 

CCACTTTTTC CTTTTCGGCC TTTTGCACTT GATTGGCCAT ATCTGTTTCA AAAGCTTTTT 5460 

CATCAAGATA GTCGGTGTAA TGACCAAAGA AAGGACGAAT CTTGCCATCC TCAAAAGCGA 5520 

GAATCTTGGT CGCTACCTTA TCCAAGAAAT AGCGGTCGTG ACTGACTGTT AAAACGGGAC 5580 

CTGCAAAACC TTGCAAGAAA TTCTCTAAGA CTGTCAAAGT TGCAATATCT AGGTCATTGG 5640 

TTGGCTCGTC TAAAAGAAGA ACATTTGGTT TTTCCAAAAG CAGTTTGAGG AGATAAAGAC 5700 

GTTTTTTCTC ACCCCCTGAC AATTTCTCAA TCAAAGTCCC ATGCGTCGAA CGTGGGAAGA 5760 

GGAATTGCTC CAGCAACTCA GCGATGGAAG TCGTAGAACC ACCACTGGTC TTGACCTCCT 5820 

CTGCCACTTC CTGCAGGTAA TTGATCACAC GCTTGCTTTC ATCCAAACCC TCAATTTGTT 5880 

GAGAGAAATA GGCGATGCGA ACAGTTTCCC CAATCACAAC TTGTCCTGCT GTCGGCTCAA 5940 

GACTTCCTGC AATCAGGTTA AGTAGGGTTG ATTTTCCAAC ACCATTGTCC CCAACAATTC 6000 

CAATACGGTC TTTAGCCTGA ACTAAGAGAT TAAAATTTTG CAAAATGGGC TTATTTTCAT 6060 

AGGCAAAGGA AACATCCTGA AACTCGATGA CTTTCTTCCC AATCCGACTG GTTTCAAAGT 6120 

TCATAGTCAA GTCTGTCTCA GCACTACTGC CTGAAACTTC CTTTTTCAGA TCATGGAAAC 6180 

GATTGATACG AGCTTGTTGC TTGGTCGCAC GCGCCTGCGG TTGTCTGCGC ATCCAGGCCA 6240 

ATTCTTGTTT GTAGAGTTGT TCTTTTTTGT GAAGAAGAGC CGCGTCGCGC TCATCCTGTT 6300 
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CCGCCTTTAG GCGAACATAG TCCTGGTAAT TTCCCTGGTA CTCGGTCAAG CCTGCACGAT 6360 

CCAACTCGAA AATCCGTGTT GACAAAGCGT CTAAGAAATA ACGATCGTGA GTGATAAAAA 6420 

GGACGGTCTT CTTAGAATTT TTCAAAAAGA GGGTCAGCCA CTCAATAATC GCAATATCCA 6480 

GATGGTTGGT CGGCTCATCC AAAAGCAAGA GGTCGTGGTT GCCAAGTAAG ACTTGTGCCA 6540 

ACTGTACCCG TCTTCTCAGA CCACCTGACA ATTCCCCAAC AGGAGTAGAT AAGTCTTGAA 6600 

TGCCCAATTT GCTAAGAACG GTCTTGACCT GACTTTCGAT TTCCCAAGCT TGGAGAGAGT 6660 

CCATCTCTGC CATGACACGT TCCAAACGCG CCTGCTTGTC CTCACTATAG TCGAGCATAA 6720 

TCAATTCATA CTCACGAATG AGCTGGATTT CCTTGAGTTC ACTAGATAGA ACCGTATCCA 6780 

AAACTGTCTT TCTATCATCA AAATCAGGAT CCTGAGTCAA GTAACCAATC TGGTAATCAT 6840 

TTTTAGCTGA AAAAGGACTG ACATCCCCAT CAAATCCAGA AACACCAGAA AGGACGTCCA 6900 

AAAGGGTGGT CTTGCCAGTC CCATTGACAC CGATTAAACC AATTCTGTCT AAGTCATGGA 6960 

TAATAAAGGA AATATCCCTA AAAACGGTCT TGTCACCAAC GGATTTACTT AGTTTTTCAA 7020 

CGATAAAATC ACTCATTTTT TCTCCCTCAG GTAAGCATGG ATGGCTTCAC GATTATTCTC 7080 

CAATTCTCCA TCGACAATGG CAAACTCAAT CTCTGTTAAA ATCTCTCCCA AGTCTGGGCC 7140 

TGGCTGATAG CCATATTCCT TGATCAAAAT ACCGCCATTA ATCTGAATCT CTTTCTTGTC 7200 

ATGGATAGTC AAGCTTTGGT ATTTTTCTGT GATGGCTTGT GGGTTGACTT CTTTTCCTTG 72 60 

AGCTTGACGA AGATTTTCAG CCTGTAAAAG CAAATCTATG TCAAAGCGAT AACAATCTCG 7320 

CTTGCTCAAT TCTCCATTTT CACGCAGAGC CAAAATAATC AGCAAATCCT GAACTTGCTT 7380 

GGCAAACTGG CGTGAGGTCT TCCAAGATTT CAAAAATGAC TGCGCATTTT CAATCTCCAA 7440 

AGCCCATAGT AAAGCCGCCC AGGCTTGTTC AGAGGATTCA AAAGTAAAAT CAGTCTCCAA 7500 

ATCAAACAGT CTGTTGAGCT TGTCCTGGCT AGATGCCATA TCAGGGAGAT AGTCATAAGC 7560 

TTGACTCTCA ATCATGGAAG CCAAGCCCCT TCTCCAAAAT GGAGCCAGCA AGAGTTTATC 7620 

AAACTCGACG AAGGTACGCT CTACAGAAAT TTTCTCCAAA AGCGGCGTCA AGGTCTTCAT 7680 

AGCTTTAAAT GTTTCTGGCT CAAGTGCAAA ACCAAGACTA GCCTGAAAAC GGAAACCACG 7740 

CATAATCCGT AAAGCATCTT CGTTGAAACG CTCACTAGCC ACTCCAACTG CTCGCAAGAC 7800 

TTGCTTTTCC AAATCTTCTA AACCATGGAA CAAGTCAACG ATTTCTCCTG TCTCATCCAA 7860 

GGCAAAGGCG TTGACTGTGA AATCACGGCG TTTGAGGTCT TCTTCTAGCG ATCGTACAAA 7920 

GGAAACCGCA CTGGGTCTGC GATAGTCCAC ATAGACATCC TCTGTCCGAA AGGTTGTTAC 7980 

CTCATACTCC TCATCCCCAT CTAAGACCAA GACGGTTCCA TGCTCGATTC CGATATCGGC 8040 

TGTTCGCGGA AAAATCTGCT TGGTCTCTTC TGGATAAGAA GACGTCGCAA TATCCACATC 8100 
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GTGGATAGGG CTATGGAGAA GGGCATCTCG AACAGAGCCC CCAACAAAAT AAGCCTCAAA 8160 

GCCTGCTTCT TTAATTTTTT CTAATACTGG TAAAGCCTTC TGAAATTCAG AAGGCATTTG 8220 

CGTTAATCTC ATAATAAGTG TTCTAATCCA TAGACAAGCT CATGACGCTT GACAACTTCT 8280 

TTAATTCCCA AATTGACTCC TGTCATGAAG GAGATGCGAT CATAGGAGTC ATGACGGAGG 8340 

GTCAACCCTT CTCCCTGATT GCCAAAGATG ACTTCCTGAT GAGCTACCAA GCCTGGCAAA 8400 

CGAACTGAGT GGATGCGCAT ACCATCAAAG TCAGCACCAC GAGCACCAGC AATCAGCTCT 8460 

TCCTCATCTG CTGCACCTTG CTGAATTGAC TCTCGAACCT CTGCCATCAA CTCAGCTGTT 8520 

TTAATGGCTG TTCCACTCGG AGCATCCTTT TTCTTGTCAT GATGGAGCTC AATAATCTCC 8580 

ACATTTGGGA AATATTTGGC AGCCTGCGTC GCAAATTGCA TGAGTAAGAC AGCACCCAAG 8640 

GCAAAGTTAG GGGCAATCAG GCCACCCAAG TCTTGGGCAC GAGAAAATTC TTTTAGCTCT 8700 

GCAATTTCTT CACTCGTGAA ACCAGTCGTT CCAACTACTG GAGCAAAGCC ATTTTCAAGA 8760 

GCAAAACGTG TATTTTCGTA GGCAACAGCT GGAGTAGTAA AATCTACCCA GACATCCGCT 8820 

TCAAAACCAG CTAAATCAGC CTTATCCTTG AAAACAGGAA TACCCTGCCA TTCTGACTCA 8880 

GACTCAAAAG GATCCAAAAC TGCCACCAAG TCCAAGTCTG GATCAGTCAA TACCATCTGA 8940 

CAAGCAGCCT GGCCCATCTT TCCCTTAAAA CCGGCAATAA TTACTCGAAT ACTCATCTCT 9000 

ACTCCTGTCT AAGATACAAA GTCCGTAAGA ACACAAAGTG AAAATAGGAA TTCCAATCAA 9060 

GAAGTGTCTA CTTCTTGGAA GAACTATCTT TTTCACACAG GGTTCCAGGC GTGTTCAATT 9120 

ATCAAGATAC AAAGGACCTT AGCTGCCTCT GAAAAATAGG GAATGGCACT GACTTTGCAC 9180 

GAAAGGCAAG ACAGGCATCT TTTTTCAAGA GGCAGGTAGT CCGTGTTCAA TTTCTAAGAT 9240 

ACAAGGCATC TTAACTAGCC TAGAAGCGCC AACTAAATCA CTGGAATATA ACCCAGAGCA 9300 

ATACTTCCTG CTCCTAGGTG CGTTCCAATG ACACTACCAA ATGTAGCAAG TGAAACATCC 9360 

GAACCCAAGC CAAAATCAAG CAAGTGcTGA CGCAATTCTT CAGCCTTTTC AGGAGCATTC 9420 

CCATGAATGA CAATGACCCG GTATTGACCT GAAGCCGTTG TTTCCTTGAT AATTTCAATT 9480 

AAGCGCTTGG TGGCCTTCTT TTCAGTACGA ACTTTTTCGT AAACTTCAAT CACACCTTGA 9540 

TCGTTAAAAT AAAGGATTGG CTTAATGCTA AGCAAATTGC CCAAAATGGC AGCCCCATTT 9600 

GAAAGGCGTC CACCTTTTAC CAAATGATCC AAGTCATCTA CCATGATAAA GGCTGACGTA 9660 

CGGCTGATTT GAATGGCTAG CTTATCCTGA ATGCTGGCAA AATCATCGCC CTGATCACGC 9720 

CAATTAAAGA CGCTTTCAAC CATGATGCCT AGGGGAGCAC TTGTAATCAA AGTGTCTGGG 9780 

AAAGCAATGG TTAAGCCCTC ATAGTCATCG ACCATATACT GGATATTTTG GTAAAAACCT 9840 
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GAAATTCCAG AAGATAGGAA AAGCCCCAAG GCATGTGTAT AGCCTTGTTC TTTGAGCGAA 9900 

GTTAAGATCT CATCTAACTT GGCAATACTT GGTTGACTGG TCTTAGGCAA TTCAGAAGCC 9960 

TGAGCCATTT TTTGGTAAAA TTCCTCAGCA GACAGATTGA TGCCTTCGAC ATATTCCTCA 10020 

CCATCAATAT TGACAGGAAT ATCCAAGACA AACAAGTCTT CTCTTTGCAA GATCTCTGCA 10080 

CTGAGATAAG CAGAGGAATC TGTGAAAACA GCTAATTTCA TATTAGAACT CCAAATTAAT 10140 

TCCTGGTAAG TCTAATGCAA TTTCAGTCAC TTCGTAAGTC AAACGATTGA GCATGTTCAA 10200 

ACATGGACGA GCCAAGGTTT CCACCTCTTC TTGGTTCAAT TCACTTGGTT CATTGACAAT 10260 

ACGGCCATCG ATATGGTTTA CTTGTGAGAT TGTTCCACTA ATGACAAACT TATCAAATAC 10320 

AATCATAAAG CTCAAGATGA CAATCAAGGA AGTCACTTGA TTTTCTTGGT CATGTTGGAG 10380 

CAATTGGAAA TTCACATCCA CCTTGGTTTC AGGAGCTCCA TTTTCATTTT CCCATTCAAA ' 10440 

ATTACGCGCA TCAAAATGAT ACTGACTAAC AAATTCTTGT TCACGTTTAA GATTCATGTC 10500 

TTTCTCCATC GGCTACAATA TTATAAGCTA TTGTACCATA ATTTTTTATT TTCATCTAGT 10560 

TTTCTAGGAT TTAGTCAATC CCAATTTCAG CACGAACTAC ATCTGTGATG GTATCAACAT 10620 

AGTAGTTTAC TTCTTCTGTT GTAGGCGCTT CTGCCATAAC ACGCAAGAGG GGTTCTGTTC 10680 

CACTTGGACG AACAAGGATA CGGCCGTTCC CCGCCATTTC TTCTTCCATC TTCTCGATGA 10740 

TGGCCTTGAT AGCTGGCACT TCCATGGCCT TTTCCTTCAT GACGTTTTCC ACTCGGATAT 10800 

TAACTAATTT TTGTGGATAA ATCGTTACTT CTGCCGCCAA CTCTGATAAG CTCTTACCAG 10860 

TTTCCTTCAT GATTTTAGTC AATTGAACTG CTGATAATTG ACCATCACCT GTGGTATTGT 10920 

AATCCATCAA GATAACGTGA CCAGACTGTT CACCACCAAG GTTGTAGCCT GATTTTCTCA 10980 

TTTCTTCAAC AACGT AG CGG TCACCAACTG CAGTAACTGC CTTGTTAATA CCTTCGCGAT 11040 

TCAAGGCCTT GTGGAAACCA AGGTTAGACA TAACAGTTGT CACAATTGTA TTTTGAGCCA 11100 

ATTGTCCTTT TTCAGAAAGG TATTTTCCGA TGATGTACAT AATCTTGTCA CCATCAACGA 11160 

TGTCACCATT CTCATCAACA GCAATCAAGC GGTCACTGTC TCCATCAAAG GCCAAACCAA 11220 

TAGCTGACCC ACTTTCTTTG ACCACTTCTT GAAGGGCTTC TGGATGTGTT GAACCAACAT 11280 

TAAGGTTGAT GTTAAGACCG TCTGGTGTTT CCCCGATAAC CGTCAATTGG GCACCAAGGT 11340 

CTGCAAAGAT TTGACGGGCA CTGGTAGAAG CTGCTCCATT AGCTGTATCC AAGGCAACCT 11400 

TCATTCCATC AAGAGGAGTT CCAGTTGAAA CAAGGTATCC TTCATACTTA CGCArGCtTC 11460 

TGGATAATCT ACCAAAATTC CTAAGCCTTC TGCACTTGGA CGAGGAAGAG TGTCTTCCTC 11520 

AGCATCTAGC AAGGCTTCAA TTTCTGCTTC TTTTTCATCA TCTAGTTTGA AGCCATCACC 11580 

GCCAAAGAAC TTGATTCCGT TATCAAGGGC TGGGTTGTGG CTAGCAGAAA TCATGACACC 11640 
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GGCACTTGCT CCTTCAGTTT CAACCAAGTA AGCTACTGCT GGTGTTGCAA GGACACCAAG 11700 

TTTGTATACG TGAATCCCTA CTGAAAgGAG ACCTGCCACC AAGGCCGATT CCAACATTTC 11760 

CCCTGAAATA CGTGTGTCAC GTCCTACAAA GACTTTCGGC GCTTCCGTTT CATGTTGACT 11820 

AAGAACATAG CCTCCAAAAC GTCCTAGTTT AAAGGCTAAT TCTGGTGTTA GTTCTAGGTT 11880 

AGCTTCTCCA CGGACTCCAT CAGTCCCAAA ATATTTACCC ATTGTTATAA AATCCTTTTC 11940 

TATTTTTAAT TCGTTTTTGA ACTAGTTGCT TTCGTTGACG AAGATGTCTC CGATGAACTG 12000 

CTTGTACTTG AATTTGATGT GCTTGAACTT GGTGCTACTG GTTTTGTAGT CACCTTCATT 12060 

ATTGTATCAA ACGGAGTGAT AACTGCCGGT AAGACAACAC CATTGCGGTC GATTGCCTGC 12120 

AAAGGTACTG AACCACTGTA ATTACCTGTT ATACGTTCGC TAGTTGGCAA AACAGCGATA 12180 

ATCTTATCAA TTCTATCCAA TGTCTCTTGG TCACTCGTAA TAGACACTTC TTTATCTGAC 12240 

ACCATGACAT TTTCAATTTG TACCCGACTA TCAATTTGAC TAGGGTCAAT CTCTGGTACA 12300 

ATCTTTACCT TATCCTTCTG AGCCTTCTTA CCAATCTTGA CTGTAATTTT TTGCGGAGTC 12360 

GCCACAGCGG TCAGCCCATT GGGTAAATCT TCAATGCTCA AAGGAACTTC AATCGTTCCA 12420 

ACACCGGCAT CTGTTAGGTC AGCAGTAACC TTGAATTTAC GTGTACTTTC TTGCATTTCA 12480 

CTAGCTAGCG ATAGGCGATT TGCACCAGTC AAGACCACTG ATACTTCTGA AGCAAAACCG 12S40 

CTAATAAAAT ACTTATCACT ATTATAGCGT ATGTCAATAG GGACATTTGT T ACTGT ATT A 12600 

GTATAGGTTT CCGTTTTTAC CTGCCTAGCA CTGGTACTGT TTTGAAAATT XTGTCGCCGTA 12660 

GCATAGACAA ATAAGACACA AGCAAAAAAG AGTGAGGATA TGATATATAA ACTATTTTTT 12720 

TTCATGTTTC CATCCTCCTA GCAATCGTTC TTTAAAACTA AGACCCACTT CCTCTTTTGG 12780 

AAGTAAGATT TCACGTAATT CTGTTTCAAA TTCATCAAGT GTTAGGTTGT GCTTAAACCT 12840 

TCCATTATAG GTTATCGAAA TTCCTCCCGT TTCCTCTGAT ACGACAAAAG TCAAGGCATC 12900 

TGAGACTTCT GATAAACCGA TAGCCGCCCG GTGTCTGGTC CCAAATTCCT TGGAAATCCC 12960 

TGTGTTTTTT GTCAAGGGCA GATAGGCAGA CGTCACAGCG ATACGTTCTT CTTTGATAAT 13020 

CACCGCACCA TCATGTAGGG GAGTGTTGGG AATAAAAATG TTAATGAGAA GTTCTGCAGA 13080 

AATCTTAGCA TCCAAGGGAA TTCCTGTCGA AATATACTCC TGCAAGGTAC GTACACGCTG 13140 

AATAGCAACC AAGGCCCCGA TTTTACGAGG ACTCATGTAT TCAACAGACT TAACAAAGGC 13200 

ACGAATCATC TGTTCCTCAG CACTAATAGG GGCATTGGAA AAGAAATCTG TCGCTCTTCC 13260 

CAAACGTTCC AAACCAGTCC GAATCTCTGG AGAGAAGATA ACAACCGCCG CAATAACCCC 13320 

ATAAGTAATA ATTTGATTGA TTAACCAAGA AATCGTAGTC AAACCAATCA TATTTGCAAG 13380 
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GATTTGAGCT 


AAAATAAACA 


CCAAAACTCC 


ACGTACCAAA 


ATCATAATCT 


TGGTTCCTGC 


13440 


AATAGCTTTT 


GTAAAATGGT 


ATAAAATATA 


AGCAACAATC 


AAAATATCAA 


TCAGATTGAT 


13500 


AGCTATCGTC 


CATGGACTTG 


CAAACAAACT 


GGTCCAATAT 


TGCAGATTGG 


ATAATTGTTG 


13560 


AAAATTCATC 


CCTGATATCC 


TCCCTATCAA 


AACACTTTCG 


TCCTATTATA 


CCATTTTCTG 


13620 


GCATTTTTTT 


CCCTATCCTA 


GTCCATTTTA 


CATTGAACAA 


AAATATGATA 


AAATAAACTG 


13680 


ACTAAAAAAA 


ACAAAGGAGA 


AACTATGTCT 


CAACTCTATG 


ATATTACCAT 


TGTGGGTGGT 


13740 


GGTCCTGTCG 


GGCTTTTTGC 


AGCCTTTTAT 


GCCCACCTAC 


GCCAAGCCAA 


GGTTCAAATC 


13800 


ATCGACTCTC 


TTCCCCAGCT 


AGGTGGACAA 


OCTGCTATTC 


TCTACCCTGA 


AAAGGAAATC 


13860 


CTAGACGTAC 


CAGGCTTCCC 


AAACCTGACT 


GGAGAAGAGT 


TGACTAACCG 


CTTGATTGAA 


13920 


CAGCTAAATG 


GATTTGATAC 


CCCTATTCAT 


CTCAATGAAA 


CGGTTCTTGA 


GATTGACAAA 


13980 


CAAGAAGAAT 


TTGCCATCAC 


AACTTCTAAA 


GGAAGTCACC 


TGACTAAAAC 


AGTTATCATC 


14040 


GCTATGGGTG 


GCGGTGCCTT 


CAAACCACGT 


CCGCTGGAAC 


TTGAAGGGGT 


TGAGGGCTAT 


14100 


GAAAATATCC 


ACTACCACGT 


TTCTAACATT 


CAGCAATACG 


CTGGTAAGAA 


AGTGACGATT 


14160 


CTTGGTGGGG 


GAGACTCGGC 


TGTGGATTGG 


GCTTTGGCTT 


TTGAAAAAAT 


CGCACCAACT 


14220 


ACCCTTGTTC 


ACCGCAGAGA 


TAATTTCCGT 


GCCTTGGAAC 


ACAGTGTTCA 


AGCCTTGCAA 


14280 


GAATCATCTG 


TAACCATCAA 


GACACCATTC 


GCCCCTAGCC 


AACTCCTTGG 


AAATGGAAAA 


14340 


ACACTTGATA AACTTGAAAT 


CACAAAAGTC 


AAATCTGATG 


AAACTGAAAC 


CATTGACCTA 


14400 


GACCACCTCT 


TTGTCAACTA 


TGGTTTCAAA 


TCTTCTGTCG 


GTAACCTTAA 


AAACTGGGGG 


14460 


CTCGACCTCA ACCGTCACAA -GATTATCGTC AACAGCAAAC AGGAATCCAG 


1 CC AAGC AGGT 


14520 


ATCTATGCTA 


TCGGTGACTG 


CTGCTACTAT 


GACGGAAAAA 


TTGATCTGAT 


TGCGACAGGC 


14580 


CTCGGAGAAG 


CTCCAACTGC 


TGTCAACAAC 


GCTATCAACT 


ACATTGACCC 


TGAACAAAAA 


14640 


GTACAACCAA 


AACACTCTAC 


TAGTTTATAA 


AAAAGAACCA 


CGAGTCACAT 


AGGATTCGTG 


14700 


GTTTTATAAT 


TCATCCGCTA 


TCTTATTGAT 


TTTTCTGAGT 


CTGTGATTGA 


CACCACTTTT 


14760 


GGTCAGAGGG 


GTGCTGAGAC 


TATCTGCTAA 


CTGCTGGATA 


GAGTAGTCTG 


GGTGCTGAAT 


14820 


CCTCAGTTGC 


GCCACTTCCT 


GCAAATCTAC 


TGGCAAATTT 


TCTAAGCCCA 


TGATATCTTT 


14880 


GATTTTACTG 


ATATTGTTAA 


TGGTCTTCAT 


GCTGGCAGAA 


ACTGTCCGAG 


CGATATTAGC 


14940 


TGTCTCGGCA 


TTATTAGCCC 


GATTGAGGTC 


GTTACGGGTT 


TCTCGCAAAA 


TCTTAACCCG 


15000 


CTCAAAATCA 


TCACGTGCCT 


GCATGGCTCC 


TATTACTATC 


AAGAAGTCCA 


TAATGTCTTC 


15060 


TGCTCGCTGG 


AGATAGGTCA 


CAGCCCCCTT 


CTTGCGCTCA 


AGCACCTTGG 


CATCCAGTAA 


15120 


AAACTGTTGG 


AGAAGGGAGG 


CAATTCCTTG 


CGCGTGGTCC 


AGATAAACAG 


AACTGATTTC 


15180 
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CAACTGGTAC 


TTGCCTGACT 


CAGGGTCACG 


AATGCTCCCA 


TTTGCCAAGA 


AAGCGCCACA 


15240 


GAGATAGGCA 


CGACCTGCTT 


CCTCATCCGA 


TAAAATCGCC 


TCATCAATAC 


CTGTTTCCAG 


15300 


GCCAAAGAAA 


GAGTCTGCCA 


AGTGCAAATC 


ACTTAACAAA 


TCCTGCACCT 


TTTCATCTGT 


15360 


AAAAACGGTA 


TAGACGCGAT 


TCTTGCGAAG 


ATTGCTCCGT 


TGGTGGTGAC 


GAATTTCAGA 


15420 


TTTGATTTCA 


TAGAGATGGA 


GAAAGGACTC 


ATAGAGGTGA 


CGGGCCAGTT 


TGGCATTTTC 


15480 


TGTCACAACT 


GACAAAGTCA 


AGCCCGAAGT 


CGAGAGACCG 


ATGCTACCAG 


ACATTTTGAT 


15540 


AATGGCAGAT 


AATTCATGCC 


AGCTCAGATG 


GTGTTGGCCC 


AGGATTTCTT 


CTTTTACTGC 


15600 


TACTGTGAAA 


CTCATTTTTT 


CACCTGTATA 


ATGCGCATCA 


ACTCGTCCAC 


AATCAAATCT 


15660 


CCATCGTGGA 


AGGCACCGCC 


ATTTTCCAGA 


CGAAGGAAGT 


TAGATGAAAT 


CACGCGCGAA 


15720 


ACTTGCTTAC 


AAAGACCTAC 


AAAATCGTGT 


TCCACTTGCA 


CTAAGTATTC 


ATCAAAACGG 


15780 


TTGGAATTCA 


TGTATTCCTG 


AGGCACTTTT 


TCAATATTCA 


CCAAGACAGT 


GTCGATAAAA 


15840 


GGGCGACCAA 


GGTGACGATG 


CAAGACTTCC 


ACGTGGTCGC 


TATCTGTAAA 


GTGTTCCGTC 


15900 


TCCCCACGTT 


GGGTCATGAT 


ATTGCAGACA 


TAGGCAATTT 


CTGCCTTGGT 


TTCCAAAAGA 


15960 


GCCCGCCCAA 


TTTCCTTAAT 


CACGATATTG 


GGCAAAATAG 


AGGTAAAGAG 


GGAACCTGGC 


16020 


CCTAGGACAA 


TCATGTCACT 


TTCAAGGATG 


GTCTGCACTA 


CTCGACGGCT 


GGCCAGAGGC 


16080 


GTATCATCGT 


TTAGGGCATT 


GGTCACATAG 


ACATTGTCAA 


TTATGCCTCG 


ATGGTCTACA 


16140 


ATATGACTCT 


CTCCAGCCAC 


TTCTGTCCCA 


TCCTGAAAGA 


CTGCATGAAG 


GGTCAAAGGA 


16200 


TGGTCACTGG 


AAGGATAAAT 


TTTCCCTGTT 


GTATGGAAAA 


ATTTGCTCAA 


TAACTGCATG 


16260 


GCATTATAGG 


TTGAACCCTG 


CATTTCTGAC 


AAGCCAGCAA 


TGATGAGATT 


TCCCAATGGA 


16320 


TGGCCAGCAA AGGCTCCGGC ATCCTCAGAG AACCGATACT 


GAAAGACCTT 


CTCATAAAAC 


16380 


TTAGGCATAT 


CCGACATGGC 


CACAAGGACA 


TTACGAAGAT 


CACCTGGCGG 


TGTCAACTGT 


16440 


TGCATATTTT 


TTCGGAGTTC 


ACCTGAAGAA 


CCACCATCAT 


CTGCCACCGT 


CACGATAGCT 


16500 


GCGATTTCCA 


CATCTTTTTC 


CCGCAGACTT 


TTTAGAATGA 


CGGGACTTCC 


AGTCCCTCCA 


16560 


CCAATCACCG 


TTATCTTTGG 


TTTTCTCATG AACGGTTTAC 


CGTTTCCTTT 


CTGCGGTCTT 


16620 


TGTCGCGATG 


CCCTTCATTA 


ACAGACCAAT 


TCTTGGATAA 


GTCCTGCGCC 


AAGCGTTTAG 


16680 


CAAATGCCAC 


ACTACGGTGT 


TGTCCACCCG 


TACATCCCAT 


GGCAATGGTC 


AAAACGGACT 


16740 


TACCTTCCTT 


TTGGTAACTT 


GGCAGAATCG 


GCTCAATCAA 


GGCCAATAAA 


TGTTGATAAA 


16800 


AGTCTTCTGA 


CTCAGGATGG 


TTCATGACAT 


AATCATAAAC 


AGGTTCATCC 


ACACCCGTTT 


16860 


GGTTTCTCAG 


TTCTGGTAAA 


TAATAGGGAT 


TTGGCAAGAA 


ACGGACATCA 


AAGACCAAGT 


16920 
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CCGCATCAAT CGGGATTCCA TACTTAAATC CGAAAGACAT GACTTCGATA CGGAAAGACT 16980 

GGGCTTGTTC TTGGTCTGAA AACTGCTCTG CAAGGGTTTT GCGCAgCTCA CGTGGAGTGA 17040 

GTTCAGTCGT ATCCACCACA TTTTGGCTCA TATTTTTCAA AGGTGCCAAG AGTTCACGTT 17100 

CCAACTTGAT TCCATCTAAA ATACGACCGT CTGCTGCTAG TGGGTGACTC CGTCTGGTTT 17160 

CCTTGTAACG AGCGACCAAT TCCTTATCAG CCGCATCCAA AAAGAGGATT TTGAAATCCA 17220 

AACCATCTTG ATTTTCCAAC TCATCCAAAA CAGCTTGAAT CTCTGAAAAG AAAGAACGGC 17280 

TACGCATATC CACTACCAAG GCCAACTTAG GATTGTCTTC CTTAATTTCA ACCAGCTGCA 17340 

AAAACTTAGG CAAGAGAGCT GGCGGCATAT TATCAATGGT GAAATAACCT AGATCCTCGA 17400 

AGGACTGAAT GGCTACAGTT TTCCCTGCGC CACTCATCCC TGTCACAATC ACCAAGTGAA 17460 

GTTGTTTCTT TGTCATCTTT TTCTCCTTAT ATCAAAAGAA GTTTGGCAAC ACCAAACTTC 17520 

AACTAGCTTA TCCAATCTCT GCGATGACTT CAATTTCGAC TTTT AC ATCA • CGAGGAAGAC 17580 

GAGCTACCTC CACAGCTGAA CGAGCTGGGA ATTCCTCTTT GAAGGCCGTT TGGTAAACCT 17640 

CATTAAAAGG AACAAAGTCG TTCATATCGC TCAAGAAGCA AGTTGTTTTG ACAACATGGT 17700 

CAAAGTCTGT TCCTGCTTCT GCCAAAATAG CACCGATGTT TTTCAAGACT TGCTCTGTCT 177 60 

GTTCTTGGAT ATTCTCTCCT ACAATTTCCC CAGTTTCAGG GGATAGGGGA ACTTGACCGC 17820 

TAGCAAACAA AAGGTTGCCA ACGATTTTTC CTTGAACATA GGGTCCGATA GCCTTTGGGG 17880 

CCTTATCTGT ATGAATTGTT TTTGCCATTT TCTTTTCCTC ACAATTTTTC TAAGATTGCA 17940 

TCCCAAGCCT CATCCATCCC TGCCTTACTG ACAGATGAAA AGAGGATGAA ATCGTCACTC 18000 

GGGTCAAAGT TTAATTTCTT TTTGATTGCT GATTCATGCT TGTTCCATTT ACCACGAGGA 18060 

ATCTTGTCCG CCTTGGTCGC CACAATGATG ACTGGAATCT CATAATACTT GAGAAATTCG 18120 

TACATCTGCA CATCATCTGC TGACGGGTCA TGACGAAGGT CAACTAGACT GAGAACCGCA 18180 

CGGAGATTTT CCCGAGTCGT TAAGTACTCC TCAATCATGC ACCCCCACTT TTCACGTTCC 18240 

TTTTTAGAAA CACGAGCATA GCCATAACCA GGCACATCCA CAAAGCGCAT CTTGTCATCA 18300 

ATGTTAAAAA AGTTCAGGAG CTGGGTTTTA CCAGGTTTTC CTGATGTACG GGCGAGATTC 18360 

TTACGGTTCA ACATAGTGTT GATAAAGCTG GATTTACCAA CATTTGAACG CCCTGCTAGG 18420 

GCAATCTCTG GCAGTTCATC CTGCGGATAG TGGGACTTAT TAGCTGCACT GAGCAAGATT 18480' 

TCAGCATTGT GTGTATTAAG TTCCATAGTC ACCTCTAGGC TGTTTCTAGG ATCGGTTTAT 18540 

CCGTTCCATC TACAGTTTCT TTAGTGATGC GAACCAATTT CACATTTTCC TGACTCGGCA 18600 

CCTCAAACAT GACATCTAGC ATGGTTTCTT CGATGATGGA GCGAAGTCCA CGCGCCCCTG 18660 

TCTTCCGTTC GATTGCTTTA TTAGCAATCT CTTGAAGGGC TTCGTCGTCA AATTCCAACT 18720 
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CAACATCATC 


ATAAGAAAGC 


AAGGTTTGGT 


ATTGTTTCAC 


CAAGGCATTT 


CTTGGCTCTT 


18780 


TCAAGATGCG 


AACCAAGTCA 


TCAACGGTCA 


ATTGCTCAAG 


AGCCGCAAAA 


ACAGGCAAGC 


18840 


GTCCAATCAA 


CTCAGGGATA 


ATACCAAATT 


TTTGAATGTC 


TTCAGCGATG 


ATTTCTTGCA 


18900 


TGTATGAGCT 


GTTTTCGTCA 


ATCGCCTTAT 


TATTTTGAGC 


AAATCCGATG 


ACTTTTTCAC 


18960 


CCAGACGTTG 


TTTGACAATT 


TCTTCAATAC 


CATCAAAAGC 


ACCACCCACG 


ATGAAGAGGA 


19020 


TATTTTTTGT 


ATCCACTTGA 


ATCATCTCTT 


GTTGTGGATG 


TTTGCGTCCA 


CCTTGAGGCG 


19080 


GTACGCTAGC 


AACAGTTCCC 


TCAATAATCT 


TGAGAAGGGC 


TTGTTGCACC 


CCTTCACCAG 


19140 


AAACATCACG 


TGTGATAGAC 


ACATTCTCAC 


TCTTCTTGGC 


AATCTTGTCA 


ATTTCATCCA 


19200 


CATAGATAAT 


GCCACGCTCT 


GCACGTTCGA 


TGTTAAAGTC 


AGCAACCTGC 


AAGAGTTTGA 


19260 


GGAGGATATT 


TTCCACATCC 


TCACCCACAT 


AACCAGCCTC 


CGTCAGAGCT 


GTCGCATCCG 


19320 


CAATAGCAAA 


AGGTACATTC 


AAGCTCTTAG 


CCAAGGTCTG 


GGCAAGGAAA 


GTTTTCCCTG 


19380 


AACCAGTTGG 


GCCAATCATC 


AAAATGTTTG 


ACTTCTGCAA 


ATCCACATCT 


TCTGACTCTT 


19440 


CGCGTGTATC 


GTGGAAATTG 


ATGCGTTTGT 


AGTGGTTATA 


AACCGCCACT 


GCCAAGGCAC 


19500 


GCTTGGCACG 


ATCTTGACCA 


ATTACATAGT 


GGTTCAAGAT 


ATGGAGGAGT 


TCAATTGGTT 


19560 


TTGGCACCTC 


AGACAAGTCT 


GCCAAGACTT 


CCTCAACCAA 


TTCTTCTCGA 


ATGATTTCCT 


19620 


GAGCTAACTC 


CACGCATTCA 


TTACAAATAA 


AAGCATTGTT 


GCCAGCAATT 


ATTTTTTGTA 


19680 


CTTCTTCTTG 


GTTTTTGCCA 


CAAAATGAGC 


AATAAACCAT 


CATATCATTT 


TTTCTATTTG 


19740 


TAGACATGAT 


TTCCTTCCAT 


TCTATACTGT 


CATTCTATCT 


AAAATAAGGT 


CATGTAAAAA 


19800 


GCATGAATAC 


TATTGACCAG 


ATTGGTAAAG 


GCATTTAACC 


AAAGGAGGAT 


AGAAAGCCCG 


19860 


TAACGCTTTT 


TACGAAAAGC 


TTGTGCTCCT 


GCCAGAAAGC 


AGATGAAACA 


CAGAAAAGCC 


19920 


GTGAATAGAC 


CAAATAAACT 


CCGTTCCATT 


AGACTTCCTT 


TCTCTTGCGG 


TATTGGATGG 


19980 


TAAAATCATA 


AGGATTCTTC 


TCATCTTTGG 


CGTAAAATTT 


GCTTGAAACT 


GTCTCAAAAA 


20040 


GAGACAAGTC 


AAGTTCTTCA 


GGGAAATAGG 


TATCTCCTTC 


CACCCGAGCA 


TGAATGTGAG 


20100 


TGACAATCAC 


TTCATCAAGG 


TAAGGTTCAA 


AAGCCTGAAA 


AATTTGCTTC CCAGCGATAA 


20160 


TGTAGAGATT 


CTTTTCTTGA 


GCCTGATACC 


AGTCAAGAAC 


AGACTGGACG 


TCCTGAAAAG 


20220 


TAGCAACCCC 


ATCTATCTTT 


TCTTCCGGAT 


TACGCGTCAA 


AATCAAGGTT 


TCCCGTTTTG 


20280 


GAAGCAAGCG 


ACGCCCCATC 


CCATCAAAGG 


TCACACGCCC 


CATCAAGATA 


GCATGATTCA 


20340 


GAGTTGTTTC 


TTTAAAGTGC 


TGCAATTCTG 


CTGGCAAATG 


CCAAGGCAGA 


CGATTTTCCT 


20400 


TACCAATCAC 


ACCCTCTTCA 


TCCTGGGCCC 


AAATAGCTAC 


GATTTTCTTA 


GTCATGCTTC 


20460 
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CATCCTTTTC ACTGATAGTA CTATTTTATC AAAAAACTCA AAAAAAGACT GGTTTGGAAT 20520 

AGCTTACAAA ATAGAAAAAA TCTGTAAGAA ATTTCCTACA GATTTATCTA TGTTTCCTTA 20580 

TTTCTTACAA ACCAGGTGCT TGTCCAAGTT CGGCTGCAAG CATCCAAATT GTTTTATCTG 20640 

TTTCAGTTTT AGCGCCTGCA AAGATACCGT TTGTCACATC GTCACCTTCT TCATCAGTGA 20700 

CATCCAAACC TTTTTGGAAA AGTTCTGACA AGTAACGGTA GATAACAAGA ACACGTTCCA 20760 

AGCTTTCTTC AACATTACGG TATTCACCAG CTTCTTCTTC GATTTCACTA TTTTGAAGGA 20820 

ACTCTGTCAA TGTAGAGAAT GGGCTTCCAC CGAGTGTAAT CAAGCGTTCA CTGATTTCAT 20880 

CCAATTGACC GTCAAGAGCT TCCATGTACT CATCCATTTT TGGATGCCAT ACAAGGAAAC 20940 

CACGACCATG CATATACCAG TGCACTTGGT GCAAAGCAAC GTGAGCTACA TACAAATCAG 21000 

CAACAGCTTG GTTCAAGACT TCCTTTGTTT TTGCCAATGC 21040 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2387 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
(DJ TOPOLOGY: linear 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

ATTCTTAATA CGATTAAAAG GCTTATTACT AAAAGAAAAT TTCAGTTAGA TGAACTAAAC 60 

TTGCTCGTCA AATCCCGATT TAACGAGATG TTTGGGGAAA ATAAAATATT TGAAAGCATT 120 

GATAACTTAT TTGATATTAT AGATGGTGAT AGGGGCAAAA ATTATCCTAA ATCAGATGAG 180 

TTGTTTAGTG AGGAGTACTG TTTATTTTTA AATACAAAGA ATGTTACTAA AAACGGATTT 240 

TCATTCGATA CAAAGCAATT TATCACTAAA ACAAAGGATA AATTACTTCG AAAAGGCAAA 300 

CTTGAGCGTT ATGATATAGT CTTGACAACA AGAGGTACTG TTGGAAATGT AGCGTACTAC 360 

GATGAATTAA TAAAATAT.AA ACATTTACGT ATAAATTCAG GTATGGTAAT ATTACGTCCC 420 

AAGACACCAA ATCTAAATCA GAAATTTATT ATCCATGTTT TAAGGAATAA TAATTATAGT 480 

CGAGTGATAT CAGGAAGTGC TCAGCCTCAG TTACCAATTA CAAAATTAAA AAAAATACTT 540 

CTCCCCCTCC CCCCACTAGC CCTCCAAAAT GAGTTCGCAG ACTTTGTAGT CCAGGTCGAC 600 

AAATCACAAT TGGCAATCCA AAAATCTCTG GAAGAACTTG AAACTTTGAA GAAATCTCTG 660 

ATGCAGGAGT ATTTTGGCTG ATATTCTGCC ATTGTAATTA CGGTAATGAT TTGTTATAAT 720 

ACTTCAAAGG AGGAAATCAG ATGGTAGTAA AAACAAGAAA ACAAGGAAAT TCAATCACCA 780 

TTACGATTCC AAGTGAATTT AATATTCCAA GTGGTGTTAA ATACGAAGCG AAATTGTTAC 840 



WO 98/18931 



PCTYUS97/19588 



503 

CAAGTGGTGA GATTATCTTT ACTCCTGAAG AATTGGGGCA GCAGGTTTCT TATGTATCTG 900 

ATGATGCCTT TGACTTAAAT TTAGATAAAA TATTTGACGA ATACGACGAT GTTTTCAAAG 960 

CTTTGGTGGA AAAATGACAA TCTATTTGAC AGAAAAGCAA ATTGAAAAAA TAAATGCTTT 1020 

AGCAATTCAA CGGTATTCTC CAAATGAGAA AATTCAAACA GTTAGTCCTT CTGCCTTAAA 1080 

TATGATTGTG AACTTACCAG AACAATTTGT CTTTGGGAAG CCTCTTTATC CAACAATTTT 1140 

TGATAAAGCA ACGATACTAT TTGTCCAATT GATAAAGAAG CATGTTTTTG CTAATGCTAA 1200 

TAAAAGAACT GCTTTCTTCG TTTTGGTCAA ATTTTTACAA TTAAACGGCT ATCGTTTTTC 1260 

TGTAACGGTA GAAGAAGCAG TAAAAATGTG TGTAACCATC GCAGTAGAAG CTTTAACTGA 1320 

TGAAAAAATG ACAAGCTACT CCAAATGGAT TTCTGAACAT TCTGTTAGAG AAAAGGTCAA 1380 

AAAGTAACCT AGTATGCTGG ATTTGAATGA GCACAAGAAA ATAAATGAAC AGACAATATT 1440 

AGAATTCTGT AATGCAGAAA CTGATATTGT CTCTTTTTAT TGATGAATAA GAAAGTGAGA 1500 

AATTATGGAA TCAAAAGTTA CAATTATCAT GCAAGAAATG TTACCTCTTT TAAATAATGA 1560 

ACAATTACTA GCGTTGAGAG AGAGTTTAGA ACATCATCTA GTAGACGGAA AAAAGCAGCA 1620 

GAAGTATTCG AATAATAACC TGTTGCAACT ATTTATTACC GCCAAGCAGG TAGAGGGCTG 1680 

TAGCTCAAAA ACAATTCGTT ATTATCAGAG GACGATTGAA AACTTGTTTA ATGCTATTAA 1740 

AGAGTCTGTG ACACAACTCA CAACAGATGA TTTAAGGAGT TATTTAGCAA ATTACCAGTC 1800 

TGAAAAGGAT TGTAGTAAGG CAAATTTAGA CAATATTAGG CGTATATTGT CTTCTTTTTT 1860 

TGCTTGGCTT GAGCAAGAGG ATATATCATT AAAATTCCCA TTCGACGGAT ACAGAAAATT 1920 

AAGACTGAGC AAAATGTGAA GGAAACTTAT ACTGATGAAC ATTTGGAAAT TATGCGTGAT 1980 

AACTGTGAAA ATTTGAGAGA TTTGGCAATA ATAGACCTAC TAGCATCGAC AGGTATGCGT 2040 

GTAGGGGAGC TTGTACAGTT GAATCGTTCA GATATTGATT TTGAAAACAG AGAGTGTGTT 2100 

GTCTTTGGTA AAGGAAAGAA GGAGAGACCA GTATATTTTG ACGCTCGTAC GAAAATTCAT 2160 

TTAAGAAATT ATCTTAACGA CAGAAAAGAT AGTCACCCTG CTCTTTTTGT AACGCTAGTT 2220 

GGAAAAGTCC AGAGGCTTGG AATTGCTGGT GTAGAGATTC GCTTAAGAAA GTTAGGAGAC 2280 

AAACTCGGCA TACAAAAGGT TCACCCACAT AAGTTCAGAA GAACTTTAGC GACTAAGGCA 2340 

ATTGATAAAG GTATGCCTAT CGAACAAGTC CAAAAACTGC TAGGTCA 2387 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10669 base pairs 
{B) TYPE: nucleic acid 
(C) STRANDEDNESS i double 
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(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

ATATTAAAGC GACTTTCTGT GCGCTAGGGA AAAATGTTCC TGGGAATGAG GACTTGGTGA 60 

AGAGGATAAA ATCTGAAGGT CATGTTGTTG GAAACCATAG CTGGAGCCAT CCGATTCTCT 120 

CGCAACTCTC TCTTGATGAA GCTAAAAAGC AGATTACTGA TACTGAGGAT GTGCTAACTA 180 

AAGTGCTGGG TTCTAGTTCT AAACTCATGC GTCCACCTTA TGGTGCTATT ACAGATGATA 240 

TTCGCAATAG CTTGGATTTG AGCTTTATCA TGTGGGATGT GGATAGTCTG GACTGGAAGA 300 

GTAAAAATGA AGCATCTATT TTGACAGAAA TTCAGTATCA AGTAGCTAAT GGCTCTATCG 360 

TTTTGATGCA TGATATTCAC AGTCCGACAG TCAATGCCTT GCCAAGGGTC ATTGAGTATT 420 

TGAAAAATCA AGGTTATACC TTTGTGACCA TACCAGAGAT GCTCAATACT CGCCTAAAAG 480 

CTCATGAGCT GTACTATAGT CGTGATGAAT AAGCAAGAAA AAATAGGTCT GTTAGATATT 540 

TGACAGACTT ATTTTTTACA GAATATAGTA CTACTTAAAA AATGTTTTAT GCTATAATTG 600 

ATGAATAAAA TAGAAGGAGA AGCATATGAA TACCTATCAA TTAAATAATG GAGTAGAAAT 660 

TCCAGTATTG GGATTTGGAA CTTTTAAGGC TAAGGATGGA GAAGAAGCCT ATCGTGCAGT 720 

GTTAGAAGCC TTGAAGGCTG GTTATCGTCA TATTGATACG GCGGCGATTT ATCAGAATGA 780 

AGAAAGTGTT GGTCAAGCAA TCAAAGATAG CGGAGTTCCA CGTGAAGAAA TGTTCGTAAC 840 

TACCAAGCTT TGGAATAGTC AGCAAACCTA TGAGCAAACT CGTCAAGCTT TGGAAAAATC 900 

TATAGAAAAA CTGGGCTTGG ATTATTTGGA TTTGTATTTG ATTCATTGGC CGAACCCAAA 960 

ACCGCTCAGA GAAAATGACG CATGGAAAAC TCGCAATGCG GAAGTTTGGA GAGCGATGGA 1020 

AGACCTCTAT CAAGAAGGGA AAATCCGTGC TATCGGCGTT AGCAATTTTC TTCCCCATCA 1080 

TTTGGATGCC TTGCTTGAAA CTGCAACTAT CGTTCCTGCG GTCAATCAAG TTCGCTTGGC 1140 

GCCAGGTGTG TATCAAGATC AAGTCGTAGC TTACTGTCGT GAAAAGGGAA TTTTATTGGA 1200 

AGCTTGGGGG CCTTTTGGAC AAGGAGAACT GTTTGATAGC AAGCAAGTCC AAGAAATAGC 1260 

AGCAAATCAC GGAAAATCGG TTGCTCAGAT AGCCTTGGCC TGGAGCTTGG CAGAAGGATT 1320 

TTTACCACTT CCAAAATCTG TCACAACCTC TCGTATTCAA GCTAATCTTG ATTGCTTTGG 1380 

AATTGAACTG AGTCATGAGG AGAGAGAAAC CTTAAAAACG ATTGCTGTTC AATCGGGTGC 1440 

TCCACGAGTT GATGATGTGG ATTTCTAGAA AATCATAAAA AGAATTGTAC ATTATTCTAA 1500 

TTTTTGATAT AATAGTCAGC AGGAAAGAAA GTCTTATGGC GTTCTTCAAG CGAGCTTGGG 1560 

ATAGTGGGAG CCAAGTAGGG CAAAATAAAG GGCTGGCGCT TTCTGTAGTA TTTTCAAAAA 1620 
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CAATGAAGTA ATAAATTAGG 


GTGGAACCGC 


GTTTCTGACG 


CCCCTAGGTT 


AAATCAACCT 


1680 


AGGATTGTCA 


un I Vjt l uVj 1 Ik. 


TTTTGCTTAT 


TCAGTCTATT 


GTGTGAAAGA 


AAGGAGAGCC 


1740 






AAAAGACGAT 


AGTCAACTAG 


CTACATTTCG 


TGATTTTGTA 


1800 


fypa aria hats. 


V. 1\>AAAA(j 1 1 


GAAAGATTAT 


CAATCTTTTT 


TAAAGAATGA 


ACTTGCAGTC 


1860 


1 Vj 1 u/\ ill riV». 


VAiU AAUU 1 A3 1 


TATTTGGTCA 


GATTTTAATG 


CTGCTACACA 


GATTATTAGG 


1920 


unAAu 1 ivj 


1 i\.L*AACCTA 


TACAAATAAT 


AGACGAGTGG 


TTATGACGCC 


TGATTTAGCT 


1980 


uil luuAAAu 


AATTGTATTT 


GTATCAGTTG 


ATGGACTACG 


AGTGTTCTGA 


GCAAACTCAA 


2040 


CjLAATAGAAA 


GTCACTATCA 


TTCTTTATCT 


GAAAATTTCC 


TCTTACAGAT 


TGTAGGACAT 


2100 


IjALal IALj(JTt- 


ATTGGTCGGA 


CATTTTTTAG 


ATGATTTTGA 


TGGTTATGAC 


TCTTATATCT 


2160 


GGTTCGAAGA 


GGGGATGGTT 


GAATATATTA 


GTCGCAAGTA 


TTTCTTGACA 


GAAGAGGAAT 


2220 


TTCAAGCGGA 


AAAAATTTGT 


AATCAATCTC 


TCGTAGAACT 


TTTTCAGAAG 


AAGTATAGTT 


2260 


GGCATTCATT 


GAATGATTTT 


GGTTCTTCGA 


CTTATGATAA 


GAACTATGCA 


AGTATTTTTT 


2340 


A rG AATACTG 


GCGCAGCTTT 


TTGACAGTAG 


ATAAGTTGGT 


AGAAAATTTA 


GGTAGTGTAC 


2400 


A A /"'/"'/'""/'"•m/TPfn 

AAtaCGGTCTT 


AGATTCTTAT 


CATTTATGGG 


CAAATACAGA 


AAAAACTTTT 


CCCTTGTTAG 


2460 


ATTGGTTTGT 


TCAGCAGAAA 


TTAATTGAAA 


AAGAAATATA 


AAAACTAAAG 


GAGTAAACAA 


2520 


1 \» 1 1_ i AAuM 


ATTAACATTT 


CACTGCATCA 


GTGGCAGAGA 


CCTCCTTACA 


GTCGGGCTGC 


2580 


I UL.AL.tjU 1 t_A 


GCACTAGAGT 


GCCTGAGCTA 


GACGCAGTAC 


TAACTCGTCT 


TGCCTCGTAT 


2640 


IjA 1 VLaAUVaAla 


GCAGACTCGT 


GTCGCAAGTA 


ATTATTTTTT 


ATTAAGGAGT 


ATTCAATGTC 


2700 


i nnuiuvi 1 1 A 


ACATTTCACT 


GCGTCAGTGG 


CAGAAACCTC 


CTTACAGTCG 


GACTGCCCTA 


2760 


CGCTCAGCAC 


TAGAGTGCCT 


GAGCTAGACG 


CAGTACTAAC 


TCGTCTTGCC 


TCGTATAATC 


2820 


GACGAGGCAG 


ACTCGTGTCG 


CAAGAAATTA 


TTTTTTATTA 


AGGAGTATTC 


AATGTCTAAG 


2880 


AAATTAACAT 


TTCAAGAAAT 


TATTTTGACT 


TTGCAACAAT 


TTTGGAATGA 


CCAAGATTGT 


2940 


ATGCTTATGC 


AGGCTTATGA 


TAATGAAAAA 


GGTGCGGGGA 


CAATGAGTCC 


TTACACTTTC 


3000 


CTTCGTGCTA 


TCGGACCTGA 


GCCATGGAAT 


GCAGCTTATG 


TAGAGCCATC 


ACGTCGTCCT 


3060 


GCTGACGGTC 


GTTATGGGGA 


AAACCCTAAC 


CGTCTCTACC 


AACACCACCA 


ATTCCAGGTG 


3120 


GTCATGAAGC 


CTTCTCCATC 


AAATATCCAA 


GAACTTTACC 


TTGAGTCTTT 


GGAAAAATTG 


3180 


GGAATCAATC 


CTTTGGAGCA 


CGATATTCGT 


TTTGTTGAGG 


ACAACTGGGA 


AAACCCATCA 


3240 


ACTGGTTCAG 


CTGGTCTTGG 


TTGGGAAGTT 


TGGCTTGACG 


GAATGGAAAT 


CACTCAGTTC 


3300 


ACTTATTTCC 


AACAAGTCGG 


TGGATTGGCA 


ACTGGCCCTG 


TGACTGCGGA 


AGTTACCTAT 


3360 
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GGTTTGGAGC 


GCTTGGCTTC 


TTACATTCAA 


GAAGTAGACT 


CTGTCTATGA 


TATCGAGTGG 


3420 


GCTGATGGTG 


TAAAATACGG 


AGAAATCTTT 


ATCCAGCCTG 


AGTATGAGCA 


CTCAAAATAT 


3480 


TCATTTGAAA 


TTTCGGACCA 


AGAAATGTTG 


CTTGAAAACT 


TTGATAAGTT 


TGAAAAAGAA 


3540 


GCTGGTCGTG 


CATTAGAAGA 


AGGCTTGGTA 


CACCCTGCCT 


ATGACTATGT 


TCTCAAATGT 


3600 


TCACATACCT 


TTAATCTGCT 


TGACGCGCGT 


GGTGCCGTAT 


CTGTAACAGA 


GCGTGCAGGC 


3660 


TATATCGCTC 


GTATCCGTAA 


CTTGGCCCGT 


GTCGTAGCCA 


AAACCTTTGT 


CGCAGAACGC 


3720 


AAACGCCTAG 


GCTACCCACT 


TTTGGATGAA 


GAAACAAGAG 


CTAAACTCCT 


AGCAGAAGAC 


3780 


GCAGAATAAA 


GAGAGTGACA 


AATTACGAAA 


ATGGGCGAAC 


AGAGTGAGCC 


CTGAGCCAGT 


3840 


TGCCGCAGTG 


ATGAAGGTAT 


CCTTAGTGAA 


ACTAAGGATA 


CTAGGCAAAA 


TTGGAGACTT 


■ 3900 


TTGGCTCCAA 


TTTTAGCAAT 


GAAACAACGA 


AGTTGGTTGC 


TTGCGTGCCA 


ATCACATAAG 


3960 


GCAAACTGGA 


AAATAAAAAG 


ATACTTTTCG 


GAGAAAAAAC 


ATGACAAAAA 


ACTTATTAGT 


4020 


AGAACTCGGT 


CTTGAAGAAT 


TACCAGCCTA 


TGTTGTTACG 


CCAAGTGAAA 


AACAACTAGG 


4080 


CGAAAAAATG 


GCAGCCTTCC 


TCAAGGGAAA 


ACGCCTGTCT 


TTTGAAGCCA 


TTCAAACTTT 


4140 


CTCAACACCA 


CGTCGTTTGG 


CTGTTCGTGT 


AACTGGTCTT 


GCAGACAAAC 


AGTCTGATTT 


4200 


AACAGAAGAT 


TTCAAGGGTC 


CAGCAAAGAA 


AATTGCCTTA 


GATAGTGATG 


GAAACTTCAC 


4260 


CAAAGCAGCT 


CAAGGATTTG 


TCCGTGGGAA 


AGGTTTGACT 


GTTGAAGATA 


TCGAATTCCG 


4320 


TGAAATCAAG 


GGTGAAGAAT 


ATGTCTATGT 


CACTAAGGAA 


GAAATTGGTC 


AAGCAGTTGA 


4380 


AGCCATTGTT 


CCAGGCATTG 


TGGATGTCTT 


GAAGTCACTG 


ACTTTCCCTG 


TCAGCATGCA 


4440 


CTGGGCGGGA 


AATAGCTTTG 


AATACATCCG 


CCCTGTTCAC 


ACTTTAACTG 


TTCTCTTGGA 


4500 


TGAGCAAGAG 


TTTGACTTGG 


ATTTCCTTGA 


TATCAAGGGA 


AGTCGTGTGA 


GTCGTGGCCA 


4560 


TCGTTTTTTG 


GGACAAGAAA 


CCAAGATTCA 


GTCAGCATTG 


AGCTATGAAG 


AAGACCTTCG 


4620 


TAAGCAGTTT 


GTAATCGCAG 


ATCCATGTGA 


ACGTGAGCAA 


ATGATTGTTG 


ACCAAATCAA 


4680 


GGAAATTGAG 


GCAAAACATG 


GTGTACGTAT 


CGAAATTGAT 


GCGGATTTGC 


TGAATGAAGT 


4740 


CTTGAATTTG 


GTTGAATACC 


CAACTGCCTT 


CATGGGAAGT 


TTTGATGCTA 


AATACCTTGA 


4800 


AGTTCCAGAA 


GAAGTCTTGG 


TGACTTCTAT 


GAAGGAACAC 


CAGCGTTACT 


TTGTTGTTCG 


4860 


TGATCAAGAT 


GGAAAACTCT 


TGCCAAACTT 


CATTTCTGTT 


CGTAACGGAA 


ACGCAGAGCG 


4920 


TTTGAAAAAT 


GTCATCAAAG 


GAAATGAAAA 


AGTCTTGGTA 


GCCCGCTTGG 


AAGACGGAGA 


4980 


ATTCTTCTGG 


CGTGAAGACC 


AAAAATTGGT 


GATTTCAGAT 


CTTGTTGAAA 


AATTAAACAA 


5040 


TGTCACCTTC 


CATGAGAAGA 


TTGGTTCTCT 


TCGTGAACAC 


ATGATTCGTA 


CGGGTCAAAT 


5100 


CACTGTACTT 


TTGGCAGAAA 


AAGCTAGTTT 


GTCAGTGGAT 


GAAACAGTTG 


ACCTTGCTCG 


5160 
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TGCAGCAGCC ATTTACAAGT TTGACTTGTT GACAGGTATG GTTGGTGAAT TTGACGAACT 5220 

CCAAGGAATT ATGGGTGAAA AATACACCCT TCTTGCTGGT GAAACTCCAG CGGTGGCAGC 5280 

TGCTATTCGT GAACACTACA TGCCTACATC AGCTGAAGGA GAACTTCCAG AGAGCAAGGT 5340 

CGGCGCAGTT CTAGCCATTG CAGACAAATT GGATACGATT TTGAGTTTCT TCTCAGTAGG 5400 

ATTGATTCCA TCAGGTTCTA ATGACCCTTA TGCCCTTCGT CGTGCAACTC AAGGTGTGGT 5460 

TCGTATCTTG GATGCCTTTG GTTGGCACAT TGCTATGGAT GAGCTGATTG ATAGCCTTTA 5520 

TGCATTGAAA TTTGACAGTT TGACTTATGA AAATAAAGCA GAGGTTATGG ACTTTATCAA 5580 

GGCTCGTGTT GATAAGATGA TGGGCTCTAC TCCAAAAGAT ATCAAGGAAG CAGTTCTTGC 5640 

AGGTTCAAAC TTTGTTGTGG CAGATATGTT GGAAGCAGCA AGTGCTCTCG TAGAAGTAAG 5700 

CAAGGAAGAA GATTTTAAAC CATCTGTTGA ATCACTTTCT CGTGCCTTTA ACCTGGCCGA 5760 

GAAGGCAGAA GGGGTTGCTA CGGTTGATTC AGCACTATTT GAGAATGACC AAGAAAAAGC 5820 

TTTGGCAGAA GCAGTAGAAA CACTCATTTT ATCAGGACCT GCAAGTCAGC AATTGAAACA 5880 

ACTTTTTGCG CTTAGCCCAG TCATTGATGC TTTCTTTGAA AATACTATGG TAATGGCTGA 5940 

AGATCAGGCT GTCCGTCAAA ATCGTTTGGC AATCTTGTCA CAACTAACCA AGAAAGCAGC 6000 

TAAGTTTGCT TGTTTTAACC AAATTAACAC TAAATAAAAT TTGATAAACG GACTTTATCT 6060 

TATTACAAAG GAGAAGAAAT GGATCCGAAA AAAATTGCTC GTATCAATGA GCTTGCTAAA 6120 

AAGAAAAAAA CAGAAGGCTT AACACCAGAA GAAAAAGTGG AACAAGCCAA ACTACGTGAG 6180 

GAGTACATCG AAGGTTATCG CCGCGCTGTT CGTCACCACA TTGAAGGAAT CAAAATTGTG 6240 

GACGAAGAAG GAAACGATGT TACACCAGAA AAACTACGCC AAGTACAACG TGAAAAAGGA 6300 

TTACATGGCC GTAGTCTTGA TGATCCAAAT TCATAATAAT ACTCTTCGAA AATCAAATTC 6360 

AAACCACGTC AGCTTCACCT TGCCGTACTT AAGTACAGCC TGCGGCTAGC TTCCTAGTTT 6420 

GCTCTTTGAT TTTCATTGAG TATATGTATT CTTTCTTTTA ACAAAGATAG ATGAAACGAT 6480 

AACAAAGAGA CTAGCAGTTT GTGTTTGCTA GTCTTTTTTC GCTAAAAAAG GAACCATAAT 6540 

GGTTCCTAAA AACTATCATT AGTAACTTGC ACCGGCTGTA GCGTCTGCGT CACCACCGTG 6600 

GCCTCCAGCA TCCCCTGAAT CAGAAGCGCC AGAAGTAGCA TCGGCGTCTC CATGACCTCC 6660 

GGCAGCAGGA GCAAATGGTC CGCTACCACC CACCAAACGT TGACCAGTCT CTTTTAGGTA 6720 

CCAGTCAAGC CATGGTTGGA AGTTAAAGAC GATTTCATTG ATACCAGCGT ATGATCCATC 6780 

AGGATAGTAC ATTGCTTGGT AGTTGTGAGT GTTGATAACA CCTGCAGGAG AACCTGGAAC 6840 

GATCGTACGG ACGTATTCTT GGTTTCCGTT GCGAAGTGTT CCGATAACCC ACTCTACGTT 6900 
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CTTCATACGT GCTGGTGGAA GAGAACCATG AACAGTCGAC ATACGGCTAC CTGATTGAGG 6960 

TGGTACACGT TTAGCGAACA TAGTGTCTGG ATCTTGGTGA GCGTTGTTGT AGTAGAGGAA 7020 

TTGGTTGTTG TCGTCAGCGT ATGTCAATTC AAATGGCATA GCTTTCAAGA ACATATCAAT 7080 

TTGGTTAACT GTTAGGATAC CGTGGTCCAA TTTGACATAG GTATCACCAG AAACAGCACC 7140 

AGTGAATGCT GCAACTTTTT CTACCCATTC TGGATCGTCA GGGTCAACTT CTGTGATGGT 7200 

TGTAGCGATT GGTTTTCCAC AATCCAAGTC TTCTGATTCG ATTGGTTTTG GTTTTTTCAA 7260 

TTTCGAAACG ACTCCTACGT ATTTAACAAA GTTATCTAAG CAAGTTTCAA GGAATTTAAC 7320 

AGTGCCTTCG TTGGTGATAT TTCCGTTGTT ATCAAAAGCT TCCTTAGCTT TACCAAGAAG 7380 

GAATTCGTTA CCTGGAAGCG TGTAGGCATT AACACCTGGA GCATCAAGGA TTTTACGAAG 7440 

GTGAACTTGA GCACGTGATG TTCCTTGGTC ATAGTATGAT GCACCCACAA TCATAACAGG 7500 

CTTGTTTTCA AATGGATGAA CTTCGTATGA AAGCCATTCA AGTACAGATT TGAGTGAAGC 7560 

TGAGATAGTG TGGTTATGCT CAGGAGTAGC AATGATAACA CCATCTGCAC GAGTAATTTT 7620 

GTTATATAAA TAACGTAATT GGAAACTTTC ATCCCATTTT TCATCTTGGT TAAACATTGG 7680 

AACTTCGTCA ATTTCAAGAA CTTCTAATTC AAATTTGAGT TTGAAGTAGC GACGGATAAA 7740 

TTCCAAGAGC TTACGGTTAT ATGATTGATC GTAGTTTGAT CCAACAAGTC CAACAAATTT 7800 

CATTCTTTTT GGTCTCCTAT CTTACAAATT TTCCCAGTCA AAGTCTTCAG CATCTTTGCG 7860 

AAGTAATTCT TGTGCATTAC GTAATTTTTC TGTGATTTTT ACAAAGATAC GGAAGTCATC 7920 

AAAGATGGCA TCCAATTTCT TGATAACATC AAGGTCAACC AAGTCGCCAC TTGGGTTAAA 7980 

TGCTTGAAGA GAGTGTGAGA GCAAGAATTC ATCTGGAAGA ACATTTGCCT TGATTTCAGG 8040 

AGCATTCAAG ATTTGACGAA GTTGCAATTG GGCACGAGAT GAACCAAGCG TACCGTAAGA 8100 

AGCACCTGTA ATCATGATTG GTTTGTTCAA AAGTGGGTAA ATACCATAAG ACAACCAAGC 8160 

AAGAGCGCTC ATCAAAACAG CTGGAATAGA GTGATCATAC TCAGGAGTAC CGATAATAAC 8220 

GCCATCTGCC TCTTCGATTT TAGCAGCAAT TTCCAATATT TCAGCAGGTA CTTGCTTGTC 8280 

AGCTGGTTTG TTGAAGACAG GAATGGCCTT GATTTCAACA AGTTCAATTT CAGCTTTGTC 8340 

AGTAAAGTGT TTTTGCATGT ATTGAAGCAA TTGACGGTTT GTAGAACGTT TTGAATTTGT 8400 

TCCAACAATA GCAATAAGTT TTAACATGAG ATTTCCTTTC TCTTTTTACA TAATACAATT 84 60 

TTAAAATTCC ATTGAAACAG TTGTCTCTAT AGAGTAGGAA TTCCTGAAGA ACAGCTTAGG 8520 

TGGCCTTCTT TATCGATGAG GATGACTTCG ATGCCCTCCA AACTTTCGAC TTGCCAGAGG 8580 

ATAGAAGCAG GTCTTTCTCC AAAGAGTCGA GTCGTCCAGA TTTCGCCATC GACTGATTTA 8640 

TCAGAGATGA TTGTTAGACT CGCTAGTTCC GTTTCAACAG GATATCCTGT TTGACTGTCA 8700 



WO 98/18931 



PCT/US97/19588 



509 . 

AAAATGTGAT GGTAATCTTG TCCATCGACG GTCAGGTGAC GTTCATAAAT GCCTGAAGTC 8760 

ACGACAGATT TATTGACAAC AGGGATGGTC ATTAAATGAT TTCCCCTAGG ATTGGCTGGG 8820 

TCTTGAATCC CGATTTGCCA TGGGTTATCC CCTCTTGCCT GATTTTTTCC AATGGTCAGG 8880 

ATATTCCCTC CCAGATTGAT CAAGGCAGAA GTCACCCCCT CTTTCCTAAG AAATTGGGCA 8940 

ACCTTATCCG CACTGTATCC TTTGGCTAAA CAACCTAGAT CGATCTTCAT TCCTTTCTGT 9000 

TTTAAAAACA CAGTAGAAGT AGAAGAATCT AACTCGATAC CATGAGGATT GATTAGAGGC 9060 

AGCACCGATT CAATTTCTTG AGGCTGGGCG ACCTTGGCAT CTGAAAAACC GATACGCCAG 9120 

GTTTGAATTA AGGGACCAAT GCTGATATTG AGGTGGCTAG AGAGCGCTAG GCTATGCTCT 9180 

AACCCAAGTG AAATCAGCTC AAACAGGTCT GGATGAACCG TGACGGGGGC TATTCCTGCT 9240 

TGATAATTGA TTTCCATCAA CTCAGATTCT TGACTATTGG CGTTGAAGCG GTATTCAAGT 9300 

TCTTTGAGCA AGTCAAAGGA TTTTTGGAGA AAGATATCGG CTTGCTCATC CACTAATGAA 9360 

ATAGTGATAG TAGTCCCCAT TAGCCGTTCA GAATGTGAAC GAAGAGTCAA GCTACCAACT 9420 

CCTTTCTCTT ATAGAAAATA AGTTGTAATA TCAAATAATC ATCTAAATTG AAGCCCTTAC 9480 

ATTTCATTTT CATGTTATTA TAATACCATA AAGTTAGAAT TTTCACAAAC AAAATTTGGA 9540 

AAAAGTCAAG AAATATGCTC ATAAAATTCA TCAGGCTTGA AAACAGGATA AATGGGGAAT 9600 

TATTTTTGAT AAAAAATGCT GAAATAATAG TACCCCCCTT GTAAACGCTA ACGGTAAATG 9660 

GTATACTAGT AAGGTAAATT TAGAATGAAG GCAGGAAATT TTTATGAGTA AAATCGTTGT 9720 

AGTCGGTGCT AACCACGCTG GTACAGCATG TATCAATACC ATGTTGGATA ATTTTGGAAA 9780 

TGAGAACGAA ATTGTTGTAT TTGACCAAAA CTCTAACATC TCTTTCCTAG GATGTGGAAT 9840 

GGCTCTTTGG ATTGGTGAAC AAATTGACGG TGCTGAAGGC TTGTTCTATT CTGATAAAGA 9900 

AAAATTGGAA GCTAAAGGTG CTAAAGTTTA CATGAACTCA CCTGTTCTTT CAATCGACTA 9960 

TGATAACAAA GTAGTTACAG CGGAAGTTGA AGGAAAAGAG CACAAAGAAT CATACGAAAA 10020 

ATTGATTTTC GCTACAGGCT CTACACCAAT CTTGCCACCA ATCGAAGGTG TTGAAATTGT 10080 

TAAAGGAAAC CGCGAATTTA AAGCAACTCT TGAAAACGTA CAATTCGTGA AATTGTACCA 10140 

AAATGCTGAA GAAGTTATCA ATAAACTTTC TGACAAGAGC CAACACCTCG ACCGTATCGC 10200 

CGTTGTTGGT GGTGGTTACA TCGGTGTTGA ACTTGCTGAA GCCTTTGAAC GTCTTGGAAA 10260 

AGAAGTTGTC CTTGTTGATA TCGTTGATAC TGTCTTGAAC GGTTACTATG ACAAAGACTT 10320 

CACACAAATG ATGGCGAAGA ACTTGGAAGA TCACAACATC CGCTTGGCTC TAGGTCAAAC 10380 

TGTTAAAGCA ATCGAAGGTG ACGGTAAAGT TGAACGCTTG ATTACTGACA AAGAAAGCTT 10440 
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TGACGTGGAT ATGGTTATCC TTGCAGTTGG TTTCCGTCCA AACACAGCCC TTGCAGGTGG 
TAAGATCGAA CTCTTCCGCA ACGGTGCCTT CCTTGTAGAC AAGAAACAAG AAACATCTAT 
CCCAGACGTT TACGCTGTTG GTGACTGTGC GACTGTTTAT GACAATGCTC GTAAAGATAC 
AAGCTATATC GCTCTTGCTT CAAATGCTGT GCGCACTGGT AACGTTGGT 
(2) INFORMATION FOR SEQ ID NO: 58: 

ti) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7542 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



10500 
10560 
10620 
10669 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

CGCGCTAATA GATACTTTAT GATAGAATAA AGAACAAGAT TGACAAGTAA GAGGAAACAT 60 

TATGCAAAAT CAAACACTCA TGCAATACTT TGAATGGTAT CTGCCCCACG ACGGTCAACA 120 

CTGGACGCGT CTGGCTGAAA ATGCTCCACA CCTAGCTCAT CTGGGGATCA GTCACGTCTG 180 

GATGCCACCA GCCTTCAAGG CAACCAACGA AAAAGATGTC GGCTATGGGG TCTATGACTT 240 

ATTTGACTTA GGAGAGTTCA ACCAAAAAGG GACTGTCCGC ACCAAGTATG GTTTCAAAGA 300 

AGACTATCTT CAAGCCATTC AAGCCCTTAA AGCACAGGGA ATTCAACCTA TGGCCGATGT 360 

AGTTCTCAAC CACAAGGCTG CTGCCGATCA CAGGGAAGCC TTTCAGGTTA TCGAAGTTGA 420 

TCCTGTAGAC CGTACAGTTG AACTTGGAGA ACCCTTCACC ATCAATGGCT GGACTAGTTT 480 

TACCTTCGAT GGTCGCCAAG ATACCTATAA TGGCTTCCAC TGGCATTGGT ACCACTTCAC 540 

CGGTACAGAC TACGATGCCA AACGCAGTAA ATCTGGGATT TATCTGATCC AAGGGGACAA 600 

CAAGGGCTGG GCCAACGAGG AATTGGTCGA TAACGAAAAC GGAAACTACG ACTACCTCAT 660 

GTATGCCGAC CTAGACTTTA AACATCCTGA AGTCATCCAA AACATCTATG ACTGGGCTGA 720 

TTGGTTCATG GAAACGACTG GTGTAGCTGG TTTCCGTTTG GATGCCGTTA AGCATATTGA 780 

CTCTTTCTTT ATGCGCAACT TCATCCGCGA TATGAAGGAA AAATACGGTG ACGATTTCTA 840 

TGTTTTTGGT GAATTTTGGA ACCCAGACAA GGAAGCCAAT CTGGACTATC TCGAAAAAAC 900 

GGAAGAACAC TTTGACCTTG TCGATGTTCG TCTCCACCAG AATCTCTTTG AAGCCAGTCA 960 

AGCTGGCGCA AACTATGACC TTCGTGGCAT TTTCACAGAT AGCCTGGTTG AACTCAAGCC 1020 

TGACAAGGCT GTGACTTTTG TCGACAACCA CGATACCCAA CGAGGACAAG CCCTTGAGTC 1080 

TACCGTTGAA GAATGGTTCA AGCCAGCAGC CTATGCCCTC ATTTTGTTAC GCCAAGACGG 1140 

CCTTCCATGT GTCTTTTACG GAGACTACTA TGGGATTTCA GGGCAGTATG CTCAAGAAGA 1200 
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TTTCAAAGAA ATCCTTGACC GCCTCCTAGC CATCCGAAAA GATTTGGCCT ATGGAGAACA 1260 

AAATGACTAC TTTGACCATG CTAACTGTAT CGGTTGGGTA CGTTCAGGTG CTGAAAATCA 1320 

ATCCCCAATC GCAGTCCTTA TCTCAAATGA CCAAGAAAAC AGCAAGTCAA TGTTTGTCGG 1380 

TCAAGAATGG ACTAATCAAA CCTTTGTAGA TTTACTTGGT AACCACCAAG GTCAAGTTAC 1440 

AATTGATGAG GAAGGTTATG GACAATTCCC TGTCTCAGCT AGATCCGTAA GTGTCTGGGC 1500 

AGTCAATACC ATCTAATAGC TGATAATAAC CAAGCTAGGT CCAAGCGGAT TTGGCTTTTT 1560 

TGTATTCACA AAAAGACCTA CCCAAATGGA TAGATCTTTA CTTGATTACA ATTTACCTGC 1620 

TACTGCATCC AACAATTCTT GGATCTTAGG TTGGTTGCTT CCTCCTGCCA TGGCCATATC 1680 

TGGTTTACCA CCACCACGTC CATCGATGAT TGGTGCTAAT TCTTTGACAA GGTTTCCTGC 1740 

ATGAAGGTCT TTTGTCTTGC TTGCTACAAG GACATTGACT TTGTCACCGA TAGCGGCAAC 1800 

TAGGACAAGA AGATCAGAGT AGTCTTTTTG TTTCCAGTTA TCTGCAAAAG TACGAAGGGC 1860 

ACCGGCATCG GATACAGACA CTTGACTAGC AATGTAACGA TGACCGTTGA CTTCCTTAAC 1920 

ATCTTTGAAG ATATCGCCTG CGGCTGCAGC TGCGGCTTTT TCTTTCAACT CAGCATTTTC 1980 

TTTTTGAAGT TGACGAAGTT GTTCTTGAAG TCCTTCTACC TTGTGAGGTA CTTCCTTGAC 2040 

TTGAGGTGCT TTCAAGGTTG CTGCGATAGC TTTAAGAGCA TCCTCTTGTT CACGATAGGC 2100 

TTCAAAGGCT TCCTTACCAG TCACTGCCAA GATACGGCGA GTTCCTGAAC CGATTCCTTC 2160 

TTCTTTGACA ATTTTGAAGA GACCAATCTC AGAAGTGTTG TCAACATGAG TACCACCACA 2220 

AAGTTCAATA GAGTAGTCAC CGATAGTCAC GACACGAACT TCCTTGCCGT ATTTCTCACC 2280 

AAAGAGGGCC ATAGCTCCCA TTTCTTTAGC AGTGTCAATA TCCGTTTCAA CTGTCTTCAC 2340 

TTCAAGTGCT TCCCAAATTT TCTCGTTAAC TTGCTGTTCA ATCGCACGAA GTTCCTCAGC 2400 

AGTTACTGCT TGGAAGTGGG TAAAGTCAAA GCGAAGGAAT TCAACTTCGT TAAGAGATCC 2460 

TGCCTGTGTT GCGTGGTTTC CAAGGATATT GTGAAGGGCA GCGTGAAGCA AATGAGTCGC 2520 

AGTGTGGTTT TTCATGACAC GGTGACGGCG ATTGCTATCA ATTGCCAAGG TATATTCTTG 2580 

GTTCAAGGCA AGCGGTGCAA GGACTTCAAC TGTATGAAGG GCTTGACCAT TTGGGGCTTT 2640 

CTGAACATTG GTCACAGTAG CCACAACCTT ACCTGACTCA TCCAAGATTT GTCCGTAGTC 2700 

AGCTACCTGT CCACCCATTT CAGCATAAAA TGACGTTTCC GCAAAGATAA GAGAGGCAGT 2760 

TCCTTCTGAA ACAGCTCCTA CTTCTGCATT GTCAGCAACG ATAGCTACCA ATTTAGAAGA 2820 

CAATTGGCTA GCATTGTAGT TGAAGACACT TTCTACAGTG ATGTTTTGAA GAGTTTCATT 2880 

TTGCATACCC ATTGAGCCAC CCTTGACAGC TGACGCACGC GCGCGTTCTT GCTGTTCTTT 2940 
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CATGGCTGCT TCAAAACCTT CACGGTCTAC AGTCATACCA GCTTCTTCAG CGATTTCTTC 3000 

AGTCAATTCA ACTGGGAACC CATAAGTATC ATAGAGTTTG AAGACATCTG AACCAGCGAT 3060 

AACAGATTGA CCTTTTTCTT TCAAGTCTGC TACAATGCCT TGGGCAAAGT GTTGACCTGA 3120 

GTGAAGGGTA CGGGCAAATG ATTCTTCTTC GCTCTTAACG ATTTTCTCAA TAAAGTCACG 3180 

TTTCTCAAGC ACTTCTGGGT AGTAGCTTTC CATGATTTTT CCAACAGTTG GAACCAATTT 3240 

GTAAAGGAAA GGCTCGTTGA TACCCAATTT TTGACCATGC ATAGAAGCAC GACGGAGAAG 3300 

ACGACGAAGA ACATAACCAC GACCTTCATT TCCTGGAAGG GCACCATCAC CGATAGCAAA 3360 

TGAAAGAGAA CGAATGTGGT . CTGCGATAAC CTTGAAGCTC ATGTTGTCGC CATCTTGGTC 3420 

ATAAACCTTA CCAGACAATT TCTCGACTTC ACGGATAATC GGCATGAAGA GGTCCGTTTC 3480 

AAAGTTGGTC TTAGCCCCTT GGATAACGGC CACCAAACGC TCCAAACCAG CGCCCGTATC 3540 

' AATGTTCTTA TGTGGCAATT CCTTGTATTC GCTACGAGGA ACAGCAGGGT CTGCGTTAAA 3600 

TTGTGACAAA ACGATGTTCC AGATTTCAAT ATAACGGTCG TTTTCAATAT CTTCTGCAAG 3660 

CAGGCGAAGA CCGATATTTT CTGGGTCAAA GGCTTCCCCA CGGTCAAAGA AGATTTCTGT 3720 

ATCTGGTCCA GAAGGTCCCG CACCGATTTC CCAGAAGTTG TCCTCAATTG GAATCAAGTG 3780 

ACTTGGATCC ACTCCCACTT CAATCCAGCG GTTGTAAGAA TCTTTATCGT CTGGATAGTA 3840 

GGTCATGTAA AGTTTTTCAG CAGGGAAATC AAACCATTCA GGGCTTGTCA AAAGCTCATA 3900 

AGCCCAAGTG ATAGCTTCGT CACGGAAGTA ATCCCCGATA GAGAAGTTCC CCAGCATTTC 3960 

AAACATGGTA TGGTGACGCG CGGTCTTCCC TACGTTTTCG ATGTCGTTGG TACGGATAGC 4020 

CTTTTGGGCA TTGGTAATAC GTGGATTTTC AGGGATAATG GTCCCGTCAA AGTATTTCTT 4080 

AAGGGTTGCT ACCCCAGAGT TGATCCACAA AAGAGTTGGG TCATTTACAG GAACCAAACT 4140 

TACTGATGGT TCTACTGAGT GACCTTTGGT CGCCCAGAAA TCAAGCCACA TTTGGCGTAC 4200 

TTGTGCACTA GATAGTTGTT TCATATTGTC TCCTTATTCA CTTGTTTAAT GTGATTGGCT 4260 

TTCCAGCATT TCCACATAGT CAATCGCGAC ACAGAGGGAA ATGACTAGGT CTGCATAAGC 4320 

GTCTTCAAGA ACCGTTACGG TATAGGTAGA AGTCAGATGG AAGAGTTCCT TCTTAATTTC 4380 

CGCAATCAAC TGATCGCGAT CATCCAGCAA TTTGAAATTC AAATCCCAGA TATTGCCCTC 4440 

GATACGAAGA CCTAGATTAT CAAACTCATA CTTATCTCGC CAGAAGGTCA ACTTCTTACG 4500 

AATGACAAAA CTCGAGCCAT CCCGAAGCTG AATTTCAAAA CGAGGAAGCA AGGTCAAGAT 4560 

TTCTTTACTA ATCTCACTGA CTTGTTCACC AGCCGCATCA TAGATGGTAA AGGTTTTAGG 4620 

AATCTTAAAA AATGATCCCT CCACCTGATA GGCAATTTCT CCCCTGTCAT CCTTGATAGC 4680 

GAAGCGTTCG CCTCCAAGAC GAAACTTTTG TTTGACAAGA AATGTTTTCA TCAACACCTC 4740 
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CAAAAATCAA AAGACAAGCT CATATCACGA AGGGCGAAAA ACCGCGGTAC CACCTTCATT 4800 

CAATGAACTT GTCATTCTCT TGTTCTTATG CAATTGTATG ATTGAGTAGC ATGACTTCCT 4860 

AGCTTAGATG GCTCGCAGCA CCGCCATTTC TCTGGACTAA GACAAGTGAA AATCAATTCT 4920 

CAACTTTCTT ATTATAACGT TTTTTTAAGC TTGCGTCAAC TGGAAATGAT CTCCGTTGAA 4980 

TTAGACCAAT TCCCTACATC TCTGATTACT TTTTCAGGAT ATATTTTTTC TTACTGCCAT 5040 

TTTTCTTTTT ATCCCAAATT TTCATATTAC TAAACACAGC TACTAGAATA TTTCCAAATA 5100 

TAAAGGTGCC TATCACCCAA TATATGGACT CAGTTGTTAG GTATTGTCGA TCCAAGCCAT 5160 

CCTTTAAATG GAATAGTATA GCAGTTTGGT TAACAATCAT AAAGGTTGGC CAGAAACTTT 5220 

TTTTGAAAAA AGTAGACATT TTCATTATTT GTTGCCGCTT TCTGTAAGGT TAATACTCAA 5280 

TAAAAATCAA AAAGCAAACT AGGAAGCTAG CCTCAAGCTG TACTTGAGTA CGGCAAGGCA 5340 

ACGCTGACGT GGTTTGAAGA GTATAGGCTT AGTATACTAC TAGGCAAGCA AATAAACAAA 5400 

TAAACAACTA GAATAGAAAA AGATAGGGCT CTAAAAACTG ACTTCTATTC CTTAAAAACG 5460 

AACCAGCTTG ACTGATTCGT CTTCTTACGT TTATCTCCTA CTTCCGATAC ATTTTAAACT 5520 

GTAGGAAGAG GTCGCTATAT TTCCCTGTCC ATTTATGGTC AAATTTCTCA TAAACTTCTA 5580 

GGTGTTTCAT GGTTTCAACA TCGGGATAGA AGGCCTTATC TTCCTTTGTT TCCTCTGGGA 5640 

GCAATTCCTT CGCTGGTAGG TTTGGTGTTG AATAGCCGAC ATACTCCGCA TTTTGGAGAG 5700 

CATTTTCAGG TTTCAACATA AAGTTGATAA AGGCATAGGC TGAGTTTTGG TTTTTAACTG 5760 

TTTTGGGAAT GACCATATTG TCAAACCAAA GATTGCTGGC CTCTGTCGGT ACCACATAAC 5820 

GTAGATTTTC ATTTTTTTCT AACATTTGGC TGGCTTCACC AGAGAAGGTC ACGCCGATTG 5880 

CAACATTATT CTGAATCATA TAGCCCTTCA TCTCGTCCGC AACGATAGCC TTGATATTTG 5940 

GAGTCAGTTT GTAGAGCTTA TCCACTGTCT CTTCCAACTG CTGCAGATCC TTGGAGTTGA 6000 

GGCTGTAGCC GAGGGAATTG AGTCCTAGTC CCAGCACCTC ACGCGCCCCA TCAAAGAGCA 6060 

TGATAGAATT CTTATACTCC GGCTTCCAAA GGTCATCCCA ATGCTCAGGC GCTTCATCTA 6120 

CCATGGTTTC GTTGTAGACA ATTCCTAAGG TTCCCCAGAA GTAAGGGATG GAGAATTTAT 6180 

TACCTGGGTC AAAGGACTGG TTGAGAAACT CTGGTCCGAT ATTTTCGATT CCTTCAATTT 6240 

TTGAATAATC AAGCGGAACC AAGAGGTCTT CGTCCTTCAT CTTGTTAATC ATGTATTCAC 6300 

TTGGAATGGC AATATCGTAG GTCGTTCCAC CCTGCTTTAT CTTAGTGTAC ATGGCTTCGT 6360 

TGGAGTCAAA AGTCTCGTAC TGAACTTGAA TTCCTGTTTC TTCTGTAAAC TGAGTCAAGA 6420 

GTTCAGGATC GATATAGTCT CCCCAGTTAT AGATAACCAA TTTTTGACTA TCTCGACTAT 6480 



WO 98/18931 



PCT/US97/19588 



514 

TGATTTTACT ATCTAAATGA GTCGCAATTC CCCACAAGAC AAGGATAATC GCTGCAATTC 6540 

CTGCTAAAAA TGAATAGATT TTTTTCATGC TTGCTCCTCC TTCTCACGAG AGATAAAGTA 6600 

ATAACCTACA ACTAGGATAA TACTAAAGAG AAAGACTAGA GCAGACAGGG CATTGATTTC 6660 

TAAGGAAATC CCCTTGCGAG CACGAGAGTA AATCTCGACT GATAGGGTTG AAAAGCCATT 6720 

TCCTGTTACA AAGAAGGTCA CGGCAAAGTC ATCTAACGAA TAGGTGAAGG CCATGAAATA 6780 

ACCAGTAATG ATAGACGGAG TCAGGTAAGG AAGCATGATT TCCTTGAACA TCTGAAATTG 6840 

ACTAGCTCCC AAGTCATAGG CCGCATGAAT CATGTCGCCA TTCATTTCCT TGAGTCGAGG 6900 
CAAGACCATC AAGACCACGA TAGGAATGGA GAAGGCCACG TGACTAGATA GAACGGTCAA , 6960 

AAAGCCAAGT GAAAACTTGA GTTGGGTAAA GAGAATCAAG AAGCTAGCAC CAATCATAAC 7020 

GTCAGGCGCA ACCATGAGGA TATTATTGAG TGATAGAAAG GCTTCTTGGT ATTTCTTACG 7080 

AGACTGGTAG ATGTAAATGG CACCAAAAGT CCCGATAATG GTCGCTATCA AGGCTGATAG 7140 

GAAGGCCAAG AAAAATGTCT GAGCCAAAAT CAGCATGAGT CTCCCATCTC CAAACATGGT 7200 

TTCAAAGTGA GTCCAGCTAA AACCTGTAAA GCTATTCATA TCATCACCAG CATTAAAGGC 7260 

ATAGCCAATC AAGTAAAAGA TAGGCAGGTA GAGGACCAGA AAGACCAGTC CCAGATAAAG 7320 

GTTGGCAAAT TTTTTCATCG TTCTCTCCTT TCCTTAGTCA CCCACATGGT GATGAACATG 7380 

GTCAGGATGA GAATCACACC GATGGTTGAA CCCATACCAT AGTTGTCATT GGTTAGAAAA 7440 

TTCTGCTCAA TAGCCGTCCC CAAGGTGATA ACGCGTTCCC ACCAATCAAA CGGGTCAGCA 7500 

TGAAGAGACT CAAACTTGGG ATAAAGACCG ACTGAACCCC GG 7542 
(2) INFORMATION FOR SEQ ID NO: 59: 

U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9223 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

AAAACCAAAT TCCGGTATTT TAACCTATGC TGTAAATACC ATGAAGTCTG TCATGACAGA 60 

TCAGGTCTAT AACATTAAGG TTGAGACAGA AAATGGAAAT TATGTTGGTG AAGCTAGCCA 120 

TGTTTTGGTC CTTTTGACAA ATTACTTCGC TGATAAGAAA ATCTTTGAAG AAAACAAGGA 180 

CGGCTATGCC AACATTTTGA TTCTGAAAGA TGCCTCTATA TTCTCCAAAT TATCCGTCAT 240 

TCCTGATTTA TTAAAAGGGG ATGTTGTCGC AAATGATAAT ATCGAGTATA TCAAAGCGCG 300 

TAATATTAAA ATCTCTTCAG ATAGTGAATT GGAGTCAGAT GTTGACGGAG ATAAATCAGA 360 
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TAACCTACCT GTAGAAATCA AAGTCCTAGC TCAGCGAGTA GAAGTATTTT CAAAACCGAA 420 

AGAGGATTAG TATATAGAGA AAGCCTTTTT TAAGGCTTTT TGTATACTTT AAAAGATAGT 480 

TCCTTTAACA ACGGACATTC CTTGCAAATA GTTTTACAAA AATAGTATAC TGGATTCATT 540 

GAGTTTGAAA ACGTTTGCGT AAAATTTGAA TGAATACTTT AGGAGACAAA TTGATGGAAT 600 

TGAGTGCTAT TTACCATAGG CCTGAGTCGG AGTATGACTA TCTTTATAAG GATAAGAAAC 660 

TCCATATTCG AATTCGAACT AAGAAAGGGG ACATTGAAAG CATCAACTTG CACTATGGGG 720 

ACCCTTTTAT CTTTATGGAG GAGTTTTATC AGGATACAAA AGAAATGGTC AAGATAACTT 780 

CTGGTACCTT ATTTGACCAT TGGCAGGTTG AAGTGTCAGT TGACTTTGCA CGTATCCAGT 840 

ATCTCTTTGA GCTCAGAGAT ACAGAAGGTC AAAATATTTT GTATGGCGAT AAAGGGTGTG 900 

TGGAAAATTC TCTAGAAAAT CTTCATGCAA TTGGGAATGG ATTTAAGTTG CCTTAGCTTC 960 

ATGAGATTGA TGCCTGCAAG gTTCCTGACT GGGTTTCAAA TACGGTATGG TATCAGATAT 1020 

TTCCTGAAAG ATTTGCCAAT GGCAATGCTC TATTAAACCC AGAAGGGACT TTAGACTGGG 1080 

ATTCATCTGT CACACCTAAG AGCGATGATT TCTTTGGTGG TGATTTACAG GGGATTATTG 1140 

ATCATATGAA TTACTTGCAA GACTTGGGTA TTACTGGACT ATATCTTTGT CCCATCTTTG 1200 

AATCTACAAG CAATCACAAG TACAATACGA CAGATTACTT TGAAATTGAC CGTCATTTTG 1260 

GAGACAAGGA GACCTTTCGG GAACTGGTGG ATCAAGCGCA TCATCGTGGC ATGAAAGTCA 1320 

TGCTGGATGC GGTATTTAAT CATATTGGTT CGCAATCTCT TCAATGGAAA AATGTCGTCA 1380 

AAAATGGTGA ACAGTCTGCT TATAAGGATT GGTTCCATAT TCAACAATTC CCAGTGACAA 1440 

CTGAAAAGCT AGTTAATAAG AGAGACTTAC CCTATCATGT TTTTGGTTTC GAGGACTATA 1500 

TGCCTAAGCT AAATACAGCC AATCCAGAGG TCAAGAATTA TCTTTTAAAG GTTGCGACTT 1560 

ATTGGATTGA AGAGTTTAAT ATCGATGCTT GGCGTTTGGA TGTGGCTAAT GAGATTGACC 1620 

ATCAGTTCTG GAAGGATTTT CGTAAGGCAG TTTTAGCTAA AAATCCTGAT CTTTATATCC 1680 

TAGGAGAAGT CTGGCATACA TCTCAGCCTT GGCTAAATGG AGATGAGTTC CATGCCGTCA 1740 

TGAATTATCC TTTATCTGAT AGTATCAAGG ACTATTTCTT ACGAGGAATT AAGAAGACAG 1800 

ACCAGTTCAT CGATGAAATC AATGGAGAGT CTATGTATTA CAAGCAGCAG ATTTCAGAGG 1860 

TCATGTTTAA TCTCTTGGAT TCACATGATA CAGAGCGAAT CCTGTGGACG GCCAATGAAG 1920 

ATGTTCAACT GGTTAAATCA GCCTTAGCCT TTCTCTTTTT ACAAAAAGGA ACACCGTGCA 1980 

TTTATTACGG AACCGAGCTA GCCTTGACTG GAGGACCAGA TCCAGATTGT CGTCGTTGTA 2040 

TGCCTTGGGA ACGTGTATCA AGTGACAATG ATATGCTGAA CTTTATGAAG AGGCTGATTA 2100 



WO 98/18931 



PCT/US97/19588 



516 

AAATTCGGAA ATACGCGTCA GTAATCATTT CGCATGGCAA GTATAGCCTT CAAGAAATCA 2160 

ACTCTGATCT AGTAGCTCTG GAATGGAAAT ACGAAGGACG GATCCTCAAA GCAATATTCA 2220 

ACCAATCAAC AGAAGATTAT CTTTTAGAGA AAGAAGCAGT AGCACTAGCA AGCAATTGCC 2280 

AAGAATTGGA TAATCAGCTT GTCATCTCTC CAGATGGATT TATGATTTTC TAAAAACTAG 2340 

TTGATGAAGA TTATGGTACA TTTCATACCT TATATAGTAT AATAAGGCTA GTTACTAAAC 2400 

TTGTAAAGGA GAACTTAAAT GAATTGTAGA GGACATGAAA CAAGACAAAG AATTGTTAGA 2460 

GATTTTGAAG TTCAGCCTAA AGCACATATT AAGCTGTTAG CAAATCAACA AAAACATAGT 2520 

GATGCAGGAG CAACTATTGA AGATGAATAT TATGTATTTA TCGCTGAGAG TAAAATTGAT 2580 

GGCAAGAAGG AAGTTATTCA GTGTTGCATG GGTGCGGCAA GGGATTTTTT AGAACTAATT 2640 

AATCACAAAG GGCTACCTCT TTTTAATCCG CTTGTAGGTG ATTCTCATGT AAATAATAGA 2700 

CAAGAATATG ACAATACAGG GAGTGGAAAT TTATAACCTG AAAAGTGGAA TGAAACTGCA 2760 

AAGCAGCTTT ATAATGCTAT AATGTGGTTG ATTATTTTAT GGAATGCTAA GCCGGATACA 2820 

CCTTTATTTA ATTTTAAAGA CGAAGTAATT AAGTATAAAA CATATGAGCC TTTTGAAAGC 2880 

AGTATAAAAA GAGTAAATAC TACTATAAAG AATGGTAGTA AAGGGAAAAC TCTGACTGAG 2940 

ATGATTAATG GCTACAGAGC GGATAACGAT ATTAGAGATG AAATTTGTAA CTTTAATATT 3000 

CTGAAAAATA AAATTCGTGA TATGAAAAAC CAACAAGGAA ATACAATGGA ATCTTACTTT 3060 

TAGTTATTGT TGAATTTTGG GTATTCTATA AAATATCCTA ATTGAGATTT AAATAGTAGA 3120 

CTATACAATA TAGTTAAAAT ATCAGTAAAA ACAACACTTT ATTGAGGTAT TGGATACGCT 3180 

TTGCTAATAG CCTAATAATC ACATGTGGAG TGTTGCTACA ACGAAAAAGG TGATAATCCT 3240 

TGATTTCAAG CTATTTTATA AGCATTTTGT CTTTGTAGAT AAAGGCAATT TTGACAATAA 3300 

AAATCCTAAA AGGTGAATCG TTATAGATGT ATTTGTAGAT ATCGTTTGCG CATCGAAAAA 3360 

ATTAATACAA GAATAAATAT TTATAGCTCT TTAGGTGACT TTTATAGAAG TAAAGTTTAG 3420 

GATAGAAAAA CAAGAAATAA CGCACCATTT TTGGTGCGTT ATGCTTTTTT ATGCTATAAT 3480 

GGATTTATAA AAATAAAGGA GTTTGCTATG ATTGGAAAGA ACATAAAATC CTTGCGTAAA 3540 

ACACATGACT TAACACAACT CGAATTTGCA CGGATTGTAG GTATTTCACG AAATAGTCTG 3600 

AGTCGTTATG AAAATGGAAC GAGTTCAGTC TCTACCGAAT TAATAGACAT CATTTGTCAG 3660 

AAGTTTAATG TATCTTATGT CGATATTGTA GGAGAAGATA AAATGCTCAA TCCTGTTGAA 3720 

GATTATGAAT TGACTTTAAA AATTGAAATT GTGAAAGAAA GAGGTGCTAA TCTATTATCT 3780 

CGACTCTATC GTTATCAAGA TAGTCAGGGA ATTAGCATTG ATGATGAGTC TAATCCTTGG 3840 

ATTTTAATGA GTGATGATCT ATCTGATTTG ATTCATACGA ATATCTATCT AGTAGAAACT 3900 
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TTTGATGAAA TAGAGAGATA TAGTGGCTAT TTGGATGGAA TTGAACGTAT GTTAGAGATA 3960 

TCTGAAAAAC GGATGGTGGC CTAATGGAAA TCCAAGATTA TACTGATAGT GAATTCAAAC 4020 

ATGCTTTAGC AAGGAATCTT CGTTCACTGA CAAGAGGAAA AAAGTCCAGT AAGCAACCTA 4080 

TAGCGATTTT GCTTGGAGGG CAAAGTGGTG CCGGTAAGAC TACAATTCAT CGTATTAAAC 4140 

AGAAAGAATT TCAAGGAAAT ATTGTTATCA TAGATGGTGA TAGTTTTCGT TCTCAGCATC 4200 

CACACTATTT AGAACTGCAG CAAGAATATG GCAAAGACAG TGTAGAATAT ACCAAAGATT 4260 

TTGCAGGAAA AATGGTAGAG TCTTTAGTAA CAAAATTGAG TAGTTTGAGA TACAATCTTT 4320 

TGATAGAGGG AACTTTACGA ACAGTTGATG TTCCAAAGAA AACAGCACAA CTCTTGAAAA 4380 

ATAAGGGATA TGAAGTACAA TTGGCCTTAA TTGCGACAAA GCCTGAATTG TCGTATCTAA 4440 

GTACTCTTAT CCGTTATGAA GAACTGTACA TTATCAATCC AAATCAAGCA CGCGCAACTC 4500 

CAAAAGAACA TCATGATTTC ATTGTAAATC ATCTAGTTGA TAACACACGA AAATTGGAAG 4560 

AACTAGCTAT CTTTGAAAGA ATTCAAATTT ACCAACGAGA TAGAAGTTGT GTATATGATT 4620 

CAAAAGAAAA TACAACTTCA GCAGCAGATG TTCTTCAAGA GTTACTCTTT GGGGAGTGGA 4680 

GTCAGGTAGA GAAGGAGATG TTGCAGGTGG GGGAAAAGAG ACTTAATGAA TTACTTGAAA 4740 

AATAAACAAT TGATATTTTT AGGAGAATAG AAATGAGAGG GTTTAATAAC AAGATAAAGT 4800 

CTGTTTATCA AGAACTAACA AATTCCAAAG AGAAATTCGG TAGCTTTCAC AAGACTTTAA 4860 

TTCATTTGCA TACACCTGTT TCTTATGATT ACAAGCTATT TTCTAATTGG ACTGCAACGA 4920 

AATATAGAAA AATTACTGAA GATGAACTAT ATGATATATT TTTTGAAAAT AAGAAAATAA 4980 

AAGTTGATAA GACAATTTTT TTTAGTAATT TTGATAAGGT TGTTTTTTCT AGTTCAAAAG 5040 

AATATATTAG TTTTCTTATG TTAGCAGAGG CAATCATAAA AAATGGAATA GAAATAGTTG 5100 

TAGTAACTGA TCATAATACT ACCAAAGGTA TTAAAAAGTT ACAAATGGCA GTCTCAATCA 5160 

TAATGAAAAA TTATCCGATT TATGATATAC ATCCTCATAT TTTACATGGA GTAGAAATTA 5220 

GTGCAGCAGA TAAATTGCAT ATTGTATGTA TATATGATTA TGAACAAGAA TCATGGGTTA 5280 

ATCAATGGTT AAGTGAAAAT ATTATAAGTG AGAAAGATGG AAGTTATCAA CATTCACTGA 5340 

CTATAATGAA GGATTTCAAT AATCAAAAAA TAGTTAACTA TATTGCTCAT TTCAATAGTT 5400 

ATGACATTTT GAAAAAAGGT TCTCACTTAT CAGGTGCATA TAAACGAAAA ATTTTTTCTA 5460 

AAGAAAATAC ACGATTTTGG AGTTTAATAT TAACTCGAAA GAATCTTCGC AACAACTTGA 5520 

TATTCTCTAT AAAGAAGTTG GTGTATTAAG TTTGGGACAA AAAGTTGTAG CCATGCTTGA 5580 

TTTTTTATTA GCATATAGTG ATTATTCTAA AGACTTCAGA CCATTGATTA TTGATCAGCC 5640 
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TGAAGACAAT 


CTAGACAATC 


GTTATATTTA 


CAGGCATTTA 


GTTCAGCAGT 


TTAGAGATGT 


5700 


GAAAGCTCAA 


CGTCAAATTA 


TTTTAGCAAC 


ACATAATGCT acaattgtaa 


CAAATTCTAT 


5760 


GACAGATCAA 


GTTGTTATTA 


TGGAGTCAGA 


TGGAGTTAAC 


GGATGGATTG 


AATCACAGGG 


5820 


ATATGTTAGT 


GAAAAATATA 


TAAAAAATCA 


TATCATCAAT 


CAATTAGAGG 


GAGGAAAAGA 


5880 


TTCCTTCAAG 


CATAAAATGT 


CTATATATGA 


GACGGCTTTA 


TCAGAGTAGA 


GTCAGAAAAA 


5940 


GTAGGTTAGA 


AATTTAGCCT 


ACTTTTTTCT 


TTGTCCGACA 


GGCATAGTGT 


ACATCTGAGG 


6000 


TCCAAGTCCT 


CTGTGGATAT 


TTGCTGCAGA 


TGAAACCAAT 


AGCGACTCCT 


AAGCCTGAAT 


6060 


ATCGTGAGGT 


AGGGGGGATA 


GGAAGGAATT 


AGCGAAATCA 


AGGTTCTACA 


AACAGAATCG 


6120 


TG ACTTGAAG CCATATATAG CGGATGAGGA 


ACTCTAAAAT 


CCAAATAGGT 


GTCGTAACCT , 


6180 


ATATACGTAA 


ATTACGAGAG 


TAAACTAGGA 


AAGATGTACG 


GCTTATTCCG 


TGAGCGTTTA 


6240 


GGACGTAGTA 


CAACGAATCA 


TGGGAGTCAG 


CTGAACACAT 


AGTATTGAAG 


AAATTTCTGT 


6300 


AATGGAAATG 


GAGCGAAGAA 


GTGAACAATT 


AAATGAATAC 


CTCTCTAATT 


AAATTTGTCA 


6360 


ATTCTAATTC CTGGTATGAA AAGACAGTGA 


CCTGAAAATG 


TAAACGATGG 


GAGCTGATCA 


6420 


TAAATATAGG 


ACGGTACATG 


CAGTGGTGTT 


AGAGATTAGT 


CCTTACTTGA 


TTTGTGATAA 


. 6480 


CTTCCCCAAA 


TTTCTTCTGC 


TATACTTTTC 


TCAACTTTTA 


AAAATCCAAC 


TAAGAATTTT 


6540 


ACCTGGGGGT 


TTGGGGGCGG 


AGCACTAAGT 


TATCTTATCG 


TTAGCTGTCA 


AAACTGGTAG 


6600- 


GTTTTGATAG 


GCTGGCGATA 


TGATTTTTGG 


GATATTGTGG 


ACACAATATC 


TGAGCTCGCA 


6660 


AAGCCTTACA 


AGAATGAAAA 


TCAGTTGTTG 


GAAAAGTGTA 


CTGACATTGT 


ATGGTAGCTC 


6720 


ACATTGTCAG 


TACAAGTATT 


TTGGAAAGGA 


AGTAGCAGTA 


TGAAACGAGA 


TGTGCGTGAT 


6780 


ATTCGGAAAC 


AATTTCGTTT 


AACAGAAGCA 


GAAGAAAAGC 


AAATTCTAGC 


TTTGATGAGA 


6840 


GAGCGGGGAG 


AGACTAATTT 


CTCTGATTTT 


CTTCGTAAAA 


GTTTACTTTC 


CTCTGATTTA 


6900 


CAAAAACAGA 


TGGAGACATG 


GTTTGCCCTC 


TGGCAATCCC 


AAAAACTAGA 


ACAAATCAGT 


6960 


CGTGACGTTC 


ATGAAGTTTT 


AATCTTGGCA 


CAGTCAGAAC 


GTCAAGTCAC 


CCAAGAGCAT 


7020 


GTATCTATTC 


TCTTAACGTG 


CGTGCAGGAA 


TTGATTCAAG 


AGGTTGCAAA 


CACCATACCC 


7080 


CTCAGTAAAG 


AATTTCGTGA 


GAAGTACATG 


AGGTAAGCAC 


ATGGAACATC 


GTTACCGAAC 


7140 


CAATCTCAAG 


AAAGTGTTTT 


TGTCTGATAG 


TGAGTTGAAC 


CAACTAAATA 


TAAATATCGA 


7200 


TCAAAGTGGT 


TGTAAATCCT 


TTTCTGAATA 


TGCGAGACGA 


ACTCTACTCG 


ATCCTGGTAT 


7260 


GAATTTTATC 


ACGATTGACA 


CAAACGGTTA 


CCAAGATTTA 


GTGTTTGAGT 


TAAAGAGGAT 


7320 


TGGCAATAAT 


ATCAACCAGA 


TTGCTCGAAG 


TGTTAATCAA 


TCTCAGTTAA 


TTTCTGGTGA 


7380 


AGAATTGCAG 


GAGTTGAAAA 


AAGGAATTGG 


TGAATTGATA 


AAAGAAGTTG 


ATAAGGAATT 


7440 
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TAATCTGCAA GCGCAGAAGC TAAAGGAGTT CCATGGTCAT CACTAAACAC TTTGCCATTC 7500 

ACGGAAAGAG TTACCGCAGA AAGCTTATCA AGTACATTCT CAATCCTGAG AAAACCAATA 7560 

ATCTTGCCTT GGTGTCGGAC TATGGCATGA AGAATTTTCT GGACTTTCCT AGCTATGAGG 7620 

AAATGGTGCA GATGTATCAT GAAAATTTCA TCAGCAACGA TACGCTTTAC GATTTTCGCC 7680 

ACGACAGGAT GGAAGAAAAT CAACGAAAAA TACACGCTCA CCACATCATT CAGTCTTTCT 7740 

CGCCAGAGGA TCATATCACT CCTGAACAAA TCAATCGGAT AGGTTATGAG ACTGTGAAGG 7800 

AATTAACTGG TGGCAAATTT CGTTTTATCG TTGCGACCCA TGTTGATAAA GACCACCTGC 7860 

ACAATCACAT CATTATCAAT TCAGTAGATA GCAATTCTGA CAAAAAGCTC AAGTGGGACT 7920 

ACAAGGTGGA GCGAAATCTT CGCATGATTT CTGACCGTTT TTCTAAAATC GCAGGTGCTA 7980 

AAATCATTGA GAACCGCTAT TCTCACCAGC GGTATGAAGT CTATCGTAAG ACTAATCACA 8040 

AGTATGAACT CAAGCAGCGA CTCTATTTTT TGATGGAACA TTCTAGGGAC TTTGAGGATT 8100 

TCAAAAAGAA TGCTCCGCTA CTACATGTGG AGATGGATTT CCGTCACAAG CATGCCACCT 8160 

TTTTTATTAC GGACTCAACT ATGAAACAGG TGGTGCGTGG CAAGCAACTC AATCGCAAGC 8220 

AGCCTTACAC AGAAGAATTT TTTAAGAACT ACTTTGCCAA AAGAGAAATA GAAAGTCTCA 8280 

TGGAATTTTT ATTGCTGAAA GTTGAGAATA TGGATGATTT ACTTCAGAAA GCAAAACTTT 8340 

TTGGACTAAC TATCAATCCT AAACAAAAGC ATGTTTCTTT TCAATTTGCA GGAGTGGAGG 8400 

TAAAGGAGAC AGAGCTAGAC CAGAAAAATC TTTATGATGT AGAGTTTTTC CAAGATTATT 8460 

TTAAAAATAG AAAAGATTGG CAAGCTCCAG AAACTGAGGA TTTCGTTCAA CTTTATCAAG 8520 

AAGAAAAGTT ATCCAAAGAA AAAGAACTTC CAAGCGATGA GAAGTTCTGG GAGTCCTATC 8580 

AAGAGTTCAA GAGTAACAGA GATGCCGTTC ATGAATTTGA GGTGGAGTTG TCACTCAATC 8640 

AAATTGAAAA AGTAGTGGAT GATGGAATTT ACGTCAAGGT CAAGTTTGGT ATTCGTCAGG 8700 

AGGGACTTAT CTTTGTGCCG AACATGCAGC TTGATATGGA AGAGGATAAG GTGAAGGTTT 8760 

TCATCAGGGA AACCAGCTCC TACTATGTCT ACCACAAAGA CGCTGCCGAG AAAAATTGTT 8820 

ATATGAAAGG TCGAACCTTA ATTAGACAGT TCAGCTATGA AAATCAAACC ATTCCATTAC 8880 

GCAGAAAAGC GACAGTCGAT ATGATTAAAG AGAAGATTGC GGAAGTGGAT GCTTTGATTG 8940 

AACTGGAAGT AGAAAATCAA TCTTATGTCA CGATTAAAGA TGAGTTAGTG CATGAACTAG 9000 

CAGOGTCTGA ATTGAGAATC AATGAGTTGC AAGAACGAAT GTCAACCTTG AATCAAGTAG 9060 

CAGAATATCT ACTGGCTTCA GTTGAAAGTA AGCAAGAAAT GAAATTAAAT CTTTCAAAAC 9120 

TGAATATAAC TGAGAATATC AGTGCTAATA TTGTTGAGAA AAAATTGAAG AGCCTGGGGA 9180 
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ATCAACTGGA ATTGGAAAGG GGCAGGTATG AAAAGATGGT AGT 9223 
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6827 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

TCTGCTGGCT ACCATCATCT GACTTGGGCA AGACCAAAGT CTTAGTTACA ACTGTATTCT 60 

TCTCAGCATT TTCAATAACT GGCAATGCCG ACTGAAGCGT ATCTTTTTCT GTTTTTGTAG 120 

CTGGTCCAGT TTCTTTTTTC TGTCCGCAAC CAACCAGGAC AAAAAGGAAA GCTAGACTAA 180 

CAAGAACTAT TTTTTTCATT TCTTTCTTCT TTCTTTTTGA AATTAAAATA GAATAAGACT 240 

GGGAAGTGCT CCCAGCCTTG ATGTTTATAG AGCTGCACGC AAACGTGCTT CTGCATTTTC 300 

TACATTACGG ACAGAGCGTG GTAGGAAGGC ACGAATATCG TCTTCCTTGT AGCCAACTTG 360 

CAGGCGTTTT TCATCTACAA GGATTGGGCT CTTTAAAATT CTCGGTGTTT CCATAATCAG 420 

ATTGAGAACT TCATTGACAC TCAAATCTTC AATATCCACT CCAAGGGCTT TGGCATAGCG 480 

ATTTTTAGAC GAAACGATGC TGGCTATTCC GTTATCTGTT TTGGTTAGAA TATCCAGTAA 540 

TTCTTCTCTC GTAATTCCTT CTTTACCAAG GTTTTGTTCT TTATAACTTA ACTGGTGGGC 600 

ATTGAGCCAG GTTTTTGCTT TTTTACAGCT AGTACAACTT GAGACTGTAT AAATTTTAAT 660 

CATGTACCTA CCCCTTTCGC TACATGTTAC TATCAGTTTA GTCTATTATA CCATAAAAAA 720 

CATCCGACTT GCGACCTATT TTTAATTTTT TTTGACTTTT TTCGTCATTT TCGTACTTTT 780 

TTCTTGACAA ACAACTAAAT GACTATCAAC TCTTTTGGAG CTAGGGTCAA TAATTCACAA 840 

CCTGTCTCTG TAATCAGGAT ATCATCCTCG ATACGAACGC CATATTTGCC TTCGATATAG 900 

ATACCTGGTT CATCGGTCAA GGCCATACCT GTCTTAATAG TTTCTGTAGA AGTCTGACTA 960 

AAGTAGGGTT CCTCATGGAT ATCCAGACCA ATACCGTGGC CAATGCCGTG AGTAAAGTAG 1020 

TCACCATAAC CTGCCTCAAT GATAATATCA CGAGGGATTT TGTCAAAGTC ACGGAAACCT 1080 

AAGCCTGCCT TAGCTTGGTC AATCAAGGCT TGGTTAGCTT TTAGAACCGT ATTGTAAATC 1140 
TCTGCCTGCT CATCGCTAAC ATGCCCTAGA TAGATAGTCC GGGTCATATC ACTGACATAG • 1200 

TGGTCATAGA GACAGCCGAA GTCCATGGTG ATGGCTTCTC CCAACTCCAC TGGTTTGTGC 1260 

ATTGGATGGG CATGGGGTTT AGAAGAATTG ATACCGCTAG CTAGGATCGT ATCAAAAGAT 1320 

AAGCCAGATG CTCCCAACTC ACGCATGCGG AAATCAAGGA AGTTGGCAAT CTCAATTTCA 1380 
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GTTTTTCCTG GTTTGATAAA GTCAAGCGCA TCGCGGAAAG CTTGGTCTGA GATAGAACAA 1440 

GCCTTGCGAA TCGCTGCAAT CTCTGCCTCA TCCTTAATCA TACGAAGACC TTCCACAAAC 1500 

TGAGTTTGTG GAAGCAAGTT CAAACCTGCA AAAGCTGCCT GCATACGGTG GTAATAAGAC 1560 

ACTGAAATCT CATC TTC AAA ACCGATACGA GTCAAGCCCA TGTCCTTAAC AATTCCTGCA 1620 

ATGACAGCCA ATTCATCACG ATCAGCCACA ATCTCAAAAC CACTGGTTTC TTGCTTAGCT 1680 

GCGATGATAT AGCGAGAGTC TGTCACTAAG ACCTGACGGT CACGACTGAT AAAGACTGTT 1740 

CCGTTTGAGC CCCAAAAACC AGTCAAATAA TAGACGTTTT TAAGATTGTT GATGATGATA 1800 

CCATCTAGTT CTTTTTCTTG CATTTTAGCT AGAAATGCTT GTACGCGTTT ATTCATGATG 1860 

TAACTTTCCT TTCAAATAGT GTCCTGTATA GCTGGCTTCG TTGGCAGCTA CTTCTTCTGG 1920 

AGTTCCTGTT ACGATGATGG TTCCACCACC GACACCGCCC TCAGGTCCCA AGTCAATGAT 1980 

ATGGTCTGCC GTCTTGATAA CATCCAGATT GTGCTCGATG ACGAGGACTG TATTGCCATC 2040 

GTCTACAAAG CGAGCTAAAA CCTTGAGCAG GCGAGCAATG TCCTCTGTAT GAAGCCCTGT 2100 

CGTCGGCTCA TCCAGAATGT AGAAAGATTT TCCTGTCGAT CGTTTGTGGA GTTCGCTAGC 2160 

TAACTTCATA CGTTGGGCTT CTCCCCCAGA AAGGGTGGTA GCTGGCTGTC CCAAGGTCAC 2220 

ATAGCCTAGC CCTACATCCT TGATGGTCTG GAGTTTGCGT TGAATTTTCG GAATGTGTTG 2280 

GAAAAATTCT ACCGCATCGT TGACCGTCAT ATCCAAGACC TGCGAAATAT TCTTTTCCTT 2340 

GTAGTGAACT TCTAGGGTTT CACTGTTATA GCGGGTTCCG TGGCAAACTT CACAAGCCAC 2400 

ATAAACATCT GGCAAGAAGT GCATCTCAAT CTTGATAATC CCGTCACCTG AGCAAGCTTC 2460 

ACAGCGACCT CCCTTGACGT TGAAACTGAA GCGCCCCTTC TTGTAGCCTC GAATCTTGGC 2520 

TTCATTTGTC TGAGCAAAAA GGTCACGTAT ATCGTCAAAA ACTCCTGTAT AGGTAGCTGG 2580 

GTTAGACCTC GGCGTCCGTC CGATAGGGCT CTGGTCAATA TCAATCAAAC GGTCGACATG 2640 

CTCAATCCCT GTAATAGTCT TAAACTTACC AGGTTTGTCT GAATTACGGT TGAGCTTCTG 2700 

GGCAATGGCT TTTTTGAGAA TGCTGTTGAT TAGAGTCGAT TTCCCTGAAC CCGACACACC 2760 

TGTCACTGCG ATAAATTTTC CTAGTGGAAA GCGAGCCGTG ACATTTTGCA AGTTGTTCTC 2820 

ACGCGCTCCT ATCACTTCAA TAAAACGACC ATTTCCGACA CGGCGCTCTT CTGGTACTGG 2880 

GATGACACGT TTGCCTGACA AGTACTGACC TGTGATAGAC TTGCTGTTGC GAGCCACTTG 2940 

CTTAGGTGTA CCTGCTGCAA CAATCTCACC ACCAAAAACA CCGGCACCAG GACCAACGTC 3000 

AATCAGATAA TCAGCCTCAC GCATGGTATC TTCGTCGTGT TCCACCACGA TAAGAGTATT 3060 

GCCCAAGTCA CGCATCTTTT TCAGACTGGC AATCAGGCGA TCATTGTCCC TCTGGTGAAG 3120 
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ACGGATTGAC GGCTCGTCTA GGATATAGAG GACACCTGAT AGGTTGGAAC CAATCTGGGT 3180 

TGCCAAACGA ATGCGCTGAC TTTCCCCACC TGAAAGGGTT CCTGCTGAAC GTGACAGGGT 3240 

TAGATAGTTA AGACCCACAT TATTAAGGAA GGTCAAACGA TCCTTGATTT CCTTGAGAAT 3300 

GGGACGAGCA ATGATGGCTT CATTTTCAGA CAAAGTTAAC TGGCTCACCA AGTCCAAGTG 3360 

GTCAGCGATA GACAGGTCTG AGATTTCTCC AATATGTGGC CCTTGCTGGC CGCCCACACG 3420 

GACAGACAAG GCCTGGTCAT TGAGACGATA GCCTTGACAG GTTCCGCAGG TCAGCTCATT 3480 

CATGTAGAGA CGCATCTGAG TGCGAGTGTA ATCGCTATTG GTTTCATGGT AACGACGTTT 3540 

GATATTATTG ATAACTCCCT CAAACGGAAT GTCGATATCG CGCACGCCAC CAAATTCATT 3600 

CTCATAGTGG AAATGGAATT CCTTACCATC TGACCCATAG AGAATCAAGT TCTTATCTTC 3660 

TTCTGACAGG TCCTCAAAAG GCTTATCCAT AGCCACTCCA AAGACTTTCA TGGCCTGCTC 3720 

TAACATGTTT GGATAGTAGT TGGATGAGAT AGGATTCCAA GGTGCTAGCG CTCCCTCACG 3780 

TAAGGTTTTG CTAGCATCTG GCACTACCAA ATCAGTATCC ACCTCCAGCT TGATGCCCAA 3840 

GCCGTCACAC TCACTACAAG AGCCAAAAGG AGCATTGAAA GAAAAGAGAC GAGGCTCTAA 3900 

CTCTGGGACA GTAAAACCAC AAACTGGACA GGCATAATGC TCAGAGAACA ACAACTCCGA 3960 

GTCGTCCATG GTGTCGATAA TGACATAACC TTCTGCAATA CGAAGGGCAG CCTCAATGGA 4020 

ATCAAAGAGA CGACTACGAA TGCCCTCCTT GATAACAATA CGGTCAACCA CGACATCGAT 4080 

ATTGTGTTGC TTGCTCTTAG ACAACTCTGG CACTTCGGTC ACATCATAGA CTTCCCCATC 4140 

CACACGGACA CGAACATACC CGTCTTTCTG AACCTTCTCG ATAACACTCT TATGTTGGCC 4200 

TTTTTTCTTG CGGATGACAG GAGCCAAGAT CTGCAAGCGC TGGCGTTCAG GTAACTCCAA 4260 

AACCTTATCA ACGATTTGCT CCACAGAAGA AGCATTGATA GCTCCATGTC CGTTGATACA 4320 

GTAAGGCGTC CCCACACGTG CGTAGAGGAG ACGCAGATAG TCATTGATTT CAGTCGTCGT 4380 

TCCCACCGTC GAGCGAGGAT TTTTACTAGT CGTTTTCTGG TCGATGGAAA TAGCTGGGCT 4440 

GAG AC CATC A ATGGCATCTA CATCTGGTTT TTCCATATTT CCCAAGAACT GACGAGCGTA 4500 

GGCGGACAAA CTCTCTACAT AGCGACGTTG TCCCTCCGCA TAGAGAGTAT CAAAAGCCAG 4560 

ACTGGACTTC CCTGAACCTG ACAAGCCAGT CACGACAACC AACTTGTCTC GCGGAATCTC 4620 

CACATCAATA TTTTTTAAAT TATGGGCACG CGCCCCATGA ATGACAATTT TATCTTGCAT 4680 

CTTTGTTCTT TCTAGTCCAT TATTGCTTAC CATTATACCA AAAAAAGTGA GATTCTATTA 4740 

CCCAAAAGGC CGATTTTGTA GTATAATAGT ACAGTGTGAA AAAATCTGAA AAATGAGAAA 4800 

GGATAAGGGA TATGAAACAA GTTTTTCTCT CTACAACAAC TGAATTTAAA GAGATCGATA 4860 

CGCTTGAACC GGGTACTTGG ATCAATCTCG TCAATCCGAC TCAAAATGAA TCACTCGAAA 4920 
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TCGCCAACAC CTTCGATATT GATATTGCTG ACCTTCGAGC ACCGCTCGAT GCGGAAGAAA 4980 

TGTCTCGTAT TACCATTGAA GACGAGTATA CCCTGATTAT CGTAGACGTG CCGGTCACGG 5040 

AGGAAAGAAA TAACCGCACC TACTACGTAA CCATCCCGCT TGGTATTATC ATCACTGAGG 5100 

AAACCATTAT CACTACGTGT TTGGAACCAC TACCTGTCCT TGATGTCTTT ATCAACCGTC 5160 

GATTGCGTAA TTTCTATACC TTCATGCGTT CACGTTTTAT CTTTCAAATT CTTTATCGCA 5220 

ATGCAGAGCT TTACCTAACA GCCCTTCGTT CAATCGACCG CAAGAGTGAA CAAATCGAAA 5280 

GTCAACTGCA TCAATCAACT CGTAATGAAG AATTGATTGA GCTCATGGAA TTGGAAAAAA 5340 

CTATCGTCTA TTTCAAGGCC TCCCTCAAAA CAAATGAGCG CGTGATTAAG AAATTGACCA 5400 

GTTCAACCAG CAATATCAAG AAATACCTTG AGGACGAAGA CCTGCTTGAA GACACCCTGA 5460 

TTGAAACCCA ACAGGCCATC GAGATGGCAG ATATTTATGG AAACGTCTTG CATTCTATGA 5520 

CAGAGACCTT TGCCTCTATC ATTTCTAACA ACCAGAACAA CATCATGAAA ACCTTGGCCC 5580 

TTGTGACCAT CGTCATGTCC ATCCCAACCA TGGTCTTTTC TGCCTACGGG ATGAACTTTA 5640 

AGGATAATGA AATCCCCCTA AACGGAGAGC CAAATGCCTT CTGGTTAATC GTCTTTATCG 5700 

CCTTTGCTAT GAGTGTCTCG CTCACTCTCT ATCTCATCCA TAAAAAATGG TTCTAAGAGG 5760 

AGTTCCTATG TCTCAAATTG ATCTACAAAA ATtAACTAAG AAAAACCAAG AGTTTGTCCA 5820 

CATTGCTACC CAACAATTCA TCAAAGATGG GAAAACAGAC GCTGAAATCC AGACTATTTT 5880 

TGAGGAAGTC ATTCCCCAAA TCCTTGAGGA GCAATCTAAA GGTACAACTG CCCGTTCCCT 5940 

ATACGGCGCA CCAACTCATT GGGCTCATAG CTTCACTGTC AAAGAGCAGT ACGAAAAAGA 6000 

GCATCCAAAA GAAAATGATG ACCCAAAACT GATGATTATG GACTCAGCTC TTTTCATCAC 6060 

TAGCCTCTTT GCCCTTGTCA GCGCCCTCAC AACCTTCTTT GCGGCAGACC AAGCTTTCGG 6120 

CTATGGATTG ATTACTCTTC TATTAGTTGG ACTGGTTGGT GGATTTGCCT TCTACTTGAT 6180 

GTACTACTTT GTTTACCAAT ACTATGGACC AGATATGGAT CGCAGTCAAC GTCCACCTTT 6240 

CTGGAAATCT GTACTAGTTA TCCTAGCTTC TATGTTCCTT TGGTTGCTTG TCTTCTTTGC 6300 

AACAAGCTTC CTACCAGCTA GCCTTAACCC AGTACTGGAT CCATTGCCAC TAGCTATTAT 6360 

TGGAGCAGCC CTCCTAGCCC TTCGCTTCTA TCTCAAGAAA CGCTTGAATA TCCGTAGTGC 6420 

AAGTGCAGGA CCAACACGCT ATCAAGAATA AGAAAACGAT AAAAGCAACT GCAGGTGCGG 6480 

TTGCTTTTTC ACTTACTTTT TTGAGTTATA TTCAATGAAA ATCAAAGAGC AAACTAGGAA 6540 

GCTAGCTGCA GGTTGCTCAA AGCACAGCTT TGAGGTTGCA GATAAAACTG ACGTGGTTTG 6600 

AAGAGATTTT CGAAGAGTAT TAAAAGTATT CTTCTGAAAT CCCACATAGC TTTCTCTTAT 6660 
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ATTTTGTGAT AAAATAGGCT CAATCTATTT CTAGGAGGAT GAGATATGGT TTCTACTATT 6720 

GGTATTGTTA GTTTATCTAG TGGCATTATC GGAGAGGATT TTGTCAAACA CGAAGTGGAC 6780 

TTGGGTATCC AACGTCTCAA GGATCTGGGA CTCAATCCCA TCTTTTT 6827 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11864 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 61: 

CTGGCTAGTT GCATAGAGCA AAGTTGCTTC TTCATCAACA AAACCGTTCA TTTCAAAATA 60 

GGAAAGCAGC TCATCAGGAC TCTCCAAACG AATCCCTTTG TAATCCAGCT CAACTGCCAC 120 

CTCTTTCAAG GCTGCAAGAA GAAGTGTTCC CAGGCCCTGT CTCTGATGGT CAAACTCGAT 180 

GACTAAAGAA TGTACTTTTA GACATTGCGG ATTGTCTGAC TGGGGACTTG ATAAAATATA 240 

GCCTAAAAGT TGATTTTCAT CCCTAGCTAG AAGAAAGGTA TCCGCACACT TACGGATACT 300 

TTCTTCTAAA ATATGGGAAA GTTGCTGCTT TTCAGCTGGA AAAGACGAGG TCTGAAGTGC 360 

CCCTATCTCA GGCAAATCAG ACTTGCTTGC CTGAATGATC TTAATTGGAA TTTCCATGGG 420 

AACATCCTAT TGAACATTGC TTGTCAAGTT AGACAAGAGA CGCTCAAATG AGTATTCATA 480 

GGTTTGGATG TCTCCTGCTC CCATAAAGAC GTAAACAGCA TTGTCATGGT CTAGGAGTGG 540 

AGAAACATTT TCAACAGTAA TCACTTGGTG TTTTTTGTTG ATTTTGTTGG CTAGGTCTTC 600 

TACCTTAACG TCACCATGAT CTACTTCACG AGCCGAGCCA TAAATTTGCG CTAGATAAAC 660 

AGCATCTGCT TGGTTTAAAG CATGGGCAAA GTCGTCCAAC AAGGCAATGG TTCTTGTAAA 720 

GGTATGCGGT TGAAAGACTG CTACAATTTC CTTGCTTGGG TATTTCTGAC GAGCCGCATC 780 

CAAGGTCGCA ATAATTTCTG TTGGATGGTG GGCAAAGTCA TCGATAATCA CTGTATCATT 840 

GACAATTTTC TCAGTGAAAC GACGTTTAAC ACCGGCAAAT GTTTTCAAGT GCTCACGCAC 900 

CAAGTTCAAA TCAAATCCTG CTGTGTAAAG AAGACCAATA ACGGCTGTCG CATTCATGAT 960 

ATTGTGACGA CCAAAGGTTG GAATGTGGAA TTGCCCCAAG TTTTGTCCAC GGAAATGAAC 1020 

GGTGAAGGTT GAACCAGTTA TTGAACGAAG AAGATCACTA GCTACAAAGT CATTGCCTTC 1080 

AGCTTCAAAA CCATAATAAT AAATTGGTGC ATCAGACGTA ATCTTACGCA ATTCAGCATC 1140 

TTCACCATAG ACAAAAAGAC CCTTGGTGAT TTGTTTGGCA TAGTCGTTAA AGGCATTAAA 1200 

AACATCCTCG AGACTTGTGA AATAATCTGG ATGGTCAAAG TCAATGTTGG TGATAATAGA 1260 
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GTATTCTGGG TGGTAAGGCA TGAAGTGACG CTCATATTCG TCAGATTCAA AGACAAAATA 1320 

TTTGGCATTG GCCGAACCAC GACCTGTCCC ATCTCCAATC AAGAAGCTGG TATCTGTAAT 1380 

GTGAGACAAG ACATGAGACA ACATACCTGT CGTTGAAGTT TTTCCATGTG CTCCTGCTAC 1440 

TCCCATGCTA ACAAAGTCAC GCATAAAGCT ACCTAGAAAC TCATGGTAAC GTTTGTAGCT 1500 

GATACCATTT TGGTCCGCAT AGGCAATTTC GACGTTGTTA TCTGGACGAA AGGCATTTCC 1560 

AGCGATAATT TCCATATCAC CGTCTAGATT TTTTTCATCA AAAGGAAGAA TGGTAATTCC 1620 

TGCCTGCTCA AGACCGCGTT GGGTAAAGTA GTACTTTTCA ACATCTGATC CCTGAACCTT 1680 

GTGCCCCATC TGGTGCAACA TCAAGGCCAA GGCACTCATC CCTGATCCCT TAATTCCGAT 1740 

AAAATGATAT GTCTTTGACA TGTTTTCTCC CCTATTCTGT CATTCTGGTC AGATTCAACT 1800 

CTTGGGCAAC CCGACGTTCT TGTTCTGTTT GTTTACTTTT TTTATTGTAG ATTTGGCTCT 1860 

TCTTTAGAAA ATCATAATTG TTTTTCTTTG GAGCAGGTGC TGACACTTCT TCATTCTTGG 1920 

TAGGGATAGA ATGAACTTCT TCCGCCAAGA TATAATGAGA CTGGGTCAAT TTTTGGCTAT 1980 

ATTTGACAAA TTCACCAGGA TTTTCCTTTT GGAAAGGAGC TGTCGGTTGA TTGCCCTGTC 2040 

TAACTAGACT GGGCTGAGAA TGACGTCTCG CAAGGCTGAA ATCCTGAGTT AGGTAGTTAG 2100 

CAGAGCGTTT CTTTTTCAAG TCCGCACGCG CTTCTTCACG CGCCACCTCC GCATAGCTCT 2160 

TTCCTTCTTT TTTAACCCCT AAAGGAGCCT TTTTAGGTTT TTCGACTTGC TTTTCAATCG 2220 

GTTTTACTGG TTTTTCTTCA GCAATAGGAG CCCATTCTAA ATAATTTTTA TCTCGATACT 2280 

CACCCTTGAT ATTACTGATC AGATCAGACT CATCATAGAG ATTCATGACT GGCATTTCAG 2340 

TCAACATGAC CTCGTCATCT GACACCAATG GAAATCGTTC TTGTTTCATT TTCTATTTCC 2400 

TTTCAACACT TCATTATAGC GTATTGTCTT GATTTTTCAA GTGCTGGCTT CAGAAATTCC 2460 

CAAAATTTCT CTAATTTCTG CTAGGGTCAG ACTACCACGT GACTCTGTGC CGTCCAATAC 2520 

TTGTGACACC AGATGTTTCT TTTGTTCTTG GAGTTCCTGA ATTTTTTCTT CAATGGTTCC 2580 

CTTGGTCACC AAGCGATAGA CCTCAACCGT TTCTTCCTGA CCCATCCGAT GGGCACGGCC 2640 

AATGGCTTGC GCTTCCACCG CAGGATTCCA CCAAAGGTCA ACCAAGATCA CTGTATCTGC 2700 

ACCTGTCAGG TTCAGACCGA CCCCACCAGC CTTGAGGGAA ATCAGAAAGG CATCTCTTTC 2760 

TCCTTGGTTA AAGGCCTTGG TCATGTCTTG TCTTTCCTTG GCTGGGGTTG AACCCGTAAT 2820 

TTTAAAGGAA GTCAGGCCCA AGTCTGGCAG TTCTTGTTCA ATTTTTTCCA ACATTCCCTT 2880 

GAACTGAGAG AAAATCAAGA CACGGTGTCC GCCGTCTGCC ACCTGTACCA GTAGGTCTCG 2940 

GAGACTATCT AGTTTGCCGC TGGCTCCCTG ATAATCTTCC ATAAACAGGG CAGGAGTGTC 3000 
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ACATATTTGA CGCAAGCGCA TCAAACCAGA TAAAATTTCC ACACGACTTC GCTGAAATTC 3060 

CTGTTCTGAC ACTTGAGCCA GATGGTCTCG CATCTGTTGT AACTGGGCAA GGTAAATAGC 3120 

CTTTTGCTGG TCTTCCAGTT CATTTTTATA AACCACCTCA ATCAAGTCTG GCAATTCAGT 3180 

CAGAACTTCT TCTTTCTTGC GTCGCATCAC GAAAGGCTTG ATAAACTGAG CCACTCGCTC 3240 

TGCTGGCAAT TTCATAAATT CTTTCTTGCT TGGCAAAAGT CCAGGCATGA CGATTTGGAA 3300 

AATAGACCAC AACTCACCCA GATGGTTTTC AATCGGAGTT CCTGACAAGG CAAAGACCGA 3360 

CGGCACCACA AATTGTCTCA AGGTCTGGGC AATCTTGGTC TGGGCATTTT TCATGACCTG 3420 

AGCCTCATCT AAGAAAAGGA AGTCAAAGGC CATCCCTTGA TAAAACTCAC TGTCCTGACG 3480 

GAAGGTGGCA TAGCTAGTCA CATAGATTTG ATGGCTCTCG GCAAGAATCT CCTCACGACT 3540 

TGCTTTCAAA CCATGAACAA CAGTCACATC CAACTGTGGA GCAAATTTCT GAAACTCATC 3600 

TGCCCAGTTG TAAATCAAAC CCGACGGAGC GAGAATCAAA ACCCGACTTT CTTTTGTCAC 3660 

TTGACTAGTC AAAAAAGCAA TGGTCTGAAG GGTTTTCCCA AGTCCCATAT CATCAGCCAA 3720 

AATCCCACCA AAACCATAAT GATGGAGCAT CTGCAACCAG CCAATTCCCT TTTCCTGATA 3780 

ATCTCGCAAG TCAGCCTTGA CCTGAGTTGC TTGCAAAGGA AAGTCCTCTG GATGCGTCAA 3840 

ATCCTGGGCC AGATTCTGGA ATTCTTGTGA AAAAGAAACA CGGTCTGGCC CTTCAAAGAG 3900 

ATGAGCTAAA CTGTAGGCCA AGGATTTCCG AGCCTGCAAG GTCCCATCTT TTAATTCAAA 3 960 

TTGCCCCAGT TCCTGTAGAT TTTGGCGAAT TTTCTTGGTT TCTTCATCGA AAAAGTAAAC 4020 

TTGATTAGAC GAATCAATAT AAAAATCCTG ATTGGCAACC AAGGCCTGCA TGGCTTGGTC 4080 

GATTTCCTCC TGGACAATAT TTTGAAAATC AAACTGGATT TCCAAGAGAC CTCCCTTGGA 4140 

GGCAATCTGC ACCTGAGGAC TCGCTAGGCT ATAAAGCTCT TCTAGTTTAT CTGATAGGTC 4200 

AACATGCCCG AGTTTTTCAA AGACTGGAAT GATATCATGA AAAAAATGAT AGACAGACTC 4260 

CGCTTTTAAG GCCTGACGCC AAGATTGAAA ATCGGCCTCA AAGCCCGCAG CCAAACAGAC 4320 

TTGGAAAATT CTTTCTTCTA AGTCTGCGTC ACTTGAAAAG GGTAATTCTT CTAGCTCTTG 4380 

TCGGCTAGAT ACCTGTCTAT TTCCATAATC AAACTGAATT TCTAAACGAA TCCGATTATC 4440 

TTCTTCCCTG TCAAAGTAAA AAGAGGGCGC AAAAGTTTTG ATTTGTAGAC GTTCTGGAGC 4500 

TGAAACGGTG CCCATCTGGA TAAAAAGAGT CAGACAGGAG GCCAATTTGT CTCGATCACT 4560 

GCTATCAAAT TGCAGGTATT TCTTTCCTTG TTGACCCACA GGTAACGCTT TAATTTCCTT 4620 

GAGAAGACGC ATCTGCTGGT CTGTTAAAAA ATAAACCTGA CCTTTATGGA AAAGTACTGC 4680 

TCCCTGATAA AAGACATTGA CCCTAGGACT CTCACTGATT TCCATTTCAA AATAATCGGA 4740 

GTATTCTGTT ACTGTAAAGG CAAATAGATT GGCATCAGCA TGCATATCCT GAAAAAGCAG 4800 
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GGTTTGGTAG CTATCCACTT GATGGTCAAA TTGAAAATGG GGCAAGGCCA TCAGTAAATT 4860 

CACACCCTGC TCAAAAAAGG TCAGAGGGAA AAAGAGGTGC CGACCTTGGT TTTGGAAAAA 4920 

GAGGTCTGGA ACCAGCCCTT CCTCCGTTAG TCCGTGCAAG AAAGTCAAAA GTTCTTGGCT 4980 

GGCATCATCA AAGGCTTCCC AAGAAAGAGA CTCCTCATAA ATCTTGCCAA TCATATACGA 5040 

CTTTCTCTGC TCGACAATCC TTAAAAAAAG TGGAATATCC CGAATGACAT AGTATTTTTG 5100 

GCTATTGATT TGGCCGATTC TCAGAGTCCA CAAGATATGA TTGGTTCCTG CTTCCACCTG 5160 

ACCCACAGCT GATAACTCAT AGGCGCATTC TGATTTTGGA GATAAAATTC GATCCAAAAA 5220 

CTTGCCACCC AAGGTCACCT TGGTTTCAAC AGCCTCTTTT TCTTCATGAC CTTCTTCCAG 5280 

ACTCCACAAG ATTTCCTGAC CACGCTCATC ATTTTTCAGA AAATGCTCTA GCGCTGCCAA 5340 

ATGCACACAG TAGCCCCTCT TTTGAAAAAA ATCACAGGCA CAAAAAACCA AATCATCCTC 5400 

TAAACTATAG CGCAGTTCTT CTTCTGCAAC GCGAGCGTAG AGCCGATTGT TCTTTTCCTT 5460 

GATGATATCA ACCTTACCAG TTTCATAAAG GGCAACACCT TCGATACGAA TTTTCCCCGG 5520 

AATCAATTTA GCCATATTTT CACCTTTACC TTATCTTTTT ATTATACCAT ATTTTCGCCT 5580 

ATGAAAATAG CCTTCTAGGA AGACTTTTCT CCTAGAAGGC TGGATTTTTA ACGTTTGGCA 5640 

AAAGTAGCCA CAATCCGCTG ACAGACTTCT TGCAACAGAG ATTTGGGCAT AGCTATATTG 5700 

ATGCGGGCAT GGAGACTTCC TTCCTCTCCA AAATCCAAAC CACGGTTGAG GATAACCTTG 5760 

GCTTCATTTC TCAACAACTC TTGCAATGTT TCATCAGTCA GGTCATAAGC TGAAAAGTCA 5820 

AGCCAAATCA AGTAGGTACC TTGCGGTTTC ATGACCTTGA TTTTAGTCTC TTTTCCAAAT 5880 

AGATCCATCA CATAATTGAT GTGGTCTTCA AAGACTTGCT TGAGTTCCTC TAGCCAATCT 5940 

TTACCGTATC GATAGGCAGC TTCTGTCGCC AAATAACCCA AGCCTGAAAT TTCATGCTGA 6000 

TTATTGGCCA ACAGGCGTTT CTGGAAAGCC AGTCTCAACT TAGGATTTTC AATGACTGCA 6060 

TAGGAATTTT TTGTTCCAGC AATATTAAAT GTTTTAGTGG CACTGCTCAA GACGATAGCA 6120 

AAATTTTTGA AGGCAGGATT GATGGTATTG AAAGACTGGT GTTTGTGACC AAAGAGGGTC 6180 

AAATCTTGGT GAATCTCATC CGAAACTAAC AAAACACCGT GTTTTTGGCA GAGTTGGCCA 6240 

ATCTTCTCCA ACACTTCTTT TTCCCAAACA CGTCCACCAG GATTGTGAGG GTTGCAAAGA 6300 

ACATAGAGTT TAACCTCCTC TTCCACCAAA TCCTTTTCAA GTTGGTCAAA GTCAATCTCA 6360 

AACAGACTAT CCTTTTCCAC TAAGGAATTA GTAATCAATC TACGATTATT CAACTTGACA 6420 

CTGCGAGCAA AGGGTGGGTA GACAGGCGTG TTAATTAAAA CCGCCTCGCC TTCTTTTGTA 6480 

AAGGTTTGAA TAGCTGTTGA GATGGCTGGT ACCACACCCT CGATAAAGAC AAGAGCCTCT 6540 
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TTGTCAAAGT 


TGTAACCGTA 


TTGTGTAGCT 


TCCCACTTTT 


GAACTTCCTT 


AATTAAGTCT 


6600 


TCACTGGCAT 


AGGTATAACC 


ATAAACCAGT 


TGGTCTGCGT 


AAGTTTGCAC 


GGCTTGGCGG 


6660 


ATTTCAGGCA 


AGACCACAAA 


GTCCATATCC 


GCTATCCAAG 


CTGGTAGAAC 


TTCACTATCC 


6720 


GTTTCTGTTT 


CTTTCCATTT 


ATAGGTATGG 


TGCCCTAAAC 


GGTTGGGCAG 


GCTTGTAAAA 


6780 


TCATATTTTC 


CCATCTTTGT 


CTTATCCTTC 


TATGGCTTGG 


CGCAAATCTG 


CAATCAAATC 


6840 


TCTAGCATCC 


TCAATCCCAA 


TAGACAAACG 


CAAGAGGTCA 


TCTGTCAAAC 


CATAAGAATG 


6900 


GCGTACCTCT 


GCTGGAATAT 


CAGCATGAGT 


TTGAGTCGTT 


GGATAAGTAA 


TAAGACTTTC 


6960 


CACTCCACCC 


AAACTTTCCG 


CAAAAGAGAA 


GACCTTGAGA 


CTGTTCAAAA 


TATGAGGAAT 


7020 


GCGTGTTTCA 


TCGGCTACTT 


TAAAGGAAAT 


CATGCCTCCA 


CGACCAGTGT 


AGAGAACTTC 


7080 


CTTAACTGCT 


GGAGAATCCT 


TCAAAAAGGC 


AACCACTTCT 


TGGGCGTTAG 


CTGTTGAGCG 


7140 


CTCCATACGA 


AGAGACAAGG 


TCTTGAGACC 


ACGAAGCAAC 


TGGTAGCTGT 


CAAATGGAGA 


7200 


CAAGACTGCC 


CCTGTTGTAT 


TAAGATTGTA 


AAAAAGCTTC 


TCGTATAGTT 


CTAAACTATT 


7260 


GGTCACAACC 


ACTCCAGCCA 


AGACATCATT 


GTGGCCTGCT 


AGATACTTGG 


TTGCTGAATG 


7320 


GAGAACGATA 


TCTGCTCCAT 


CTTCAATCGG 


ACGTTGGTAG 


ATAGGGCTAT 


AGAAGGTATT 


7380 


GTCCACCACC 


ACTTTGGCAC 


CCTTAGCATG 


AGCCAATTTT 


GCTAGTTTTT 


CGATATCAAA 


7440 


TTCCAACATC 


AAGGGATTGG 


TTGGGGTTTC 


GATATAGAGA 


ACATCCACAT 


CCTTTTCTAA 


7500 


CTCGGCAATC 


AACTCTTCTT 


CTGTATTGGC 


ATAGGTAAAA 


TGGAAATGAC 


CTTCCTGCTC 


7560 


CACTTGGTTA 


AACCAGCGAA 


AAGAACCACC 


GTAAAGATCA 


CGCACTGCCA 


AGACCTTACT 


7620 


TCCTACTGGA 


AAGACGCTAA 


AGGCCAGTAC 


AATAGCTGAC 


ATCCCTGAGC 


TAGTCGCTAG 


7680 


GGCATAGTCT 


GCTGACTCAA 


TAGCCGCCAA 


GACTTCCTCA 


GCCTTACTAC 


GAGTTGGATT 


7740 


TTTAGTGCGC 


GTATAGTCAA 


ACCCAGTAGA 


TCGACCAAAC 


TCTGGATGCT 


GATAGGTCGT 


7800 


TGAAAAATGA AGTGGTGTCA 


CCAAAGCACC 


TGTTGCCTCA 


TCAGACTTGA 


TCCCTGCTTG 


7860 


TGCTAAAATT 


GTGTTAATGT 


GTAATTCCTT 


GCTCATACAA 


TTCCTCCAAA 


TCTATAGTAA 


7920 


CTATTGTACC 


ACTTATTTTG 


TATCCTTCGT 


TTTCTTGTTT 


TCAAGAGCTA 


GTTATAGTTT 


7980 


CAAACTATAT 


AAAAAGGGAG 


TTTTTCCTGC 


TCCCTTTAAT 


AGACTATAAA 


ATGGTGAATC 


8040 


TCAAAAGACA 


CCTTCACTCT 


ATCATTTGCT 


CCTGCACAAA 


ACGAGCATAA 


CGCTCATGAT 


8100 


TTTCCAGTAG 


TTCCTTATGA 


GTTCCTGAGC 


CAGTGATTTT 


CCCCTCCTCT 


AAGAAGAAAA 


8160 


TACAATCCAC 


ATCTTTTACC 


GTTGACAAAC 


GATGCGCTAT 


AATCACAACC 


GTCTTCTCCT 


8220 


TTAGTACAGA 


ATAGAGGCTA 


CTGATAATCG 


CATACTCAGA 


ATCCGCATCA 


AGATTAGCAG 


8280 


TGGCTTCATC 


AAATATAAGA 


ATTTCAGCAT 


CTTTTAAGTA 


GGCTCTAGCT 


ATTTGAAGTC 


8340 - 
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TTTCGTTCGC 


CCCCCTGACA 


AGAGTCGTCC 


GCGTTCACCA 


ACTTCAGTAT 


CTAGTCCCTC 


8400 


TTTCATGGAG 


CGAATCTCAT 


CACCTAGTGA 


TACTAAGTCT 


AGCACTTTCA 


TCAATTCATC 


8460 


ATCAGTTACT 


AAGCGATTCA 


AACCGAGACA 


AAGATTGTCA 


CGAATACTGC 


CAGATAAGAC 


8520 


TGCATTATTT 


TGTGAAACCC 


AAGCGATTTT 


ACTTCTCCAT 


TCTTTTAAGT 


TAAAATCATA 


8580 


TATACTTGAT 


TGCTCCATTA 


GAATATCTCC 


TGAAAGCGGT 


TTATAAAACC 


GCTCTAACAA 


8640 


ACGCACAATC 


GTTGATTTTC 


CTGATCCAGA 


TGGTCCAACA 


AAAGCAATTT 


TTTGCCCCTT 


8700 


GAAAATTGAA 


CAAGTAATAT 


CCTTTAAGAC 


AGGTCGATTT 


TCATCATAAC 


CAAAATAGAC 


8760 


ATGGTTAAAA 


TTCAACCCTC 


GTCCTGATAC 


CGATTTTCCT 


CCCTCAAATT 


TTTCTTTAGG 


8820 


AACTGCAAGC 


AAGTTCTCCA 


GTGCAACTGA 


AGATCCCTTG 


CTCCTAGAAT 


AAACAGTTAC 


8880 


AAAATTAGCT 


ATATTACTAA 


TAGGATTAAG 


TAATTGAAAG 


AGGTAAATCA 


AAAACGAAAC 


8940 


CAAGGTTCCC 


ACAGATATAT 


ATCCTGCGCT 


GACCCGATAA 


CCCCCATAGG 


TTAGCATCAC 


9000 


AGCTATAGTC 


GCAAAGATAA 


ATAAGAGAGC 


AAACGGGGTC 


TCAAAAGAAG 


TAACCCTATC 


9060 


TGATTTCAGT 


GAATTGTTTT 


GTACCCTTTC 


AATACAATTA 


TCCAAAACAT 


CCTGTACACT 


9120 


TTTCTCTGCT 


TGGTTAGTCT 


TAATTAATTC 


ATGTTCTTGA 


ATCTTTTCAG 


TCAATTGCCC 


9180 


TGTTAAATTT 


CCTCCTGTAA 


ACGACGACTA 


TACTTTTCAC 


TGATATTGGA 


AAGGGGCAAG 


9240 


ATAATAAACA 


TCATACAAGG 


AAGAGTGATG 


AATAAAAGTA 


GAGAAAGATT 


CCAATCAAGA 


9300 


CTAAATAAGA 


CTACAATGGA 


ACCAAGTACC 


ATAACTAAAC 


TCAGAATAAT 


ATTTGGGAAA 


9360 


GTCGTAATTA 


AAAACTCACG 


AATGACACTC 


GTGTCATTGA 


CAATGGCAGA 


AGTCAACTCC 


9420 


CCACTTTGGC 


TCTTATCAAA 


GAAGGATTTC 


TCTACATAAA 


TCAACCCCTC 


TATCACTTTT 


9480 


TTCCTGATTT 


TTGCTATCTT 


TTTTTCACCC 


GATTGACTAA 


ACAGATAGTA 


ACCAATAGAA 


9540 


GAAAACAAGG 


CTTGACCAAT 


AAAAATCAAA 


AACGATTGAA 


ATACTTTGGA 


GCCTATATTT 


9600 


TCAATAGAAC 


TCCCATCTAT 


TAAATCCTTT 


AAGATAAGGG 


GAAGCAACAA 


AGCAAGTAGA 


9660 


CTAGACAGAA 


CAAGTAAGAA 


ACTCCCCATA 


ATCACCTTAG 


TATCTACTCT 


TAATAATTTT 


9720 


AATTTCATAA 


ATACTCCTTA 


TAATATTTCA 


ACGGATAAAG 


TCGGGAATAA 


CTCAATTTGA 


9780 


GGATAAAATC 


TAATAAATCT 


TCCTATAACA 


AAACGCATAA 


CATCTAGGAT 


TTTATATACC 


9840 


TGATATTATG 


CGTTTTTAAG 


CACAAAGACT 


TCTTACACAA 


ACTTATCTAC 


AATTAGATTT 


9900 


TATTTGACAT 


GTTTTGCCAA 


TTCTTCTTGG 


GCTTTTTTAT 


TGGATTCTTC 


TTTTTCTTTC 


9960 


AACCATTTTT 


CTCTGGCTTT 


TGCATATTOG 


TCTGTTGTGA 


CAATCTTATC 


TTGTACTTTG 


10020 


AGGTATTTAT 


ATGATTCAAC 


CCCTTTTGTA 


CCGGTTAAAC 


CATAGGCAGC 


AGCAAATGGT 


10080 
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ACGGTTCTTC 
TCAATCAACC 
TTATTAGCTT 
TTGGCCTCAC 
ATATCGAGAT 
TAATCTTTCT 
AATTGCTGAA 
ATAGAACTAA 
ATTGGGAATT 
GCTTTCTCAG 
TTACCATAGT 
ACAAAGTTTG 
GACTGAGCCC 
TTGAGAACTG 
TAAGACTTCC 
ATGATATTGT 
CGAGCCGTAG 
TCATAGTAGG 
TTTTTCTTAT 
TACAAAATAC 
GCATTAACAG 
ACCAAAGTAT 
CTTGTTTTAC 
GCTTCTGATT 
GGCGCATATT 
TAGGTCAAAC 
TATTGGTCAT 
CGGTTTTCTG 
TTTGACGCCG 
CCTGCCAAGG 



TCAATGATGG TGTTCCCCCA 
AAGCTTGAAT ATCAGCATAT 
CTTCCAACAT TTGAGTATAG 
CAGGCTCTAG TCCAAGATTT 
AGGTTGACGG GTCTTGATAA 
GAGCAGCTGT TTGAGCAAAG 
TGTCAATCAC TACATTATCA 
CTCCTTGTAT GCCTACTTTA 
GAACACCCTT TGCTTCGAGT 
GATTGTAGTA AGGGTCTTGA 
TGACCATCTT AGAGGCTACA 
GAGGAACCAC TAGGTTACGC 
CATAAGATGT TCTGTCAAAA 
CTTCCTGAGT CGATTTCTTT 
TATCTAGGTT AAAATTAAAG 
TTTTGTATTT TTCTTTAATC 
TATAAGCACC AGCTGTAAAA 
TCAATTTCAC ATCGTCTACA 
ATTCAATAGC AGATTTTGAG 
TAGATGGATC CGCCTTCCCA 
GAAAAAGTAT CGTTGCAAGT 
ATTGAACCGT TTGGTCATCA 
CAGTGATATA GTCATCCAAA 
TTTTATCAGC TGCATATTGC 
CTTCTCCCTC AGAAGTAAAC 
CGTCCTGAGA AACAGTCCAA 
TTTCTAATAA CCCGTCTACC 
CTAGATAGTT CAAGCTAGAT 
TGCTAGAATT TCCACACGCG 
TTAGATATTT GCTCTTAGAC 
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CGCGAAACAC TTGGAAGAAC 
TTCTCATAAC GTTTGGCCGG 
ACATCCAGTC CAACTGCCTT 
TGCAGAAATC CTCCACTATT 
TCAGGTCCCC AACCGCCATG 
TAGCCTGAAC TGTCAAACTC 
GAACCTAAAA CAGATTCAAT 
TCTGTTACTT CCACAGTCTT 
TCTTTCTTAG CTTCCGCAAA 
CCATCCGCAA AGTTGATACC 
ACTTCACCAA AGTCTTTTCC 
AAAATCTTTG TTGCACCTTC 
GCAAAATTGA TAGCCTGACG 
TCAATGTCAC TTGTTTTAGA 
AAATATGAAG TTGAATTTTG 
CCTTCATAGC TGGAGCTGTT 
TTACGTTCCA GTGATTCTTG 
AAGACATTCT TAGCATCCCA 
ACAAGTGCTT TCATCAAGAA 
AAATCATCCC CTTTTGATTT 
GTTTTTGAAT TCCAGTAAAG 
AGTGCCTTGA CACCGACAGT 
CCAGCAACAG AGTCCTGCAC 
AAACCTGTCA CAAAATCCTG 
CACTTGGCAT CCTTACGAAG 
TCCTCTGCTA ATGATGGAAT 
AAATTTGCAA CAATATCGGA 
GGATCACTTG AATAAACATA 
CTCAATAAAA CTCCTGTACC 
TTTTTCATTT CCGG 



T AAAGAACT A 10140 

ATCTTGCTCT 10200 

AGCCTTGTCA 10260 

AGTATTAAAA 10320 

ATATAAATCA 10380 

ATCTGATGTT 10440 

TGATTGTTTG 10500 

ATCCAAGTGG 10560 

CTTAGCCTTG 10620 

TTGCCATTCC 10680 

CTTGATACTG 10740 

TTTCCCTTCA 10800 

GAAGTTTTTA 10860 

AGTATAATTG 10920 

CATACTATAG 10980 

AGGAAAAAGA 11040 

GTCGCTACCA 11100 

GTAATTAGGG 11160 

AGGTCCATTG 11220 
CAGGAAATCT * 11280 

TTCTGGTTTA 11340 

TGAAAAGTCG 11400 

TAGATACAAG 11460 

GGCAGTTACA 11520 

TTTGTAGGTA 11580 

AATATTCCCA 11640 

TGTTGCTGCG 11700 

GTTGTAGGTT 11760 

CAGGACAAGA 11820 
11864 
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(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

TAACTGCACT AAACATAATA TAAGGAGAGA AAATGTCTGC AATAGAACGT ATTACAAAAG 60 

CTGCTCACTT AATTGATATG AACGATATTA TCCGTGAAGG GAATCCTACT CTACGCGCGA 120 

TTGCTGAGGA AGTCACTTTC CCCCTATCTG ACCAGGAAAT CATCCTAGGC GAAAAGATGA 180 

TGCAATTCCT TAAACATTCC CAAGATCCTG TCATGGCTGA AAAAATGGGA CTCCGCGGTG 240 

GTGTTGGACT GGCTGCTCCC CAGTTAGATA TCTCAAAACG CATTATCGCT GTTTTGGTAC 300 

CTAATATTGT TGAAGAAGGC GAAACTCCAC AGGAAGCCTA CGATTTGGAA GCCATTATGT 360 

ACAATCCAAA AATCGTCTCT CACTCTGTTC AAGATGCTGC TCTTGGCGAA GGAGAAGGTT 420 

GCCTGTCTGT TGACCGTAAC GTGCCTGGCT ATGTTGTTCG CCATGCOCGC GTTACTGTTG 480 

ACTACTTTGA CAAAGATGGA GAAAAACACC GTATCAAACT CAAAGGCTAC AACTCCATTG 540 

TTGTTCAGCA TGAAATTGAC CACATTAACG GTATCATGTT TTACGATCGC ATCAATGAAA 600 

AAGACCCATT TGCAGTTAAA GATGGTTTAC TGATTCTTGA ATAAAGAAAA TCCCGTTGCA 660 

AGACGGGGTT TTGTGTTATA ATAGAGGCAT GAAAACAAAT GATATTGTCT ATGGTGTCCA 720 

CGCCGTTACC GAAGCCCTCC TTGCAAATAC AGGAAACAAA CTCTACCTCC AAGAAGATCT 780 

CCGAGGTAAG AATGTTGAGA AAGTCAAGGA ACTAGCTACA GAAAAGAAGG TGTCCATTTC 840 

TTGGACATCA AAAAAATCTC TCTCTGAGAT TACTGAAGGT GCTGTTCATC AAGGTTTTGT 900 

TCTACGAGTG TCTGAATTTG CCTATAGCGA GCTAGATTAC ATCCTTGCAA AAACACGCCA 960 

AGAAGAAAAT CCACTTCTAT TGATTCTAGA TGGTCTAACC GATCCCCATA ATCTGGGTTC 1020 

TATCTTGCGA ACAGCCGATG CGACCAATGT TTCAGGTGTC ATCATTCCCA AGCACCGTAC 1080 

TGTCGGAGTA ACTCCTGTCG TTGCCAAAAC AGCCACAGGT GCTATTGAAC ACGTtCCAAT 1140 

TGCCCGAGTG ACCAACCTCA GTCAAACCTT AGGATAAACT TAAGGATGAA GGTTTCTGGA 1200 

CCTTTGGAAC GGATATGAAC GGTACTCCTT GCCACAAGTG GAATACAAAA GGGAAAATCG 1260 

CCCTCATCAT TGGAAATGAA GGAAAAGGTA TCTCTAGCAA CATCAAAAAA CAGGTCGATG 1320 

AAATGATTAC CATTCCGATG AATGGACATG TTCAAAGCCT TAATGCCAGT GTTGCTGCGG 1380 
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CCATTCTCAT GTACGAAGTT TTCCGAAATA GACTATAAAA AAGTTTCCAG TCATCTGATT 1440 

GGAAACTTTT TTATGATTAA CTATGTTCTG TAATGAATTT ATAGGCTTCT TGACCAGCGA 1500 

TAGCTCCATC TCCAACCGCT GTTGTTACTT GGCGAAGGTC TTTCAAGCGA ACATCTCCAA 1560 

CTGCAAAGAT ACCGTCGACT GCAGTTTTCA TGTGGTTATC TGTCACAATC CATCCTGCCT 1620 

GATCTTGGAT ATTCAATTCT TTAACAAAAT CGCTAAGAGG GTCCAAACCA ACATAGATAA 1680 

AGACACCACC GAAGGCTTGT TCTGTCACTT GACCTGTTTT CACATTTTCA AATACGACTG 1740 

ATTCTACTCG GTTTTCACCC TTGATTTCCC TTACTACAGA ATCCCAGATA AAGCTGATTT 1800 

TTTCATTCGC AAAGGCGCGA TCTTGTAAAA CCTTTTGGGC ACGAAGTTGG TCACGACGGT 1860 

GAACAATGGT AACAGTCTTA GCAAAACGAG TCAAGAAGAG GGCTTCTTCA ACAGCTGAAT 1920 

CTCCACCACC AACTACCAAT AAATCTTGGT CACGGAAGAA AGCACCATCA CACACAGCAC 1980 

AGTAAGAAAC ACCACGACTG TTCAGTTCTT CTTCTCCAGG CACTCCCAAA GGACGGTGTT 2040 

TAGAACCAGT TGCTACGATA ACTGTACGTG TTTCATATGT TTGGTCATCA GTCATCACTT 2100 

TCTTAAAATC ACCATGGCTT CGACATTTTC AACATAACCA TAAATGTGCT CAACACCAAG 2160 

ATTTTCAAGT GGTTCAAACA TCTTTTCAGC CAATTCAGGT CCACTAATAT TAGCGTATCC 2220 

TGGGTAATTT TCGATATCAG ATGTATTATT CATCTGACCA CCTGGCAGAC CACCTTCAAT 2280 

CAAAGCTACT TTTAGATTGC TTCGAGCAGC ATACAAGGCC GCAGTCATCC cTGCAGGTCC 2340 

AGCACCGATA ATAATAGTAT CGTACATATA GATTCCTTCT TTCTTGGTGT AACTATCTTT 2400 

ATTCTAACTC TG 2412 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7760 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

CCGATTTGGT GGAATTTTTG TCTCATCATT TAGAAGGTGT TGCAAGAGCA GAGTTTACCT 60 

TGGTGCTTCA TACCAAATTG GGAGAAGCCT CTGTTTTGGC AAATATTGTA GATGTAAACA 120 

AGGATGAATG GATTTTAGGA ACAGTTGCTG GTGCCAATAC CTTATTGGTT ATTTGTCGAG 180 

ATCAGCACGT TGCCAAACTC ATGGAAGATC GTTTGCTAGA TTTGATGAAA GATAAGTAAG 240 

GTCTTGGGAG TTGCTCTCAA GACTTATTTT TGAAAAGGAG AGACAGAAAA TGGCGATAGA 300 

AAAGTTATCA CCCGGCATGC AACAGTATGT GGATATTAAA AAGCAATATC CAGATGCTTT 360 
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TTTGCTCTTT CGGATGGGTG ATTTTTATGA ATTATTTTAT GAGGATGCGG TCAATGCTGC 420 

GCAGATTCTG GAAATTTCCT TAACGAGTCG CAACAAGAAT GCCGACAATC CGATCCCTAT 480 

GGCGGGTGTT CCCTATCATT CTGCCCAACA GTATATCGAT GTCTTGATTG AGCAGGGTTA 540 

TAAGGTGGCT ATCGCAGAGC AGATGGAAGA TCCTAAACAA GCAGTTGGGG TTGTTAAACG 600 

AGAGGTTGTT CAGGTCATTA CGCCAGGGAC AGTGGTCGAT AGCAGTAAGC CGGACAGTCA 660 

GAATAATTTT TTGGTTTCCA TAGACCGCGA AGGCAATCAA TTTGGCCTAG CTTATATGGA 720 

TTTGGTGACG GGTGACTTTT ATGTGACAGG TCTTTTGGAT TTCACGCTGG TTTGTGGGGA 780 

AATCCGTAAC CTCAAGGCTC GAGAAGTGGT GTTGGGTTAT GACTTGTCTG AGGAAGAAGA 840 

ACAAATCCTC AGCCGCCAGA TGAATCTGGT ACTCTCTTAT GAAAAAGAAA GCTTTGAAGA 900 

CCTTCATTTA TTGGATTTGC GATTGGCAAC GGTGGAGCAA ACGGCATCTA GTAAGCTGCT 960 

CCAGTATGTT. CATCGGACTC AGATGAGGGA ATTGAACCAC CTCAAACCTG TTATCCGCTA 1020 

CGAAATTAAG GATTTCTTGC AGATGGATTA TGCGACCAAG GCTAGTCTGG ATTTGGTTGA 1080 

GAATGCTCGC TCAGGTAAGA AACAAGGCAG TCTTTTCTGG CTTTTGGATG AAACCAAAAC 1140 

GGCTATGGGG ATGCGTCTCT TGCGTTCTTG GATTCATCGC CCCTTGATTG ATAAGGAACG 1200 

AATCGTCCAA CGTCAAGAAG TAGTGCAGGT CTTTCTCGAC CATTTCTTTG AGCGTAGTGA 1260 

CTTGACAGAC AGTCTCAAGG GTGTTTATGA CATTGAGCGC TTGGCTAGTC GTGTTTCTTT 1320 

TGGCAAAACC AATCCAAAGG ATCTCTTGCA GTTGGCGACT ACCTTGTCTA GTGTGCCACG 1380 

GATTCGTGCG ATTTTAGAAG GGATGGAGCA ACCTACTCTA GCCTATCTCA TCGCACAACT 1440 

GGATGCAATC CCTGAGTTGG AGAGTTTGAT TAGCGCAGCG ATTGCTCCTG AAGCTCCTCA 1500 

TGTGATTACA GATGGGGGAA TTATCCGGAC TGGATTTGAT GAGACTTTAG ACAAGTATCG 1560 

TTGCGTTCTC AGAGAAGGGA CTAGCTGGAT TGCTGAGATT GAGGCTAAGG AGCGAGAAAA 1620 

CTCTGGTATC AGCACGCTCA AGATTGACTA CAATAAAAAG GATGGCTACT ATTTTCATGT 1680 

GACCAATTCG CAACTAGGAA ATGTGCCAGC TCACTTTTTC CGCAAGGCGA CGCTGAAAAA 1740 

CTCAGAACGC TTTGGAACCG AAGAATT AG C CCGTATCGAG GGAGATATGC TTGAGGCGCG 1800 

TGAGAAGTCA GCCAACCTCG AATACGAAAT ATTTATGCGC ATTCGTGAAG AGGTCGGCAA 1860 

GTACATCCAG CGTTTACAAG CTCTAGCCCA AGGAATTGCG ACGGTTGATG TCTTACAGAG 1920 

TCTGGCGGTT GTGGCTGAAA CCCAGCATTT GATTCGACCT GAGTTTGGTG ACGATTCACA 1980 

AATTGATATC CGGAAAGGGC GCCATGCTGT CGTTGAAAAG GTTATGGGGG CTCAGACCTA 2040 

TATTCCAAAT ACGATTCAGA TGGCAGAAGA TACCAGTATT CAACTGGTTA CAGGGCCAAA 2100 
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CATGAGTGGG AAGTCTACCT ATATGCGTCA GTTAGCCATG ACGGCGGTTA TGGCCCAGCT 2160 

GGGTTCCTAT GTTCCTGCTG AAAGCGCCCA TTTACCGATT TTTGATGCGA TTTTTACCCG 2220 

TATCGGAGCA GCAGATGACT TGGTTTCGGG TCAGTCAACC TTTATGGTGG AGATGATGGA 2280 

GGCCAATAAT GCCATTTCGC ATGCGACCAA GAACTCTCTC ATTCTCTTTG ATGAATTGGG 2340 

ACGTGGAACT GCAACTTATG ACGGGATGGC TCTTGCTCAG TCCATCATCG. AATATATCCA 2400 

TGAGCACATC GGAGCTAAGA CCCTCTTTGC GACCCACTAC CATGAGTTGA CTAGTCTGGA 2460 

GTCTAGTTTA CAACACTTGG TCAATGTCCA CGTGGCAACT TTGGAGCAGG ATGGGCAGGT 2520 

CACCTTCCTT CACAAGATTG AACCGGGACC AGCTGATAAA TCtACGGTAT CCATGTTGCC 2580 

AAGATTGCTG GCTTGCCAGC AGACCTTTTA GCAAGGGCGG ATAAGATTTT GACTCAGCTA 2640 

GAGAATCAAG GAACAGAGAG TCCTCCTCCC ATGAGACAAA CTAGTGCTGT CACTGAACAG 2700 

ATTTCACTCT TTGATAGGGC AGAAGAGCAT CCTATCCTAG CAGAATTAGC TAAACTGGAT 2760 

GTGTATAATA TGACACCTAT GCAGGTTATG AATGTCTTAG TAGAGTTAAA ACAGAAACTA 2820 

TAAAACCAAG ACTCACTAGT TAATCTAGCT GTATCAAGGA GACTTCTTTG ACAATTCTCC 2880 

ACTTTTTTGC TAGAATAACA TCACACAAAC AGAATGAAAA GGAGCTGACG CATTGTCGCT 2940 

CCCTTTTGTC TATTTTTTAA GGAGAAAGTA TGCTGATTCA GAAAATAAAA ACCTACAAGT 3000 

GGCAGGCCCT GGCTTCGCTC CTGATGACAG GCTTGATGGT TGCTAGTTCA CTTCTGCAAC 3060 

CGCGTTATCT GCAGGAAGTC TTAGGCGCCC TCCTTACTGG GAAATATGAA GCTATTTATA 3120 

GTATCGGGGC TTGGTTGATT -GGTGTGGCCG TAGTCGGTCT AGTTGCTGGT GGACTCAATG 3180 

TTGTCCTCGC AGCCTATATT GCCCAAGGAG TTTCATCCGA CCTTCGGGAG GATGCCTTCC 3240 

GTAAAATTCA AACCTTTTCT TATGCTGATA TTGAACAATT TAATGCGGGA AATCTAGTCG 3300 

TTCGAATGAC AAATGATATC AACCAGATTC AGAACGTTGT CATGATGACC TTCCAAATTC 3360 

TTTTCAGACT TCCCCTCTTG TTCATCGGTT CGTTTATCCT AGCGGTTCAA ACCTTACCTT 3420 

CTCTGTGGTG GGTGATTGTT CTCATGGTAG TCTTGATTTT TGGTTTGACT GCTGTCATGA 3480 

TGGGAATGAT GGGGCCTCGT TTTGCCAAGT TTCAAACCCT TCTTGAGCGC ATCAATGCCA 3540 

TTGCCAAGGA AAATTTACGT GGCGTTCGTG TGGTCAAGTC CTTTGTCCAA GAAAAAGAGC 3600 

AATTTGCTAA GTTTACAGAG GTCTCAGACG AGCTTCTTGG TCAAAACCTT TACATTGGTT 3660 

ATGCCTTTTC AGTAGTGGAA CCCTTTATGA TGTTGGTTGG TTACGGGGCG GTCTTCCTCT 3720 

CTATTTGGCT GGTCGCGGGA ATGGTTCAGT CGGATCCGTC TGTTGTTGGT TCCATCGCTT 3780 

CTTTTGTTAA TTACCTAAGC CAGATTATCT TTACCATTGT TATGGTTGGA TTTTTGGGAA 3840 

ATTCTGTCAG CCGTGCCATG ATTTCCATGC GTCGTATTCG AGAAATTCTT GACGCAGAGC 3900 
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CAGCTATGAC CTTCAAGGAT ATCCCAGATG AAGAGTTGGT TGGAAGTCTT AGCTTTGAAA 3960 

ATGTGACCTT TACCTATCCA ATGGACAAGG AACCGATGCT GAAAGATGTG AGCTTTACTA 4020 

TTGAACCTGG TCAAATGGTT GGTGTAGTTG GAGCGACTGG TGCAGGAAAG TCAACCTTGG 4080 

CTCAATTGAT TCCACGTCTC TTTGATCCAC AGGACGGGGC CATTAAAATC GGTGGCAAGG 4140 

ATATTCGAGA AGTGAGTGAA GGAACCCTGC GTAAAACAGT TTCCATCGTT CTCCAACGTG 4200 

CCATTCTTTT TAGTGGAACG ATTGCAGATA ACTTGAGACA GGGGAAGGGG AATGCTACTC 4260 

TATTTGAAAT GGAGCGCGCA GCCAATATTG CCCAGGCTAG TGAATTCATT CATCGTATGG 4320 

AGAAAACCTT TGAAAGTCCA GTTGAAGAAC GGGGAACCAA TTTCTCTGGT GGACAAAAAC 43 80 

AAAGGATGTC GATTGCGCGT GGGATTGTCA GCAATCCACG TATTCTGATT TTTGATGATT 4440 

CGACCTCAGC CTTGGATGCC AAATCAGAGC GCTTGGTGCA AGAAGCTTTG AATAAGGACT 4500 

TGAAGGGGAC GACAACCATT ATTATTGCTC AAAAAATTAG CTCGGTTGTC CATGCAGACA 4560 

AGATCTTGGT TCTAAATCAA GGACGATTGA TTGGTCAAGG TACGCATGCA GACTTGGTTG 4620 

CCAACAATGC CGTTTACCGT GAAATCTATG AAACACAGAA ATGAAAGACA AACTATAAGA 4680 

AAAGTCAATA GTTTTATCTA AACTATTTCT TATTTCAATT TGATGATTTG GCGATGATTT 4740 

TAGAGCACGG CAAAAAGCCC TTGAAAAAGT CCATTTTTTC AAAGGTAATC CTGTGTTAAT 4800 

TTCAGAAATT ACATCACTTT TTGTTCGTCA AATGGCAGCT CTTTTTTTAG GATATAAAAC 4860 

AGGGTTCGGA TAAGTTTTTT TGCAAGGTGG ATGATGGCTA CATTGTAATG TTTTCCTTGT 4920 

TCTAATTTAG TCTTAAGATA GGCCTTAAAA GCAGGCGAAA AGCGAGGGCA TGCTTTGGCA 4980 

GCTTGTATGA GTACCTACCG CAGATGAGGG GAACTCCGTT TGACCATTCT TCCTGCTAAA 5040 

TCAATCTGAT CTGACTGATA AATAGAAGAA TCCAGTCCAG CGAAAGCTTG TAATTGAGCA 5100 

GGATTATCAA AGGCATGAAT ATTTCGAATC TCAGCTAAAA TGACCGCCCC TAAACGATCC 5160 

CCAATCCCAG TAACCGTCGT GATGACCGAG TTGAACTCAG CCATCAAGTC ATTGACACAT 5220 

GTTTCCGCCT TGTCAATGAG CCTCTTGTAA TGTTTGATGT TTTCATTACA CGAGATAAAA 5280 

CGTCTATGCG TTATCAAACT CATTACCAAT TAAAACAAAA AGCTGTGGTT AGATCCTTTC 5340 

GGAAATTGTC AAGCGATTGG AGGAAATGAA CTAATCCACA GCGGCTTATT CCAAGTATAC 5400 

CACTTGGGCT TTGGCAGTAG CTAACTGCGC TAAATATAAT ATAAGGAGGA GTAAAATGAA 5460 

GACAGTTCAA TTTTTTTGGC ATTATTTTAA GGTCTACAAG TTCTCATTTG TAGTTGTCAT 5520 

CCTGATGATT GTTCTGGCGA CTTTTGCCCA AGCCCTCTTT CCAGTCTTTT CTGGACAAGC 5580 

GGTGACGCAG CTAGCCAATT TAGTTCAAGC TTATCAAAAT GGCAATCCAG AACTTGTATG 5640 
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GCAAAGCCTA TCAGGAATCA TGGTCAATCT TGGCCTGCTG GTTTTGGTTC TATTTATCTC 5700 

TAGTGTAATA TACATGTGTC TCATGACGCG CGTGATTGCA GAATCGACCA ACGAGATGCG 5760 

CAAAGGCCTC TTTGGTAAGC TTGCTCAGTT GACGGTTTCT TTCTTTGACC GTCGACAAGA 5820 

TGGCGATATC CTGTCTCATT TTACCAGTGA TTTGGATAAT ATCCTCCAAG CCTTTAACGA 5880 

AAGCTTGATT CAGGTCATGA GCAATATTGT TTTATACATT GGTCTGATTC TTGTCATGTT 5940 

TTCGAGAAAT GTGACGCTGG CTCTCATCAC CATTGCCAGC ACCCCATTGG CTTTCCTTAT 6000 

GCTGATTTTC ATCGTGAAAA TGGCACGCAA ATACACCAAC CTCCAGCAGA AAGAGGTAGG 6060 

GAAGCTCAAC GCCTATATGG ATGAGAGCAT CTCAGGCCAA AAAGCCGTGA TTGTGCAAGG 6120 

AATTCAAGAG GATATGATGG CAGGATTTCT TGAACAAAAT GAGCGCGTGC GCAAGGCAAC 6180 

CTTTAAAGGA AGAATGTTCT CAGGAATTCT TTTCCCTGTC ATGAATGGGA TGAGCCTGAT 6240 

TAATACAGCC ATCGTCATCT TTGCTGGTTC GGCTGTACTT TTGAATGATA AGTCTATTGA 6300 

AACAAGTACA GCCCTAGGTT TGATTGTTAT GTTTGCACAA TTTTCACAGC AGTACTACCA 6360 

GCCTATTATC CAAGTTGCAG CGAGTTGGGG AAGCCTTCAG TTGGCCTTTA CTGGAGCTGA 6420 

ACGAATTCAG GAAATGTTTG ATGCAGAGGA GGAAATCCGA CCTGAAAAGG CTCCAACCTT 6480 

CACTAAGTTG CAAGAAAGTG TTGAAATCAG TCATATCGTT TTTTCATACT TGCCTGATAA 6540 

ACCTATTTTG AAAGATGTCA GCATTTCTGC CCCTAAAGGC CAGATGACAG CAGTTGTTGG 6600 

GCCGACAGGT TCAGGAAAAA CGACTATTAT GAACCTCATC AATCGCTTTT ATGATGTTGA 6660 

TGCTGGTGGT ATTTATTTTG ATGGTAAAGA CATTCGTGGC TATGACTTAG ATAGTCTTAG 6720 

AAGCAAGGTG GGAATTGTAT TGCAAGATTC GGTCTTGTTT AGCGGAACGA TTAGAGACAA 6780 

TATCCGATTT GGTGTGCCAG ATGCTAGTCA GGAAATGGTT GAGGTAGCAG CAAAAGCAAC 6840 

CCACATTCAC GACTATATCG AAAGTTTGCC TGATAAGTAC GATACTCTTA TTGATGATGA 6900 

CCAGAGCATC TTTTCAACAG GGCAGAAGCA ATTGATTTCA ATCGCTCGAA CCCTGATGAC 6960 

AGATCCAGAA GTTCTCATTC TCGATGAAGC AACTTCAAAC GTAGATACGG TGACAGAAAG 7020 

CAAGATTCAG CATGCCATGG AGGTGGTTGT AGCAGGTAGA ACTAGTTTCG TCATTGCCCA 7080 

CCGCTTGAAA ACCATTCTCA ATGCAGATCA GATTATTGTC CTTAAAGATG GAGAAGTCAT 7140 

TGAACGTGGT AACCACCATG AACTTTTGAA GCTAGGTGGC TTTTATTCAG AACTCTATCA 7200 

CAATCAATTT GTTTTCGAAT AAGAAAGAAG TTGTCCTATG TGGGCAGCTT TTTCTTGTCC 7260 

ATAAAAAATG TTTATCACAG CCTTAAAAAA AACATATTAG ACGAAAGTCA TTTTGAGTGA 7320 

TATGATAGGA CTATCGTTAG CATTCGAAAG GAGAGGCATC ATGGCTAGAA CGGTTGTAGG 7380 

AGTTGCTGCA AATCTATGTC CCGTAGACGC AGAAGGCAAA ATCATTCATT CATCTGTATC 7440 
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TTGTAGATTC GCAGAGATCA TTCGTCAAGT CGGTGGTCTC CCTTTAGTCA TTCCTGTTGG 7500 

TGATGAGTCA GTTGTACGTG ATTATGTGGA AATGATTGAC AAACTCATTT TGACAGGAGG 7560 

CCAAAATGTT CATCCTCAGT TTTATGGAGA GAAAAAGACC GTCGAGAGCG ATGATTACAA 7620 

TCTGGTCCGT GACGAATTTG AATTGGCACT CTTGAAGGAA GCGCTTCGTC AGAATAAACC 7680 

AATTATGGCA ATCTGTCGCG GTGTCCAACT TGTCAATGTT GCCTTTGGTG GAACCCTCAA 7740 

TCAAGAAATC GAAGGTCAGG 7760 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

GAGGTTTTAA TTCACTTACC TCTsCCGTAT CTTTATTTAA AATGAATTCT TTTACGGTTG 60 

TATTTCTTGC AAAATCTTTT ACAACAATCT TAATGTTTAG TGTCTTGTCT ATTATTTGTT 120 

TAATATCATT AAATGATGTA TATTCTTTTC CATTTATATA AATATGTTGT TCTTGAATCT 180 

CACCATCGAA TCCATTATTT CTTTTATCAT TGATGTTAAA GACTACAGAT TTTCCATCAG 240 

CATATTCGAT ACTAGTATTT CCCTTAGGAT CAATGTTTAC TTCGGGTTTA ACATTATCAT 300 

ATAAAAACTG ATAGTGGACT CCAACTGCTT TAGCATTCAA ATCGCTATAG CCAGTTTGAA 360 

GATAAACATT TCCATCCATA TCTGTTACCT TATCTGGAAA TCCGTTTGCT TTATAGTCTT 420 

TCATTCCCCA GTCCATGATG TCACCGTCTT TAACATTCAG CTTAATATTA AAATCTCTAG 480 

TGTTATCAAT GTGTAAATCT CCGTAGATTA AATAATTATC TACAACCGAT TCATTAACTC 540 

TCAATTCCC A GTTAAAACCA CCCTTATCAG AAATCTTACC TCTTAAATAA AATTCTGGAT 600 

TTCGTACATA AATTTTATTA GATTTAGATG GATTAAAGTA GTTCTTATCC ATTGAAAGGT 660 

TTACTGGTTT GGTATCAATA AATAACATGG AGCCATCTTC TTTTATAGCT TCTACATTGA 720 

ACTTATCCTC TCCAGTGTAT TCTTTATCAT CCTTACCAAA TAATACAAGT TTAGAAGAAT 780 

CTGTCACAAG ATTTCCGTCT TTATCGATAG CTTCCCCTTT ATCGTTCATT TTAAATGTAA 840 

ACACTTGATA CCTTATAATG TTAAAGCCGT CCAAAGCCGA CATTAATACA GATTGGGTAC 900 

TTCTTCCATC TTCAACATTT CTACTATCAG CATAAATTGT TGTTTCTGAA AGGGCTCTTA 960 

GATTAGGATT GGCCTTTTGT ATTTTTGCTA TATCTTCCTT GCTATAGACT CCATTTCCTT 1020 
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CTAACATATC CGTTTTTCCA GGATTATAGG TAGTCACTTT TAGTGCATAG CCTTTTCTTA 1080 

GAATGATATT ATCCTTTAAC AGATATTGTT GTTTTTCTGA ATCAGAATAG ATTTTACCAG 1140 

ATTCCATTTT AGTTAAATTG TCTGGTTTGT TTTTTGAAAG ATCTCCTTCC CCTAATTCTA 1200 

TGACATTCCC ATAACTTGAT ACATAGGGAT ATTCTGATTT AGTTTCCTTA ATTTTTTCAG 1260 

GCATTCTAAT TTTAATTTCA GCTTTTTTCT GATCATTATC TTTAACAAAT AATCTCATAT 1320 

CTCCTGCAAA AGCTAATCCA TCCACAATAT CATTAATATT AGCGTATAGA TCAAATGTCA 1380 

TCGTTTTTGA GTGGAAATCA TACTTGGTCG CTTTGATTTC TATAGATTTA TAGTTATTCC 1440 
CATAATATAC CTTGGCATTT TTAGAAACAT TACTTATCTT TCCAAGAATT TCAAAGTGTC * 1500 

CATCTTTAGA CGGACTTAGA ACACCATAAA TTTTTGATTT GATTTCGTCA AGTTTCTCAG 1560 

TTTCATATTC TAGATCAGTC CCATCATCGT AGGCTATTAT ATTTCCTTTA TCATCGTATT 1620 

TATAATCGTA TTCCTCCATT CTCTTACCAG TTTCACTTGT AAAATCATCA ACTTCTCTAA 1680 

ATTTCTTTTT AATGAGTTTC TTTAAGTCTT TATTTTCAAA GTCTCTAATT GTTGAAATAT 1740 

TTCTATCAAT AGTAAAACTA GATTTTTCTT TAATAGACTC TTCATTTTCT TGATGATGAT 1800 

GTTCTACCCC AGTTGTATCT TTTTTTAGAC TACCCTCTTT TCCATTTCCT AAATTTTTAA 1860 

ATTTAGATTC TGCAATCTCG CCAAGCTTTT GATATTTAGA TGAATCTTGA TCAGGATCTA 1920 

CTAGATAATA GGAAATCATC CCCTTTTCAT CAGCCTGATT AGCAAATTTA ATTCTATGAA 1980 

TCTTTGTGAA ATTGCTAGAA CCATCTAATG CAATGACTTC AATGATTTTT CCCCTTAAAT 2040 

CTCCCGCACC TTTAATTTCA TAAATGGTAT TTCCGTCTTT ATCAAGTTTT CTATTTCTTC 2100 

CTTGACCCTC ACCTGCGTAA GTTACTTCAA GATTTTTTTC AACCTCTCCA TCTTCATTAA 2160 

CAAGAGCGGC GCCAGCATAC CAAACTTCGT TCGCAATCTC GTCAAATTTT TCAGGATGTT 2220 

CTTTTTGATC TCTCGCAAAT AGCGTTTCAT TCTTATACTG ATCTTTTACC TTATGATAAG 2280 

TATCCTTTGT AATCAACTTA ATTTTTTCAG GATTTGAAAA ATCAACCGAA ACAATCTTAG 2340 

GGGCGGTGTT ATCAATTTTT ACAGGAATAT AGGAAACCTG CCATGGGTAA TCTTTAGTTA 2400 

ATCTATATTT AAATTTATAG AAATATTGAC CTTCCGCAAT CGGTTCAAAT TGACCTCTTA 2460 

TCTTAGTAGC AGGATCTTGA TTATCCTTAC TTTCTGGTGC ATTTTCTTCT CTACCTCTAG 2520 

GATTATAGAT GAGTCCATCC CACTTCAAGT CACCCCAAAC TTTTAGTTTA GATGATTTGA 2580 

TTCCCTTTGC ATCATTGCTT TTAGAATTTA AAATTCCTCT AATAAAGTGT TCTCTCGAAA 2640 

TGACTTTTAA GTCTCTTTGA TTTTCTCCCT CTTTATTTGT AT TT ACT ATT GAAATCAATC 2700 

CTTCTTCTGC ACTTCTTAAT ACA 2723 
(2) INFORMATION FOR SEQ ID NO: 65: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11831 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

AAAAAAGTGG GAATGACTCA AATCTTCACT GAAGCTGGCG AATTGATCCC TGTAACAGTT 60 

ATTGAAGCAA CTCCAAACGT TGTTCTTCAA GTTAAAACTG TTGAAACAGA "CGGATACAAC 120 

GCTATCCAAG TTGGTTTCGA TGACAAACGC GAAGTATTGA GCAACAAACC TGCTAAAGGA 180 

CATGTAGCGA AAGCTAACAC GGCTCCTAAG CGCTTCATTC GTGAATTCAA AAACGTTGAA 240 

GGCTTGGAAG TTGGTGCTGA AATTACAGTT GAAACATTCG CAGCTGGAGA -CGTTGTTGAC 300 

GTAACGGGTA CTTCTAAAGG TAAAGGTTTC CAAGGTGTTA TCAAACGCCA CGGACAATCA 360 

CGTGGACCAA TGGCTCACGG TTCTCGTTAC CACCGTCGTC CAGGTTCTAT GGGGCCTGTT 420 

GCACCTAACC GCGTATTCAA AGGTAAAAAC CTTGCAGGAC GTATGGGTGG CGACCGCGTA 480 

ACAATTCAAA ACCTTGAAGT TGTAC AAGTT GTTCCAGAAA AGAACGTTAT CCTTATCAAA 540 

GGTAACGTAC CAGGTGCTAA GAAATCTCTT ATCACTATCA AATCAGCAGT TAAAGCTGGT 600 

AAATAATAAA GAAAGGGGAA ATCAGTCACA ATGGCAAACG TAACATTATT TGACCAAACT 660 

GGTAAAGAAG CTGGCCAAGT TGTTCTTAGC GATGCAGTAT TTGGTATCGA ACCAAATGAA 720 

TCAGTTGTGT TTGATGTAAT CATCAGCCAA CGCGCAAGCC TTCGTCAAGG AACACACGCT 780 

GTTAAAAACC GCTCTGCAGT ATCAGGTGGT GGACGCAAAC CATGGCGTCA AAAAGGAACT 840 

GGACGTGCTC GTCAAGGTTC TATCCGCTCA CCACAATGGC GTGGTGGTGG TGTTGTCTTC 900 

GGACCAACTC CACGTTCATA CGGCTACAAA CTTCCACAAA AAGTTCGTCG CCTAGCTCTT 960 

AAATCAGTTT ACTCTGAAAA AGTTGCTGAA AACAAATTCG TAGCTGTAGA CGCTCTTTCA 1020 

TTTACAGCTC CAAAAACTGC TGAATTTGCA AAAGTTCTTG CAGCATTGAG CATCGATTCT 1080 

AAAGTTCTTG TTATCCTTGA AGAAGGAAAT GAATTCGCAG CTCTTTCAGC TCGTAACCTT 1140 

CCAAACGTGA AAGTTGCAAC TGCTACAACT GCAAGTGTTC TTGACATCGC AAATAGCGAC 1200 

AAACTTCTTG TCACACAAGC AGCTATCTCT AAAATCGAGG AGGTTCTTGC ATAATGAATT 1260 

TGTATGATGT TATCAAAAAA CCTGTCATCA CTGAAAGCTC AATGGCTCAA CTTGAAGCAG 1320 

GAAAATATGT ATTTGAAGTT GACACTCGTG CACACAAACT TTTGATCAAG CAAGCTGTTG 1380 

AAGCTGCTTT CGAAGGTGTT AAAGTTGCCA ATGTTAACAC AATCAACGTA AAACCAAAAG 1440 



0 
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CTAAACGTGT TGGACGTTAC ACTGGTTTTA CTAACAAAAC TAAAAAAGCT ATCATCACAC 1500 

TTACAGCTGA TTCTAAAGCA ATCGAGTTGT TTGCTGCTGA AGCTGAATAA TCTAAGGAGG 1560 

AAATATCGTG GGAATTCGTG TTTATAAACC AACAACAAAC GGTCGCCGTA ATATGACTTC 1620 

TTTGGATTTC GCTGAAATCA CAACAAGCAC TCCTGAAAAA TCATTGCTTG TTGCATTGAA 1680 

GAGCAAGGCT GGTCGTAACA ACAACGGTCG TATCACAGTT CGTCACCAAG GTGGTGGACA 1740 

CAAACGTTTC TACCGTTTGG TTGACTTCAA ACGTAATAAA GACAACGTTG AAGCAGTTGT 1800 

TAAAACAATC GAGTACGATC CAAACCGTTC TGCAAACATC GCTCTTGTAC ACTACACTGA 1860 

CGGTGTGAAA GCATACATCA TCGCTCCAAA AGGTCTTGAA GTAGGTCAAC GTATCGTTTC 1920 

AGGTCCAGAA GCAGATATCA AAGTCGGAAA CGCTCTTCCA CTTGCTAACA TCCCAGTTGG 1980 

TACTTTGATT CACAACATCG AGTTGAAACC AGGTCGTGGT GGTGAATTGG TACGTGCTGC 2040 

TGGTGCATCT GCTCAAGTAT TGGGTTCTGA AGGTAAATAT GTTCTTGTTC GTCTTCAATC 2100 

AGGTGAAGTT CGTATGATTC TTGGAACTTG CCGTGCTACA GTTGGTGTTG TCGGAAACGA 2160 

ACAACATGGA CTTGTAAACC TTGGTAAAGC AGGACGTAGC CGTTGGAAAG GTATCCGGCC 2220 

AACAGTTCGT GGTTCTGTAA TGAACCCTAA CGATCACCCA CACGGTGGTG GTGAAGGTAA 2280 

AGCACCAGTT GGTCGTAAAG CACCATCTAC TCCATGGGGC AAACCTGCTC TTGGTCTTAA 2340 

AACTCGTAAC AAGAAAGCGA AATCTGACAA ACTTATCGTT CGTCGTCGCA ACGAGAAATA 2400 

ATATTAAACT AGTCGCTTAA GCAACTAGTA AATCCGCCAG CTCGGTAGCG CTCCATAGGA 2460 

GTGCAAGCCG CTGTGGTACA ACATTTAAAG GAGAAAATAT AAAAATGGGA CGCAGTCTTA 2520 

AAAAAGGACC TTTCGTCGAT GAGCATTTGA TGAAAAAAGT TGAAGCTCAA GCTAACGACG 2580 

AAAAGAAAAA AGTTATTAAA ACTTGGTCAC GTCGTTCAAC GATCTTCCCA AGTTTCATTG 2640 

GTTACACTAT TGCAGTTTAT GACGGACGTA AACACGTACC TGTTTACATC CAAGAAGACA 2700 

TGGTAGGCCA CAAACTTGGT GAATTTGCAC CAACTCGTAC TTACAAAGGT CACGCTGCAG 27 60 

ACGACAAGAA AACACGTAGA AAATAAGGAG AACATAAATG GCAGAAATTA CTTCAGCTAA 2820 

AGCAATGGCT CGTACAGTAC GTGTTTCACC TCGTAAATCA CGTCTTGTTC TTGATAACAT 2880 

CCGTGGTAAA AGCGTAGCCG ATGCAATCGC AATCTTGACA TTCACTCCAA ACAAAGCTGC 2940 

TGAAATCATC TTGAAAGTTT TGAACTCAGC TGTAGCTAAC GCTGAAAACA ACTTTGGTTT 3000 

GGATAAAGCT AACTTGGTAG TATCTGAAGC ATTCGCAAAC . GAAGGACCAA CTATGAAACG 3060 

TTTCCGTCCA CGTGCGAAAG GTTCAGCTTC ACCAATCAAC AAACGTACAG CTCACATCAC 3120 

TGTAGCTGTT GCAGAAAAAT AAGGAGGTAA AATCGTGGGT CAAAAAGTAC ATCCAATTGG 3180 

TATGCGTGTC GGCATCATCC GTGATTGGGA TGCCAAATGG TATGCTGAAA AAGAATACGC 3240 
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GGATTACCTT CATGAAGATC TTGCAATCCG 
AGCAGTTTCA ACTATTGAAA TCGAACGCGC 
TGCTAAACCA GGTATGGTTA TCGGTAAAGG 
ACTTAACAAA TTGACTGGAA AACAAGTACA 
TTTGGATGCT CACCTTGTAG GTGAAGGAAT 
CCGTCGTGCA CAAAAACAAG CAATCCAACG 
AACTCAAGTA TCAGGTCGTT TGAACGGTGC 
AGGAACTGTT CCGCTTCACA CACTTCGTGC 
TACTACATAC GGTAAACTTG GTGTTAAAGT 
TCGTAAAAAC ACTAAAGGAG GTAAATAACC 
TCGTGAGTTC CGTGGAAAAA TGCGCGGTGA 
TGAATACGGT CTTCAAGCTA CAACTAGCCA 
TCGTATCGCC ATGACTCGTT ACATGAAACG 
ACACAAATCA TACACTGCTA AAGCTATCGG 
TGAAGGTTGG GTAGCACCAG TTAAACGTGG 
TGAAGAGATT GCACGTGAAG CGCTTCGACT 
ATTCGTAAAA CGTGAAGCAG AATAAGGAGA 
TGTTAAAGAA CTTCGTGGTC TTTCTCAAGA 
AAAAGAATTG TTTGAACTTC GTTTCCAAGC 
CTTGAAAGAA GTTAAAAAAC AAATCGCTCG 
ATAGACTAGG GAAGGAGAAA TTTCAATGGA 
TGTTGTATCT GACAAAATGG ACAAGACAAT 
CCCAGTCTAT GGTAAACGTA TTAACTACTC 
TGTTGCCAAA GAAGGCGATA TCGTACGTAT 
ACGTTTCCGT CTTGTAGAAG TTGTTGAAGA 
AGAAAACTGA AATGATTCAA ACAGAAACTC 
GCGAAATCTT GACTATCAAA GTTCTTGGTG 
ATGTTATCGT GGCATCTGTA AAACAAGCTA 
TTGTTAAAGC AGTTATCGTT CGTACTAAAT 



541 

TAAATTCGTT CAAAAAGAAC 
AGTAAACAAA GTTAACGTTT 
TGGTGCTAAC GTTGATGCaC 
CATCAACATC ATCGAAATCA 
TGCTCGTCAA TTGGAGCAAC 
TGCAATGCGT -GCTGGAGCTA 
AGATATCGCC CGTGCTGAAG 
AGATATCGAT TACGCTTGGG 
ATGGATCTAC CGTGGTGAAG 
AATGTTAGTA CCTAAACGTG 
AGCAAAAGGT GGAAAAGAAG 
CTGGATCACT AACCGCCAAA 
TGGTGGTAAA GTTTGGATTA 
TGTGCGTATG GGATCTGGTA 
TAAAGTGATG TTCGAAATCG 
TGCTAGCCAC AAATTGCCAG 
AGGCATGAAA CTTAATGAAG 
AGAACTCGCG AAGCGCGAAA 
TGCTACTGGT CAATTGGAAC 
CATCAAAACA GTTCAATCTG 
ACGCAATAAT CGTAAAGTTC 
CACAGTTGTA GTTGAAACAA 
TAAAAAATAC AAAGCTCATG 
CATGGAAACT CGCCCGCTTT 
AGCGGTCATC ATCTAATCAA 
GTTTGAAAGT CGCAGACAAC 
GTTCAGGACG TAAATTTGCA 
CTCCTGGTGG TGCGGTTAAA 
CAGGTGCTCG TCGTGCTGAT 



TTGCTGACGC 3300 

CACTTCACAC 3360 

TCCGTGCAAA 3420 

AACAACCTGA 3480 

GTGTTGCTTT 3540 

AAGGAATCAA 3600 

GATACTCTGA 3660 

AAGAAGCAGA 3720 

TTCTTCCAGC 3780 

TTAAACACCG 3840 

TAGCATTCGG 3900 

TCGAAGCTGC 3 960 

AAATCTTCCC 4020 

AAGGGGCACC 4080 

CTGGTGTATC 4140 

TTAAATGTAA 4200 

TAAAAGAATT 4260 

ACGAATTGAA 4320 

AAACAGCTCG 4380 

AAGCGAAATA 4440 

TTGTTGGACG 4500 

AACGTAACCA 4560 

ATGAAAACAA 4620 

CAGCTACAAA 4680 

ACCTGAAAGG 4740 

AGCGGTGCTC 4800 

AACATCGGTG 4860 

AAAGGTGACG 4920 

GGTTCATACA 4980 
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TCAAATTTGA CGAAAACGCA GCAGTTATCA TCCGTGAAGA CAAAACTCCT CGCGGAACAC 5040 

GTATCTTTGG CCCAGTTGCA CGTGAATTGC GTGAAGGTGG CTTCATGAAG ATCGTGTCAC 5100 

TTGCTCCAGA AGTACTTTAA TTTTTAGGAA CAAACTAGTC CCCTAGCTTC AAGCTAGGGT 5160 

GCCCTTATGG GCGTAAGAAA AATCAAGGAG AAACCTAATG TTTGTAAAAA AAGGCGACAA 5220 

AGTTCGCGTA ATCGCTGGTA AAGATAAGGG AACAGAAGCT GTTGTCCTTA CTGCCCTTCC 5280 

AAAAGTAAAC AAAGTTATCG TTGAAGGTGT TAACATTGTT AAGAAACACC AACGTCCAAC 5340 

TAACGAGCTT CCTCAAGGTG GTATCATCGA GAAAGAAGCA GCTATCCACG TATCAAACGT 5400 

TCAAGTTTTG GACAAAAATG GTGTAGCTGG TCGTGTTGGA TACAAATTTG TAGACGGTAA 5460 

AAAAGTTCGC TACAACAAAA AATCAGGCGA AGTGCTTGAT TAATCACGAA GGAAAGGAGA 5520 

AGTATAATGG CAAATCGTTT AAAAGAAAAA TATCTTAATG AAGTAGTTCC TGCTTTGACA 5580 

GAACAATTCA ACT ACT CATC AGTGATGGCT GTGCCTAAAG TAGATAAGAT TGTTTTGAAC 5640 

ATGGGTGTTG GTGAAGCTGT ATCAAACGCT AAAAGCCTTG AAAAAGCTGC TGAAGAATTG 5700 

GCACTTATCT CAGGTCAAAA ACCACTTATC ACTAAAGCTA AAAAATCAAT CGCCGGCTTC 5760 

CGTCTTCGTG AAGGTGTTGC GATCGGTGCA AAAGTTACCC TTCGTGGTGA ACGTATGTAC 5820 

GAATTCTTGG ATAAATTGGT ATCAGTTTCA CTTCCACGTG TACGTGACTT CCACGGTGTC 5880 

CCAACAAAAT CATTTGATGG ACGCGGGAAC - TACACACTTG GTGTGAAAGA ACAATTAATC 5940 

TTCCCAGAAA TCAACTTCGA TGACGTTGAC AAAACTCGTG GTCTTGACAT CGTTATCGTA 6000 

ACAACTGCTA ACACTGACGA' AGAGTCACGT GCATTGCTTA CAGGCCTTGG AATGCCTTTT 6060 

GCAAAATAAT ATAGGAGGTA AATCTAATGG CTAAAAAATC AATGGTAGCT AGAGAGGCTA 6120 

AACGCCAAAA AATTGTTGAC CGTTATGCTG AAAAACGTGC TGCATTAAAG GCGGCAGGGG 6180 

ACTACGAAGG TTTATCTAAA TTACCTCGCA ACGCCTCACC GACTCGTTTA CATAATCGTT 6240 

GTAGGGTTAC GGGGCGCCCA CATTCAGTTT ACCGCAAATT TGGTCTGAGT CGTATCGCTT 6300 

TTCGCGAACT TGCGCATAAA GGTCAAATTC CTGGTGTAAC AAAAGCATCT TGGTAATTTA 6360 

AGATATCAAG AGCGTCAAAA CTCCAAGTAA AAATAGGAAA CTTGACGAAG AAACTAAAGT 6420 

TTCTAGGAAA GTTTATCTTT TTCACACAGA GTTTAGCCCG GGTTCAATTG GGCTTGCCAA 6480 

TTTGAACACG AGCTACAGCT TTGGCAAAAA AGACCAATTT GCTTTGGAGC ATTGCTTCTG 6540 

CATTAAATTG TCTATTTTTG CTCGTGCTGT TACGCTCTTT GTATCATGTA TTAACTAGCA 6600 

AGTGCAACTT GCAAACTACT AGTAAGAGGA GAAAAACAAA ATGGTTATGA CTGACCCAAT 6660 

CGCAGACTTC CTAACTCGTA TTCGTAATGC TAACCAAGCT AAACACGAAG TACTTGAAGT 6720 

ACCTGCATCA AACATCAAAA AAGGGATTGC TGAAATCCTT AAACGCGAAG GTTTTGTAAA 6780 
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AAACGTTGAA ATCATTGAAG ATGACAAACA AGGCGTCATC CGTGTATTTC TTAAATACGG 6840 

ACCAAATGGT GAGAAAGTTA TCACTAACTT GAAACGTGTT TCTAAACCAG GACTTCGTGT 6900 

CTACAAAAAA CGTGAAGACC TTCCAAAAGT TCTTAACGGA CTTGGAATTG CCATCCTTTC 6960 

AACTTCTGAA GGTTTGCTTA CTGATAAAGA AGCACGCCAA AAGAATGTTG GTGGTGAGGT 7020 

TATCGCTTAC GTTTGGTAAA ATCAAGATAC AAAGCTCGTA AAGAACAAAG CAAAATTAGG 7080 

AAGTTGGAGA AGTTTGTTTA CAAACAAGCC AACTTATCTA TTTTGCACAG TTCTTAGAGC 7140 

GTGTTCAGTT CAGCTCTTGA ACTAAATAAG TATCTGAACC CCGTGAAAAC TGGCCGTTCT 7200 

GGCCTGACAA TTTAACAGGA GAAAATAAAC ATGTCACGTA TTGGTAATAA AGTTATCGTG 72 60 

TTGCCTGCTG GTGTTGAACT CGCTAACAAT GACAACGTTG TAACTGTAAA AGGATCTAAA 7320 

GGAGAACTTA CTCGTGAGTT CTCAAAAGAT ATTGAAATCC GTGTGGAAGG TACTGAAATA 7380 

ACTCTTCACC GTCCAAACGA TTCAAAAGAA ATGAAAACTA TCCACGGAAC TACTCGTGCC 7440 

CTTTTGAACA ACATGGTTGT TGGTGTATCA GAAGGATTCA AGAAAGAACT TGAAATGCGT 7500 

GGGGTTGGTT ACCGTGCACA GCTTCAAGGA TCTAAACTTG TTTTGGCTGT TGGTAAATCT 7560 

CATCCAGACG AAGTTGAAGC TCCAGAAGGA ATTACTTTTG AACTTCCAAA CCCAACAACA 7620 

ATCGTTGTTA GCGGAATTTC AAAAGAAGTA GTTGGTCAAA CAGCTGCTTA CGTACGTAGC 7680 

CTTCGTTCAC CAGAACCATA TAAAGGTAAA GGTATCCGTT ACGTTGGTGA ATTCGTTCGC 7740 

CGTAAAGAAG GTAAAACAGG TAAATAATGT TGAGTGGTTG ATCATCAACC ACCAACCTAT 7800 

TTTCCAACTT TGTGCATAGC ACACGATTTA AAACTAAAGA GGTGAAAACT GTGATTTCAA 7860 

AACCAGATAA AAACAAACTC CGCCAAAAAC GCCACCGTCG CGTTCGCGGA AAACTCTCTG 7920 

GAACTGCTGA TCGCCCACGT TTGAACGTAT TCCGTTCTAA TACAGGCATC TACGCTCAAG 7980 

TGATTGATGA CGTAGCGGG* GTAACGCTCG CAAGTGCTTC AACTCTTGAT AAAGAAGTTT 8040 

CAAAAGGAAC TAAAACTGAA CAAGCCGTTG CTGTCGGTAA ACTCGTTGCA GAACGTGCAA 8100 

ACGCTAAAGG TATTTCAGAA GTGGTGTTCG ACCGCGGTGG ATATCTATAT CACGGACGTG 8160 

TGAAAGCTTT GGCTGATGCA GCTCGTGAAA ACGGATTGAA ATTCTAATAG GAGGACACTA 8220 

GAAAATGGCA TTTAAAGACA ATGCAGTTGA ATTAGAAGAA CGCGTAGTTG CTGTCAACCG 8280 

TGTTACAAAA GTTGTTAAAG GTGGACGTCG TCTTCGTTTC GCAGCTCTTG TTGTTGTTGG 8340 

TGACCACAAT GGTCGCGTAG GATTTGGTAC TGGTAAAGCT CAAGAAGTTC CAGAAGCAAT 8400 

CCGTAAAGCA GTAGATGATG CTAAGAAAAA CTTGATCGAA GTTCCTATGG TTGGAACAAC 8460 

AATCCCACAC GAAGTTCTTT CAGAATTCGG TGGAGCTAAA GTATTGTTGA AACCTGCTGT 8520 
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AGAAGGTTCT GGAGTTGCCG CTGGTGGTGC AGTTCGTGCC GTTGTGGAAT TGGCAGGTGT 8580 

GGCAGATATT ACATCTAAAT CACTTGGTTC TAACACTCCA ATCAACATTG TTCGTGCAAC 8640 

TGTTGAAGGT TTGAAACAAT TGAAACGCGC TGAAGAAATT GCTGCCCTTC GTGGTATTTC 8700 

AGTTTCTGAT TTGGCATAAG AAAGGGGATA AAATGGCTCA AATTAAAATT ACTTTGACTA 8760 

AGTCTCCAAT CGGACGCATT CCATCACAAC GTAAAACTGT TGTAGCACTT GGACTTGGCA 8820 

AATTGAACAG CTCTGTTATT AAAGAAGATA ACGCTGCTAT CCGTGGTATG ATCACAGCAG 8880 

TATCTCACTT AGTAACAGTT GAAGAAGTAA ACTAATGAaG TTTTAGGGGA TGTGCACTGT 8940 

ACCATCCCCT AAAACTAGAT ATAGTCATCT ATGATGACAT CGTATAGGCG AGTTGATGGG 9000 

GGAGACAACC TTTTCTCCCT TATCGGCGCT AGCATTTTAC AAAAGAGGAG AAAATAAAAA 9060 

TGAAACTTCA TGAATTGAAA CCTGCAGAAG GTTCTCGTAA AGTACGTAAC CGCGTTGGTC 9120 

GTGGTACTTC ATCAGGTAAC GGTAAAACAT CTGGTCGTGG TCAAAAAGGT CAAAAAGCTC 9180 

GTAGCGGTGG CGGAGTTCGC CTTGGTTTTG AAGGTGGACA AACTCCATTG TTCCGTCGTC 9240 

TTCCAAAACG TGGATTCACT AACATCAACG CTAAAGAATA CGCAATTGTG AACCTTGACC 9300 

AATTGAACGT CTTTGAAGAT GGTGCTGAAG TAACTCCAGT TGTTCTTATC GAAGCAGGAA 9360 

TTGTTAAAGC TGAAAAGTCA GGTATTAAAA TTCTTGGTAA CGGTGAGTTG ACTAAGAAAT 9420 

TGACTGTGAA AGCAGCTAAA TTCTCTAAAT CAGCTGAAGA AGCTATCACT GCTAAAGGTG 9480 

GTTCAGTAGA AGTCATCTAA GAGAGGTGAC CTATGTTTTT TAAATTATTA AGAGAAGCTC 9540 

TTAAAGTCAA GCAGGTTCGA TCAAAAATTT TATTTACAAT TTTTATCGTT TTGGTCTTTC 9600 

GTATCGGAAC TAGCATTACA GTTCCTGGTG TGAATGCCAA TAGCTTGAAT GCTTTAAGTG 9660 

GATTATCCTT CTTAAACATG TTGAGCTTGG TGTCGGGGAA TGCCCTAAAA AACTTTTCGA 9720 

TTTTTGCCCT AGGAGTTAGT CCCTATATCA CCGCTTCTAT TGTTGTCCAA CTCTTGCAAA 9780 

TGGATATTTT ACCCAAGTTT GTAGAGTGGG GTAAACAAGG GGAAGTAGGT CGAAGAAAAT 9840 

TGAATCAAGC TACTCGTTAT ATTGCTCTAG TTCTCGCTTT TGTGCAATCT ATCGGGATTA 9900 

CAGCTGGTTT TAATACCTTG GCTGGAGCTC AATTGATTAA AACTGCTTTA ACTCCACAAG 9960 

TTTTTCTGAC GATTGGTATC ATCTTAACAG CTGGTAGTAT GATTGTCACT TGGTTGGGTG 10020 

AGCAAATTAC AGATAAGGGA TACGGAAACG GTGTTTCCAT GATTATCTTT GCCGGGATTG 10080 

TTTCCTCAAT TCCAGAGATG ATTCAGGGCA TCTATGTGGA CTACTTTGTG AACGTCCCAA 10140 

GTAGCCGTAT CACTTCATCT ATCATTTTCG TAATCATTTT GATTATTACT <3TATTGTTGA 10200 

TTATTTACTT TACAACTTAT GTTCAACAAG CAGAATACAA AATTCCAATC CAATATACTA 10260 

AGGTTGCACA AGGTGCTCCA TCTAGCTCTT ACCTTCCGTT AAAAGTAAAC CCTGCTGGAG 10320 
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TTATCCCTGT TATCTTTGCC AGTTCGATTA CTGCAGCcTG CGGCTATTCT TCAGTTTTTG 10380 

AGTGCCACAG GTCATGATTG GGCTTGGGTA AGGGTAGCAC AAGAGATGTT GGCAACTACT 10440 

TCTCCAACTG GTATTGCCAT GTATGCTTTG TTGATTATTC TCTTTACATT CTTCTATACG 10500 

TTTGTACAGA TTAATCCTGA AAAAGCAGCA GAGAkCCTAC AAAAGAGTGG TGCCTATATC 10560 

CATGGAGTTC GTCCTGGTAA AGGTACAGAA GAATATATGT CTAAACTTCT TCGTCGTCTT 10620 

GCAACTGTTG GTTCCCTCTT CCTTGGTGTG ATTTCCATTT TACCGATTGC AGCTAAAGAT 10680 

GTATTTGGTC TTTCTGATGT TGTTGCCTTT GGTGGAACAA GTCTCTTGAT CATTATCTCT 10740 

ACAGGTATCG AAGGAATCAA GCAATTGGAA GGTTACCTAT TGAAACGTAA GTATGTTGGT 10800 

TTCATGGACA GAACAGAATA AAAGTATTTA CTGAATCAGT AAATACTGAG GGAGTGGAGG 10860 

TTTAAACTCT GACATTTGTA AGAGTTGGAT CTCCCCTCTT CTATTTTGTT TTTAAATCGG 10920 

GGTGAAAAGA CTTTTTGCTT CTATTTAAAA ATAAAATAAG GAGATCAAAT CATGAATCTT 10980 

TTGATTATGG GCTTACCTGG TGCAGGTAAG GGAACTCAAG CAGCAAAAAT CGTAGAACAA 11040 

TTCCATGTTG CACATATCTC AACAGGTGAT ATGTTCCGCG CTGCAATGGC AAATCAAACT 11100 

GAAATGGGTG TTCTTGCTAA GTCATATATT GACAAGGGTG AATTGGTTCC TGACGAAGTT 11160 

ACAAATGGAA TCGTAAAAGA ACGCCTTTCA CAAGATGATA TTAAAGAAAC AGGATTCTTA 11220 

TTGGATGGTT ACCCACGTAC AATTGAACAA GCTCATGCCT TGGACAAAAC ATTGGCTGAA 11280 

CTTGGCATTG AACTAGAAGG TGTTATCAAT ATTGAAGTGA ACCCTGACAG CCTTTTGGAA 11340 

CGTTTGAGTG GGCGTATCAT CCACCGCGTA ACTGGAGAAA CTTTCCACAA GGTCTTTAAC 11400 

CCACCAGTTG ACTATAAAGA AGAAGATTAC TACCAACGTG AAGATGATAA GCCTGAGACA 11460 

GTAAAACGTC GTTTGGATGT TAATATTGCT CAAGGAGAAC CAATCATTGC TCACTACCGT 11520 

GCCAAAGGTT TGGTTCATGA CATCGAAGGT AATCAAGATA TCAATGATGT CTTCTCAGAT 11580 

ATTGAAAAAG TATTGACAAA TTTGAAATAA AGCGTTTTTC ACACTTGCAA AAATCCGCTA 11640 

CAAATGTTAT ACTGAGATAG TCTGACTTAT AATTGTTGTC TCTGTGTCTA GAGGCATCGA 11700 
ATCGAAATTT ATGGAGGTGC TTTTGCGTGG CAAAAGACGA TGTGATTGAA GTTGAAGGCA , 11760 

AAGTAGTTGA TACAATGCCG AATGCAATGT TTACGGTTGA ACTTGAAAAT GGACATCAGA 11820 

TTTTAGCAGG G 11831 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 10726 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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<D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 



CCCGGCATTT 


GAAAGCTATT 


CGTGAAGGAT 


TTATGATGGC 


AATGCCTTTG 


ATTTTAGTCG 


60 


GCTCTTTATT 


TCTTATTCTA 


ATCAGTTGGC 


CTCAAGAGGC 


TTTTACAAAT 


TGGCTGAATA 


120 


GTGTTGGATT 


GCTAAGTATC 


TTGACAACTA 


TGAATCAGTC 


AACAGTAGCG 


ATTATCTCCT 


180 


TGGTCGCTTG 


TTTCGGTATT 


GCCTACAGGT 


TGTCGGAAGG 


ATATGGTACA 


GATGGTCCGT 


240 


CGGCAGGGAT 


CATAGCCTTA 


TCCAGTTTTG 


TATTGATGGC 


ACCTCGTTTT 


TCGAGTATGG 


300 


TTTATGATAA 


AAATGGGGAG 


CAGGTCAAGC 


AGTTATTTGG 


CGGCGCAATA 


CCATTTTCTA 


360 


GCCTGAATGC 


ATCTTCTTTG 


TTTATGGCGA 


TTACTATTGG 


ATTGGTTACA 


GCAGAGATTT 


420 


ATCGTATGTT 


TATCCAGCGC 


GGAATTACGA 


TAAAAATGCC 


AAGTGGTGTC 


CCAGATGTAG 


480 


TAAGTAAATC 


ATTTTCAGCT 


CTTTTATCTG 


GTTTTACTAC 


TTTTGTTTTG 


TGGGCTTTGG 


540 


TCTTAAAAGG 


TCTTGAAGCG 


GCAGGAGTTG 


CAGGAGGTCT 


CAACGGACTC 


CTAGGTGCAA 


600 


TTGTTGGAAC 


ACCGCTTAAG 


TTAATTGCAG 


GAACGCTTCC 


AGGTATGATT 


CTATGTGTTA 


660 


TTGTAAACTC 


ATTCTTTTGG 


TTCTGTGGAG 


TTAATGGGGG 


ACAAGTTTTA 


AATGCTTTTG 


720 


TAGACCCAGT 


TTGGTTACAA 


TTTACTACAG 


AAAACCAAGA 


AGCTGTGGCT 


GCAGGACAAA 


780 


CACTCCAACA 


CATTATTACA 


TTACCGTTTA 


AAGATTTATT 


TGTATTTATT 


GGTGGCGGTG 


840 


GAGCGACTAT 


TGGTCTTGCG 


ATTTGTCTCT 


TCCTATTTAG 


TAAGAGTCGT 


GCGAATAAAA 


900 


CATTAGGTAA 




/l 1 AVV— V»\j ILIA 


TTTTTAATAT 




ATTCTATTTA 


960 


CGTTTCCAAC 


AGTTTTAAAT 


CCGATTATGC 


TGATTCCGTT 


TATTGCTACT 


CCTACAATCA 


1020 


ATGCCTTGAT 


TACCTATGTA 


TCAATGGCTG 


TAGGATTAGT 


ACCCTATACA 


ACAGGTGTAA 


1080 


TCCTTCCGTG 


GACAATGCCA 


CCGATTATAG 


GAGGCTTCCT 


TGCAACAGGG 


GCTAGTTGGC 


1140 


GAGGAGCTCT 


ATTACAAGTT 


GTTTTGATTT 


TGGTTTCTGT 


AGCAATTTAT 


TATCCATTCT 


1200 


TCAAAATTGC 


AGATAAACGC 


AATCTTGAAA 


AAGAAAAAGC 


TACTGTTGGA 


GGGAAATAAG 


1260 


ATGGTTATCA 


GAGTATTTGA 


TCAACAGAAA 


AATACTTATT 


CTAGCTTTGC 


CTTAGAGGAA 


1320 


TTAAGTTACT 


ATATGAATCG 


GGTCTTTAAG 


ACTAACATAG 


AGCTTGTCGA 


GGAGAAGGAA 


1380 


GCGGATATTT 


TTGTAGGATT 


AGTCAATAAA 


GAGGACAGAA 


AAGACCATGT 


TCTTATCTCA 


1440 


TTAGACAAGG 


GTAAGGGGAG 


AATTGAGTCT 


AATACAATTG 


TAGGTTTACT 


TATTGGAATT 


1500 


TACCGAATGT 


TTCATGAATT 


TGGGGTTGTG 


TATACTAGAC 


CAGGGCGCAG 


ACATGACTTT 


1560 


GTTCCAGAGT 


TACGATTTGA 


AGATTTTTTA 


GATAAACAGC 


TATCTATAGA 


TGAAACAGCC 


1S20 
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AGTTACTATC ATAGGGGAGT ATGTATAGAG GGAGCGGATT CATTTGAAAA TATACTAGAT 1680 

TTCATTGATT GGCTACCTAA GATTGGGATG AACAGTTTTT TCATCCAGTT TGAAAATCCT 1740 

TACTCTTTTT TGAAACGTTG GTATGAACAT GAATTTAATC CATATCTAAA TAAAGAACAA 1800 

TTTTCAAATG AATTAGTACA AGAATTGAGT GATAGGTTGG ATAAAGAATT GCAAAAAAGA I860 

GGTCTTATTC ATCATCGTGT TGGTCATGGA TGGACAGGTG AAGTTTTAGG TTACTCTTCA 1920 

AAATTTGGCT GGGAATCAGG TCTTAGTATT TCAGAGGAGA AGAAACCCTA TGTCGCTGAA 1980 

ATAAACGGGA AACGAGAATT GTTTAATACG GCTCCGATTT TAACCAGCCT GGATTTTTCA 2040 

AATCCAGATG TAGCTGATAA GATGGTAGAA ATTATCAAGG ATTATGCCAA GAAAAGACCT 2100 

GATGTTAACT ACTTACATGT ATGGTTGTCG GATGCTCGTA ATAATATTTG TGAATGCGAA 2160 

AACTGTAGAC AAGAATTGGT TTCGGATCAG TATATTCGTA TTCTCAATCA ATTGGATAGG 2220 

GCTTTAACGA GTGAGGGATT AGATACAAAG ATTTGTTTTC TGCTTTATCA TGAGTTGTTA 2280 

TGGGCACCTC AGAAAGAAAA ATTAGATAAT CCTGAACGCT TTACCATGAT GTTTGCACCG 2340 

ATTACAAGAA CATTTGAAAT GAGTTATGCA GATGTAGATT TTGACAATTC CATACCTACG 2400 

CCTAAACCTT ATATGCGTAA TAAAATTATA CTTCCGAATT CTCTTGAGGA AAATTTATCT 2460 

TATCTTTTTG AGTGGCAAAA AGCATTTAAA GGAGATAGTT TCGTATATGA CTATCCTTTA 2520 

GGGCGTGCTC ATTATGGCGA TTTAGGCTAT ATGAAAATTA GTCAAACTAT TTACAGAGAT 2580 

GTATCTTATC TTTCCAACCT ACATTTGAAC GGGTACATTT CGTGTCAAGA ATTACGTGCC 2640 

GGATTCCCTC ATAATTTTCC TAATTATGTC ATGGGGGAAA TGCTCTGGAA GAAGACAAGA 2700 

AGTTATGAAG AATTGATTGA AGAATACTTT TCTGCTTTGT ATGGGGAAAA TTGGCAGTCT 2760 

GTTGTTGAAT ATTTAGAAAA ATTATCCATT TATTCCTCTT GTGATTATTT TAATGCAATT 2820 

GGCAGCCGTC AAAGTGATGT TTTAGCGAAT CATTATTATA TAGCTTACAA TCTAGCTGAT 2880 

AATTTTTTAC CAATTATTGA GGAAAATATT TCTAAGTTAT TAAATAGTCA AAAGGATGAA 2940 

TGGAAACAGC TCAGTTATCA TCGTGAATAT GTTGTTAAGA TGGCGAAGGC TTTATATCTT 3000 

CAAGCAACTG GAAAAACAAG GCAAGCTCAA GATGAATGGA GAAATGTGTT GAATTATATC 3060 

CGTGGGCACG AATTGCTATT TCAATCTAAT TTGGATGTTT ATCGTGTAAT TGAAGTAGCA 3120 

AAAAATTACG CTGGTTTCCA CTTATAAATC ATAAGTATAG AAAATGAACT AAGGTATTCA 3180 

GAGAAGATTG ATCCTAAATA TTATGAAATT TAAGGATTTT TAAGATATTT AGGGTCAACT 3240 

TTCTATTTAT ATCGTAGCGA AGTCATTTTA ATAATGATGT GTAAAAGATG GATCAAGATT 3300 

GAGGAGGAAG AAAGATGAAA TCAAAAGAAG AAATAAATAT GCTTGGTTTT ACAATTGTCG 3360 
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CTTACGCAGG 


AGATGCAAGG 


TCAGATTTGA 


TGGATGCTTT 


GGCGTTTGCG 


AGAGATGGAT 


3420 


ATTTTGAACA 


GGCAAGAGAA 


TTGGTTGAGT 


CTGCAAACGA 


CTCAATAGTG 


TCTGCCCATC 


3480 


GAGAACAGAC 


TAATTTATTA 


GCGGAGGAGG 


CATATGGAGA 


TAATTTTGAA 


GTGAGCTTTA 


3540 


TTATGATTCA 


TGGTCAAGAT 


ACTTTGATGA 


CAACGATGCT 


ATTGTATGAT 


CAGGTAAAGT 


3600 


TTTTTATTGA 


TGAATATGAA 


CGAATTCGAA 


AGATTGAAGA 


ACATATTGGT 


TTGCAATGAG 


3660 


GATTAGTCAT 


GGAAAATTTA 


CAGGTTAAAG 


CCTTACCGAA 


GGAGTTTTTA 


TTAGGAACTG 


3720 


CTACCGCTGC 


TTATCAAGTA 


GAGGGTGCAA 


CTAGGGTAGA 


TGGCAAAGGA 


ATAAATATGT 


3780 


GGGATGTTTA 


TTTGCAAGAA 


AATAGTCCGT 


TCTTACCAGA 


TCCAGCTAGT 


GATTTTTATT 


3840 


ATCGTTACGA 


AGAGGATATA 


GCTTTGGCGG 


CAGAACATGG 


TTTGCAGGCT 


TTGCGTTTAT 


3900 


CTATTTCTTG 


GGTTCGTATA 


TTTCCTGATA 


TAGATGGGGA 


TGCTAATGTA 


TTAGCTGTTC 


3960 


ATTATTACCA 


TAGAGTTTTT 


CAGTCTTGCT 


TAAAACATAA 


TGTGATTCCG 


TTTGTTTCTT 


4020 


TACATCATTT 


TGATTCGCCT 


CAGAAAATGT 


TAGAAACAGG 


GGATTGGTTG 


AACAGAGAGA 


4080 


ATATTGATCG 


TTTCATACGA 


TATGCTCGCT 


TTTGTTTCCA 


AGAATTTACA 


GAAGTCAAGC 


4140 


ATTGGTTTAC 


AATCAATGAA 


CTGATGTCTC 


TTGCTGCAGG 


TCAATATATA 


GGAGGTCAGT 


4200 


TTCCTCCAAA 


TCATCATTTT 


CAATTATCTG 


AAGCAATTCA 


AGCGAATCAT 


AATATGTTGT 


4260 


TGGCGCATGC 


TCTTGCAGTC 


CTCGAATTTC 


ATCAATTAGG 


GATTGAGGGA 


AAGGTAGGTT 


4320 


GTATTCATGC 


TTTAAAGCCA 


GGCTATCCTA 


TTGATGGGCA 


AAAAGAAAAT 


ATTTTGGCAG 


4380 


CTAAAOGGTA 


TGATGTTTAT 


AATAATAAAT 


TTCTATTAGA 


TGGAACTTTT 


TTGGGCTACT 


4440 


ACAGTGAGGA 


CACGCTTTTT 


CACTTGAATC 


AAATATTGGA 


AGCTAATAAT 


TCTAGCTTTA 


4500 


TTATTGAAGA 


TGGTGATTTA 


GAAATTATGA 


AGAGAGCTGC 


ACCTCTTAAT 


ACGATGTTTG 


4560 


GGATGAATTA 


TTATCGTTCA 


GAATTTATTC 


GTGAATACAA 


AGGTGAAAAT 


AGACAAGAAT 


4620 


TTAATTCAAC 


AGGAATAAAA 


GGACAGTCTT 


CTTTTAAATT 


AAATGCTCTA 


GGTGAATTTG 


4 680 


TAAAAAAACC 


TGGTATTCCG 


ACAACAGATT 


GGGATTGGAA 


TATTTATCCT 


CAAGGGTTAT 


4740 


TTGATATGTT 


GCTTCGTATC 


AAAGAAGAAT 


ATCCTCAACA 


TCCGGTCATT 


TATTTAACTG 


4800 


AAAATGGTAC 


AGCCCTTAAA GAAGTTAAGC 


CAGAGGGCGA 


GAATGATATT 


ATTGATGACA 


4860 


GTAAGAGAAT 


CCGTTATATT 


GAGCAACATT 


TACACAAAGT 


TTTAGAGGCT 


CGAGATAGAG 


4920 


GAGTCAATAT TCAAGGCTAT TTTATATGGT CTTTGCAAGA TCAATTTTCT TGGGCGAATG 


4980 


GCTACAATAA 


GCGATATGGT 


CTTTTCTTTG 


TTGATTATGA 


AACACAGAAG 


AGATATATTA 


5040 


AGAAAAGTGC 


TCTTTGGGTA 


AAAGGGCTAA 


AACGGAATTA 


AGGTTAGCGA 


TTTGACTGAT 


5100 


GTTTAATATG 


TTTTAAATAT 


GAGGTTGAAT 


TTTTTATAGG 


AGGAGTTTTA 


TGGATAAGCT 


5160 



WO 98/18931 



PCT/US97/19588 



549 

AGTCGCTGCC ATTGAAAAGC AACAAGGGAA ATTTGAAAAA ATTTCTACTA ATAACTATAT 5220 

GATGGCTATT AAAGATGGAT TCATTGCTAC TATGCCTTTA ATTATGTTTT CAAGCTTTTT 5280 

GATGATTATT ATTATGATTC CTAAAAATTT CGGAGTAGAG TTACCGAGTC CAGCTATTGT 5340 

CTGGATGAGA AAAGTGTATA TGTTAACCAT GGGAGTTTTG GGTATTATTG TTTCAGGGAC 5400 

TGTTGGAAAG TCATTAGTTG GAAATGTTAA CAGAAAAATG CCTCACGGAA AGGTAATAAA 54 60 

TGATATTTCT GCAATGTTGG CAGCCATATG TAGTTATCTG GTATTAACTG TAACGCTTGT 5520 

AGTTGATGAG AAGACGGGAT CTACAAGTTT GTCGACAAAC TATTTAGGAT CTCAAGGATT 5580 

GATAACTTCG TTTGTCAGTG CCTTTATTAC TGTAAATGTT TACCGATTCT GTATTAAGCG 5640 

AGACATTACT ATTCATTTAC CTAAGGAAGT TCCTGGGGCT ATATCACAAG CTTTTAGAGA 5700 

TATTTTCCCT TTTTCTTTTG TTTTACTTAT TAGTGGTTTG TTAGATATTG TATCTCGGTT 5760 

TAGTTTAGAT GTTCCTTTTG CCCAAGTATT TCAACAACTA TTGACTCCTA TTTTTAAGGG 5820 

GGCAGAATCA TATCCTGCTA TGATGTTGAT TTGGTTTATG TGTGCTTTGC TTTGGTTTGT 5880 

TGGAATTCAT GGACCATCTA TTGTCTTACC TGCTGTTACA GCTTTGCAAC TGAGCAATAT 5940 

GGAAGAGAAT GCTCAACTTC TTGCAAATGG GCAGTTCCCT TATCATTCTT TAACACCTAA 6000 

TTTCGGGAAT TATATCGCTG CTATTGGAGG AACGGGGGCT ACCTTTGTTG TACCATTTAT 6060 

TTTGATTTTC TTTATGCGGT CTAAACAATT AAAATCGGTA GGTAAAGCTA CAATTACTCC 6120 

TGTTTTATTT GCGGTAAATG AACCTCTTCT ATTTGGTATG CCTGTTATTT TGAATCCCTA 6180 

TCTTTTTGTC CCTTTTTTGA TGACTCCACC AGTGAATGTA TTTCTAGGAA AGGTCTTTAT 6240 

TGATTTCTTT GGAATGAATG GATTTTATAT CCAGTTACCT TGGACCTTTC CTGGTCCCTT 6300 

GGGATTGTTA ATTGGAACGA ATTTTCAACT TATCTCCTTT GTATTTTTAT CTTTGATTTT 6360 

AGTTGTCGAC ATATTGATTT ATTTGCCATT CTGTAGAGCG TATGATAGAC AGTTACTGGT 6420 

GAAAGAAGAT ATTGCAAGCT CAAATGATAT TATTTTAGAG GAGGATACAA GTGAAATAAT 6480 

TCCTGGTGAG ATAGATGAAA TAAAAAGTAA GGAGTTGAAA GTACTGGTTC TTTGTGCAGG 6540 

GTCTGGAACA AGTGCGCAAT TAGCCAATGC AATTAACGAG GGGGCTAACT TAACAGAGGT 6600 

TAGAGTGATT GCGAATTCAG GAGCGTACGG AGCTCATTAT GATATTATGG GTGTTTATGA 6660 

TTTAATTATT CTGGCCCCAC AAGTTCGGAG TTATTATAGA GAGATGAAGG TGGATGCAGA 6720 

AAGATTAGGT ATTCAGATAG TTGCTACCAG AGGAATGGAA TATATTCATT TAACAAAGAG 6780 

TCCAAGTAAA GCCTTACAAT TTGTATTGGA GCATTACCAA GCTGTGTAGT AAGTTTTTCC 6840 

ATCTTTTATT TGAGTAAAGA TTTTGTTTAC AGATAGGCTT GGATTTAAAA ACGTTCCCCC 6900 

\ 
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TTTTTTAATA TAAGAATCCC TCTTTCACAA TTGTAAAAAG AGGGATTTTG TATTTTATCT 6960 

CTTAGACCAA GTTCTCTTCA TAAAGAGAAG GAGGATTGGG TAAATCTCCA AGCGCCCTGC 7020 

AATCATTGCA AAGGATAGGA GAATTTTTGA GATGGGACTA AAGATTGAGA AACTAGAAGT 7080 

GGTTCCTAGA ATAGGCCCGA TATTATTGAA ACAGCTAAAG ACAGCGCTGG TCACGACCAG 7140 

AAAATCATTG CTATCTAGGC TGACAATAAA GATAAGCGCT AGCAAAATCA TAGCATAGAT 7200 

GACAAAGTAC TTGAGAATCT TATGCTGGGT ATCTTTGTCA ATCACCGTTT TATTAACATG 7260 

GAGGGTCAAA ACACGGTGGG GCGATAGGAT TGACAAAATT TGGTTTTTGG CAATTTTTGA 7320 

AAGGATGAGG CCTCGAATAA TCTTGAGTCC ACCTGCAGTT GATCCAGCAG AGCCACCGAT 7380 

TGCCATGAGG AAAAGGAGGA TAAACTGGGA GAAGAGGGGC CAGTTGGTAA TATCTCCATA 7440 

TCCAAAACCA GTTGTTGTAA TGATGTTGGA AACCTGGAAG AAGGTCATTT CAAAGCTCTT 7500 

TGAAAACCCT GGGTAGAGGT AGAGGGTGTT GAGGCTAATC AAGCCTGTAG AAACCAGTAC 7560 

AATGACCAAG TAAGCCCTAA GCTCTTCATC TCCAAAGAAG GCCTTGATGC GACGGAGCAT 7620 

GAGGTAGTAG TAGAGGTTGA AATTTACTCC AAAAACCAGA ACTCCGATAC TGACCAGATA 7680 

GGTAATCAGT GAGCTGCCAT AGTGGGCAAT TCCGTCGTTA TAGACGGTAA AGCCTCCAGT 7740 

TCCCGCTGTC CCCATAGCAA TAACAAAACT ATCGTAGAGA GGCATACCGG CTAGATAATA 7800 

GATGATGACA AAGAGGGAGA AGAGAGCTAG ATAAAGGAGA TAGAGAATCT GGGCAGTGTT 7860 

TTTTAGTTTG GATACAACCT TGCCAAAAAC AGGACCTGGA ACCTCAGCCT TCATCACCTC 7920 

TAGGTGGCTA TTTTTGGCAT TGTCCATAAT AGCAAGTGCA AAAACAAGCA CTCCCATCCC 7980 

TCCAATCAAG TGGGTAAAAC TTCGCCAGAA GAGGAGGGAA CGGCTGAGAA CCGAAACGTC 8040 

GTTCAAAATA CTTGCTCCAG TAGTTGTAAA TCCAGAACTA ATTTCAAAAA AGGCATCAAT 8100 

AAGGCTGGGG ATTTGCCCAG AAAAGACAAA GGGGAGACCA CCAAAGAAAG ACCAAAGGAT 8160 

CCAACAGAGG GCAACGATCA AGACTCCCTC CTTGGCATAA ATCCGTTGAT TTTTTGGCTT 8220 

CTGTAAACTC CCTGAACCGC CTAACAATAC GAGAATCCCT ATGGTCGAAA AGAGGGCTGT 8280 

AAAGACTTGG CTCGATTCAC GGTAATAGAC AGCAATCGCA ACAGGAACCA AAAGAAGAAC 8340 

AGCTTCAATC AAAAGTAATT TTGAAAGGAG GTAACGAATC ATACTTTTAT TCATTTCTTA 8400 

CCTCGCGATC AAGTCATAAA TCTTGGTGAT GTTTGGCAAC AAGGTTGTTA CTAGGAGCTT 8460 

GTCTCCAACT TCCAACATAT CCTCCCCAGT TGGGAAAATA GTCTTGCCCT TTCGAATAAT 8520 

GGCTGCAATA AGAACCCCTT TTTTCAATTT CAGTTGAGAA AGAGGTTTGG CAGTCATTTT 8580 

ATTGGCTTCC TTGATATGGA ATTGCAGGGT TTCGATTTGG CCATTGGCTA GATGGTGCAT 8640 

AGCTTGAAGG TCTGAATACT GGGCATTAAG TCGACCACGA ATAAAGTGCA TAATCGTATC 8700 
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TACAGCGATG CTTTTAGGTG TGATGATACT TGAAAAATCA GGCGCATTGA TAATCTCGAG 8760 

GAGACTGGTA CGATTGACCT TAGTAATATT TTTCTGTACA CCTACCCTGT CAAGGAACAT 8820 

AGATGTAATC AGATTTTCCT CATCGACTCC TGTTAGAGTC GCAACGGCAT CATAGTGTTG 8880 

AGCACTTTCT TCCAGCAGGA TATCTTTTGC GGTTCCATCT CCTTGAACGA TGTAGAGATT 8940 

TGGGAATTTC TCGCTAAAGA AGCTGGCGAT TTCAGGATTG ATTTCAATGA CTTTTGTATC 9000 

GATACGACTA TCTTTGAGAA TACCAAGTAG ATAATAGGCA ATTCTACCTG CCCCAACGAT 9060 

GAGAAGGCTC TTCACGGCGC GTGATTTAAA ATAATTATGG AAGAGTATCA TATCGACACG 9120 

GTTACCAGTG ACAAAGATTC TATCTTTATC CTGTACAGTC ATGTCACCGC TTGGAATGAT 9180 

AATTTGATGA TCCCTCTCTA TCGCACAGAC AATGACATTA CCAAATTTTT TACGAAAATC 9240 

AGAAATGGGC ATTTGGCAAA GACCGCTGGT GGACTTGACG ACAAATTCCA TGAGGCTAAC 9300 

GCGTCCACCA GCAAAGCGTT CGACAGACAG GGCGTTGGGG AAGTCAATGA TATTCGCGAT 9360 

AGCGCGGGCA GCCAAGAGCT CAGGATTAAC GATAAGAGAA AAACCGAGAA TATTCTTTTC 9420 

CTTGAAATAA GAGTTAGAAT ATTCAGGGTT CCGCACCCGA ACGATAGTTT CTTTAGCTCC 9480 

CATTTTCTTG GCTAGAACTG CTGCAATCAT GTTGACTTCA TCGTGCTCAG TCAGGGCGAT 9540 

AAAGATATCA CAATCTTGGA CGCTGGCTTG CTCAAGAATG GCAAAATCGG CCCCGTTACC 9600 

AAGGATACCA ATGATATCAA AGCGACTGAC AATATGATTG AGAACAGCTT CGTCTTGCTC 9660 

AATCAGCAAA ACATCATGCT TTTCTGCAAC CAAGGAGCGA CAGAGGGCAA AACCAACTTT 9720 

TCCCCCTCCG ACAAGGATAA TTTTCATAAT AAAACCTACT TTTTCATGAT GTAACTATCA 9780 

TACCCTTTTT CAAGAAAAAA TGCACCTACT AGCTAATAAC AAGAGTTTTT AGTGAAAATT 9840 

CGCTATAAGG TAAAACTATA CCCTAACCAA TTGAAATAGC TATTAGCGAC TTTCTCTGAA 9900 

ATATGGTATG ATAAAGGATA TACAAGGAGA TAAAATGAAT AATAATTTAC TGGTATTACA 9960 

ATCAGACTTT GGTCTGGTTG ATGGTGCGGT ATCGGCTATG ATTGGAGTGG CTTTAGAAGA 10020 

GTCTCCAACC TTAAAAATAC ATCACTTGAC GCACGATATC ACGCCTTATA ATATTTTTGA 10080 

GGGGAGCTAT CGTCTCTTTC AGACGGTGGA TTACTGGCCT GAGGGAACGA CGTTTGTATC 10140 

GGTTGTCGAT CCAGGTGTCG GTTCGAAACG TAAGAGTGTA GTTGCCAAGA CTGCAAAAAA 10200 

TCAATACATT GTCACGCCAG ATAATGGGAC GCTTTCCTTT ATCAAGAAAC ACGTTGGCAT 10260 

TGTAGCCATT CGTGAGATTT CTGAGGTGGC CAATAGGCGT CAAAACACAG AGCATTCTTA 10320 

TACCTTCCAC GGTCGTGATG TCTATGCCTA TACTGGTGCT AAACTGGCCA GTGGTCACAT 10380 

TACTTTTGAG GAAGTAGGGC CAGAGCTCAG TGTGGAACAG ATTGTAGAGC TTCCAGTCGT 10440 
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AGCGACCATC ATAGAAGATC ATCTGGTGAA GGGAGCCATT GATATTCTGG ATGTGCGTTT 10500 

CGGTTCGCTT TGGACCTCTA TCACACGGGA AGAATTTTAC AAGCTGGAAC GAGAATTTGG 10560 

TGATCGTTTT GAAGTGACCA TCTATCATGC TGATATGCTG GTCTATCAAA ATCAGGTTGT 10620 

CTATGGCAAA TCATTTGCAG ATGTGAGAAT TGGGCAACCs ATcTTTACrc TCAGCaTCTt 10680 

CGATTAGCTG GGCAATTCGT TCTAGTTGGA TTTCGTCAAT CAAGGT 10726 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7163 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

TTATCTTTAA CGATATCAAT CAAGATCTGG TCAATAAAGG GATTGGGGCT TATCGTGAAG 60 

TTGGCATCCA AGCCCATGGA TATGTCTGTG ACGTGACAGA CGAGGACGGT ATCCAAGCCA 120 

TGGTCAAGCA AATCGAACAA GAGGTTGGTG TCATTGACAT CCTCGTTAAT AACGCTGGTA 180 

TTATCCGCCG AGTTCCAATG TGCGAAATGA GCGCCGCTGA TTTCCGTAAG GTCATCGATA 240 

TTGACTTAAA CGCACCATTT ATCGTTTCAA AGGCAGTTAT TCCTTCTATG ATAAAGAAAG 300 

GGCATGGAAA GATTATCAAT ATTTGTTCGA TGATGAGCGA ACTGGGACGT GAAACAGTTA 360 

GCGCTTATGC TGCTGCTAAA GGGGGCTTGA AAATGTTGAC CCGCAACATT GCGTCTGAAT 420 

ACGGTGGAGC CAATATCCAA TGTAACGGAA TTGGACCGGG TTATATTGCC ACTCCTCAAA 480 

CAGCACCTCT TCGTGAATTG CAAGAAGATG GTTCTCGCCA CCCATTTGAC CAGTTCATCA 540 

TTGCAAAAAC ACCTGCTGCA CGTTGGGGAA ATACTGAAGA TTTGATGGGC CCTGCTGTCT 600 

TTCTCGCTAG TGATGCCAGC AATTTTGTCA ATGGCCACAT CCTATATGTA GATGGCGGTA 660 

TCTTAGCCTA CATCGGAAAA CAACCTGAGT AAAAATAGAA AGAAGATCTT ATGAAAATCG 720 

CATTAATCAA TGAAAATAGT CAAGCTAGCA AGAATCACAT TATTTACGAT AGTCTAAAAG 780 

AAGCGACAGA TAAAAAAGGC TACCAATTAT TTAACTATGG TATGCGTGGA GAAGAAGGAG 840 

AAAGTCAATT AACTTATGTG CAGAACGGAC TAATGGCTGC CATCCTTTTA AATACAAAGG 900 

CAGTTGACTT TGTTGTTACC GGCTGTGGTA CGGGTGTAGG GGCTATGCTT GCTTTAAACA 960 

GCTTCCCTGG TGTTGTCTGT GGTCTAGCAG TGGACCCAAC TGACGCTTAC CTTTATTCTC 1020 

AAATCAATGG TGGTAACGCC TTGTCTATCC CTTATGCCAA AGGATTTGGC TGGGGGGCAG 1080 

AACTGACCCT CAAATTGATG TTTGAACGCT TATTTGCTGA AGAAATGGGC GGTGGCTACC 1140 
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CAAGAGAACG TGTAATCCCT GAACAACGCA ACGCTCGTAT CTTAAACGAG GTGAAACAAA 1200 

TCACCCACAA TGATTTGATG ACCATCCTTA AAATAATCGA CCAAGACTTC CTCAAAGACA 1260 

CCATCTCTGG CAAATACTTC CAAGAATACT TCTTTGAAAA CTGCCAAGAT GATGAAGTTG 1320 

CTGCTTATTT GAAAGAAGTA TTAGCCAAGT AAAGCTATTC TAAACCAGAA AGGAACTAAT 1380 

GGATGACGAA AATATTACTG TTTGGCGAAC CATTAATTCG AATTTCACCA TTAGATGCCA 1440 

CCAGTATCGG CGATCATGTT GCCAGTTCGA CTTATTTTGG CGGATCAGAA ATTAACATCG 1500 

CTTGTAATTT GCAAGCCCTG GGTATCTCAA CGAAAGTTTT TACCGCACTC CCTGCCAACG 1560 

AGATTGGAGA TCGTTTTCTC ACATTCTTGA AACAGCACCA AATCGATACC AGTTCAATCT 1620 

GTCGGCTTGG CGATCGAATC GGCCTCTACT ATTTGGAGAA CGGCTTTGGT TGTCGTCAAA 1680 

GTGAAGTTTT CTACGATCGT AAGCATACGA GTATCAGCCA GATTCGGCCA AACATGCTAG 1740 

ATATGGATTC TCTCTTTCAG GGGATTAGCC ATTTTCATTT TAGTGGAATC ACCGTAGCTA 1800 

TCGGTCAAGA GGTCCGTGCG ATCCTTCTCC TACTCTTGGA AGAAGCCAAG CGCCGAGGAA 1860 

TTGTCGTTTC AATGGATCTC AATCTGAGAA CAAAGATGAT TTCAGTCCTA GAAGCCAAGT 1920 

ATGAATTTTC TAAGTTTGCA CGTTTTACTG ACTATTGCTT CGGTATTGAT CCTCTCATGA 1980 

TTGATGACCA AAATCTAGAG ATGTTTCCAA GAGACAGTGC TAGCCTAGAA GAGGTGGAAA 2040 

ATCGCATGCG ACTTTTAAAA GAAGCCTATG GTTTCAAGGC CATTTTCCAT ACCCTCCGCT 2100 

CTAGTGATGA GCAAGACAAA AATGTCTATC AAGCCTATGC TCTAGAAGAA CTATTTGAAG 2160 

AGTCTGTCCA ACTAAAAACT GCAGTCTATC AACGAATTGG TAGCGGGGAT GCCTTTATAT 2220 

CTGGTGCCCT TTACCAACTA CTC CATC ATT CCTCCCTAAA AACTACCATT GACTTTGCAG 2280 

TTGCGAGCGC AACTCTCAAA TGCACTCTTC CAGGAGACCA TCTCTCCACT TCCTCAACTA 2340 

GTATTGAAAA TTTACTGGCA AATGCACAAG ATATCATTCG TTAGGAGAAT TACATGACCA 2400 

AATCAGATAC GATTATTGAA CTAAAAAAAC AAAAAATTGT CGCTGTTATT CGAGGAAATA 2460 

CAAAGGAAGA AGGACTACAA GCCTCGATTG CTTGTATCAA GGGCGGTATC AAAGCTATTG 2520 

AAATCGCCTA TACCAATCAG TATGCAGGAC AAATCATCAA GGAACTTGTA GACTTGTATC 2580 

AGGACGATCA GAGTGTTTGT ATCGGTGCAG GTACTGTGCT TGATGCCGTA ACTGCTAGAG 2640 

ATGCCATTCT AGCTGGAGCA AATTACGTTG TTTCTCCATC TTTCCATGCT GAAACTGCGA 2700 

AAATGTGCAA TCTCTACAGC ACACCGTACA TTCCAGGCTG TATTACCCTC ACAGAGATCA 2760 

CGACTGCACT TGAAGCCGGT AGTGAAATCA TCAAACTCTT CCCAGGTAGT ACTCTCAGTC 2820 

CAGCATATAT CTCTGCAGTC AAGGCACCGA TCCCACAAGT TTCCGTAATG GTAACCGGAG 2880 
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GAGTCGGCCT AAACAACATC CCTCAATGGT TCGCTGCTGG TGCAGATGCC GTTGGAATTG 2940 

GTGGCGAACT CAATAAACTC GCTTCCCAAG GCAACTTTGA CCGCATCAGC GAGATTGCCC 3000 

AACAGTATAT TACACTCAGA TAAAATCATA ACTACCCGTC TAACGGGTGG TTTATCTCAG 3060 

AGCTATAAGC CCAAATCATC AGCCAGCGCC TAAAGACGCT GGCTTTCACG TTGTTCAAGC 3120 

CTTATTGCTC TTGACTCGTC ACTTGCCTCT TTAAGAGACT TTGGTATTAC TTACCACTAT 3180 

CCCTAAAGGG ATCCTCATAT TCTTTTACAC TCAATTTATC TAGTGCTATA GTAGATTGAA 3240 

ACTGGAATAG TACACCTCTG CTTCTAAAAC ATTGTTAAAA ATCGATTTGA CTGTCCTGAT 3300 

CGATTTTGTC CTGTTCTTAT TTCATTTTAC TATATATCAT ACTTTACTCG TTCTCAAATT 3360 

TTCATACTCA TGAAGAAATC ATCCACTCGA TAATTTCTTT AATCTTGACT ATATTTCTTA 3420 

ATTGTGGCTT CATTAAGCCC TACTGGACTT ACATAATAAC CTTCCTCCCA GAAATGCCGA 3480 

TTCCCAAACT TGTACTTGAG ATTGGCGTGT TTGTCAAACA TCATGAGTGC ACTTTTGCCT 3540 

TTTAAATACC CCATAAAACT TGAAACACTT AGCCTCGACG GAATACTGAC TAACATGTGT 3600 

ACATGGTCTG GCATTAAGTG ACCCTCGATC ATTTCAACAC CTTTATAACT ACACAAGCGA 3660 

TGAAATATTT CGTCTAAACT ACTTCTATAT TGATTATAGA TGACTTTTCG TCTATACTTA 3720 

GGGGTGAACA CAATATGATA GAACACCTCC ACTTTGTGTA TGATAAACTA TGAGTCTTTT 3780 

GTGCCATATT TTTTCTCCTT TCGCTTTACA ATTGGATTGA ACACCTTTAT TGTATCGCGT 3840 

TTGGAGTTTT TTTGGTATAA CCTTCGACGC GCACCCGTAT AGCGGGTGGT TGTTTTGTCT 3900 

CGCACCTCAC GGAGCGAGAC GGACTAATAT AGTGGAGTGA AATAGGATAC GAACAAATTG 3960 

ATTAGGAAAA TCAAATGAAT TTATAGAAAT CTTTTAGCAG TTATAACGTT CTATTCTAGT 4020 

TTCAAAACGC TATAGTCACA TAATAATGAA GTAAAAAAGG ATAAGTATCA ACTTATCCTT 4080 

TTTTAAAAGA AAAATCCGAA GATATTTGGC CTTCTTCGGA TTTTTTCTAT TTTCCACAGT 4140 

TTCATGTAAT TCATCTAGAT GATGAACAAA TTAGTTGTTC TTTCCTCTAC GGAATAGATA 4200 

AAATGCCCCA AGTAGCAAGA ACCCTAGACT TGCCAAGATT GACTGACCTT CTCCTGTCTG 4260 

AGGGAGATTC TTTTGATCCG AATGGTTCTT TTCCTCTTCA GATTTTTCCT TTTCTTTTGA 4320 

ATTCTGTACT TGTGGCTGAG CTGCTTGCTC TAGCTTTTTA AAGACTTCCT GATCTGGAGC 4380 

TGATTCCTGG GTTTCAGGAT TATAGTAGGC AATCTTATAT TCATCCCCTT CTTTTCGAAT 4440 

GGTATAGACT CCACGTTTCA AAACTTGGAA TTGGTTGGAA ATAGTAGAGA CAGAATCATC 4500 

ATATTTCACA ATGCCCCAAA CTCCTTGTTT AGCATCATAA ACAGACTGAA GGGTTTCGTT 4560 

ATTTTCGATG AGGCTACTTT CTAACTCTTT TATCATTTGA TTGAAGGTGG CACGATCCAC 4620 

GTTAGGAATG AGCATATAGC CATAAGAATC TCTATTTTGC TTATGAGCCT GACTAATCGT 4680 
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AAGAAATTCA 


TTTTCAACTT 


CCTTGTCTGA 


CTGTCCTTCA 


TTGATATCCT 


TCCAGGCTCC 


4740 


CTTTTGCAAA 


GCCTTACTCA 


TACTGATTGA 


ACTCTTCTTA 


AAGAAAAAGT 


AACCAATATT 


4800 


CTTTTTCGAA 


TCGAACGATT 


CTAAAAAGAC 


ACTTTGGGTT 


TCAGGATAAT 


CCTTTTCTTG 


4860 


TTCTGTAAGG 


GAGGCTTCTT 


TATCATTGAC 


ATAGACTTTA 


TATGGATTAC 


CTGATTCCAG 


4920 


TTTTCTCTGG 


TCAATTGTAG 


TTGCAGCAGT 


ATCTGTTGAA 


GTGTTTTGGA 


TATTGCTTCC 


4980 


TAAAAAGGCG 


ATCTTATCCT 


TTAGCATAAA 


CCAGCTCTTA 


TGAGCAGTCA 


ATGTTTGATT 


5040 


CCAGTTGGTG 


AAATCCATGG 


TTGCTGTCGC 


ATTGGCATCA 


TCTAGTTTGC 


TCGTTCCAAC 


5100 


GAAAGCAGAC 


GGTAAAACTT 


TACCTGTATC 


GCTATCCGCT 


CTCTTAGCAT 


CCGTCTCTGT 


5160 


TGTACCAGGC 


ATCTTATATG 


GATTAACTGT 


TGGCCAGTAG 


CCATCGCTAT 


AGTGACTCAA 


5220 


ATCGCCATTG 


TAAAGATAGA 


ACATCCCATC 


ACTCGTATAC 


CAACCACGTT 


TATTTTCCTT 


5280 


GTTCATGTGT 


TCGTAATTCA 


AGGTACGACT 


GGAAAAGAGT 


GACAAGCCAA 


ATCCAAACCC 


5340 


TTTCTCTGCA TTGTACATGG 


CTGTTTTATC 


CATCTTGTTA 


AAGGCAGATA 


GGTAACTTGG 


5400 


TCTTGGAACA 


CTTGCGACTC 


CTGCATCACT 


TAACAAGGAT 


TGCATCAAAC 


TGATATCCTT 


5460 


ATAAGTCTTC 


AAATTCTTAA 


AGACATCATA 


ATAACTATCC 


GATTGAACAA 


TGGTCTTCAC 


5520 


AAGACTCTGC 


AAACATTGTT 


TGGTTTCTCC 


TTCAGACATA 


TCCGCTATTC 


GGTGAATCCC 


5580 


TCTTAGTACT 


TCTACTGCGG 


CCACGTGCCC 


CTCGCTATTT 


GCACGACTGA 


TCGAGCGTCC 


5640 


ACGACTCATA 


TCCATCAACT 


CTCCATTCAC 


CAGCAAAGGA 


GCAAACGATT 


TATCAATCCA 


5700 


GTGGTACATG 


GTTTGCATTT 


TATCTTTATC 


GATTGGATTC 


TTGGTCTTTT 


GAATGACTGG 


5760 


CAACAGTTGA 


GACAGGCCAT 


CAATCAAAAC 


ATTCCCATAA 


GCACCCGTAT 


AGGCAACATT 


5820 


GGTGTGGTCG 


ATATAGGATC 


CATCTTGATA 


AAAACCTTCA 


CCTTGGTCTA 


CCAACTTGAA 


. 5880 


CACTTGCTCA ATCGAGCGAA 


TGGTAGAAGA 


AATTTCTTGA 


TCATCCTTAC 


GCAGTAAACC 


5940 


AGCTATTACT 


TTTACCCTTC 


CCATATCAAC 


TAAGTTTCCA 


CCTAGAGCCT 


TGAATGGGTT 


6000 


ATCAGTCGTC 


TTTCGGAAAT 


GTTCGGGATC 


TGGTACAAAT 


TTTTCAATCA 


CATCTGTATA 


6060 


TTTTTTAATT 


TCCTCATCAG 


AGAAGTATTC 


TTTCATCAGA 


GACAAGGTAT 


TGTTGATGGC 


6120 


ACGAGGTGTA 


CCGATTTCAT 


AATCCCACCA 


GTTCCCAACA 


ATGCTCTTTT 


CACTATTGTA 


6180 


GACATGTTTA TGCATCCATT 


CCATGGAATC 


CCTGACTGTT 


CGAACGACAG 


TTTCATCTTG 


6240 


ATAATAACGA 


GAAGAAGGAT 


TGGTCACTTG 


CTTGGCCATC 


TCCTCCAATT 


TCCGATAAGT 


6300 


GGCAGTCAGA 


TTTGCAGACG 


TTTTATAATT 


TGAAAATTTT 


TCCCACAAAT 


AGGTGCGGTC 


6360 


CGCCTGACTT 


GAAATACTGG 


ATAGGCTATC 


AGCTACCTTT 


CCTTCCAATT 


CCTGGTTTAA 


6420 
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TTTGGCCATC TGTTCATTTT TAGAATCATA GTATTGATTC CCAGCGATGA TGCCATTCCA 6480 

GTCATCCAAA CGGTCTGTGT ATGCATCCTT AACAGAGGCC AGAATCTTCA AAGGAATCTT 6540 

TTTCACTTCC TTGCCATCTT TACTGACAAT GACATTGGTT GTCCCTTCCT TAAGAGGTTC 6600 

TAAAATTCCA TTTTTGACTG AAGCAACGTC AGGATTTTCT ACCTTATAAG TATAGTCCGC 6660 

AAGAGAAAAA ACATGTTTTT TTCCAATTGG TAAATCAATC TTTTCCTCAA GCTGTTTATC 6720 
TGTTTGAGAA TCCTCAGAAA GCTGGTCTGC TACCTCTACC AGCTCAATAT CCTTAAAGGA . 6780 

AACAGTCCCA GTTCCTGTTT CATAGAATAA CTCCAGCTTG ATTTTATCAA CATCTAAAGT 6840 

CGGGCTATAG TCTGCTTCAA TGGTCTGCCA GTCCTTTGTT CCTGACGTCG TTGCAGAATT 6900 

CCACAATCGC TTGTCCTTAC CACTTTCCTC AATGATACGA ACTTTGGCAA TCCCGATTTT 6960 

ATTATCTGTT TTAATCTTGA AACGCAGTTT ATACTTTTTC TTAGCTTCAA TAGGAACCAT 7020 

ACGGTGAAGC GCTGCCCTTA ATTTCTCATG GCTTGAGATA GTGATAGCCC CATCCTTAGC 7080 

CTCAATGACT CGAGTTGAGG CATCTGCACT ATTCTTCTGG TCTACCCAAG CTGACCACCC 7140 

CCTGAGCTTT GCTTCCTGTC CGG 7163 



(2) INFORMATION FOR SEQ ID NO: 68: 

{i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9244 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

CGTTATAACA TACATGTAAG CGGTACCCAA AATGGTGCCA AGTCAAAATT TTTAAGGAGG 60 

AAAATACATG TCTTCACATC CAATTCAGGT CTTCTCAGAA ATTGGGAAAC TGAAAAAAGT 120 

TATGTTGCAC CGTCCAGGCA AGGAGTTAGA AAACTTGTTG CCGGACTATC TTGAAAGGCT 180 

TCTTTTTGAT GATATTCCTT TCTTGGAAGA TGCTCAAAAA GAACATGATG CATTTGCCCA 240 

AGCTCTTCGC GATGAAGGAA TTGAGGTTCT CTACCTAGAA CAACTCGCTG CTGAATCATT 300 

GACCTCTCCA GAAATCCGCG ATCAATTTAT CGAGGAATAC TTAGACGAAG CCAACATCCG 360 

TGATCGTCAA ACCAAGGTTG CTATTCGTGA ATTGCTTCAC GGCATCAAGG ACAACCAAGA 420 

ATTGGTTGAA AAAACAATGG CTGGGATTCA AAAAGTTGAA TTGCCAGAAA TTCCTGACGA 480 

AGCTAAAGAT CTAACTGACT TAGTTGAATC AGAGTATCCA TTTGCAATTG ACCCGATGCC 540 

AAACCTCTAT TTCACTCGCG ACCCATTTGC AACAATTGGA AACGCCGTAT CGCTTAACCA 600 

CATGTTTGCA GACACTCGTA ACCGTGAAAC ACTCTACGGT AAGTATATCT TCAAATACCA €60 
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CCCAATCTAT GGCGGAAAAG TGGATTTGGT CTACAACCGT GAAGAAGATA CGCGTATCGA 720 

AGGTGGAGAC GAGTTAGTTC TTTCTAAAGA CGTCCTTGCA GTAGGTATCT CTCAACGTAC 780 

AGACGCAGCT TCTATCGAAA AACTTTTGGT CAACATCTTC AAGAAAAATG TTGGCTTCAA 840 

GAAAGTTTTG GCCTTTGAAT TTGCTAACAA CCGTAAATTC ATGCACTTGG ATACTGTCTT 900 

CACTATGGTA GACTATGACA AGTTCACTAT TCACCCAGAA ATCGAAGGCG ACCTTCACGT 960 

TTACTCAGTT ACTTACGAAA ACGAAAAACT TAAAATCGTT GAAGAGAAAG GTGACTTAGC 1020 

TGAACTTCTT GCTCAAAACC TTGGTGTAGA AAAAGTTCAT TTGATTCGTT GCGGTGGTGG 1080 

CAATATCGTA GCAGCTGCGC GTGAACAATG GAACGACGGT TCTAACACTT TGACCATCGC 1140 

ACCTGGTGTG GTAGTTGTTT ATGACCGCAA TACCGTGACC AATAAGATTT TGGAAGAATA 1200 

CGGGCTTCGC TTGATTAAGA TTCGCGGAAG TGAATTGGTT CGGGGCCGTG GTGGACCTCG 1260 

TTGTATGTCT ATGCCATTTG AACGTGAAGA AGTGTAATCG CTGTTCGATA TTCGTCAATA 1320 

GAAAATGTAA AAAATAGAAA GAGGAAATAA TAAAATGACA AATTCAGTAT TCCAAGGACG 1380 

CAGCTTCTTA GCAGAAAAAG ACTTTACCCG TGCAGAGTTA GAATACCTTA TTGGTCTTTC 1440 

AGCTCACTTG AAAGATTTGA AAAAACGCAA TATTCAACAC CACTACCTTG CTGGCAAGAA 1500 

TATCGCTCTC CTATTTGAAA AAACATCTAC TCGTACTCGT GCAGCCTTTA CAACTGCGGC 1560 

TATCGACCTT GGTGCTCACC CAGAATACCT CGGAGCAAAT GATATTCAGT TGGGTAAAAA 1620 

AGAATCTACT GAAGATACTG CTAAAGTATT GGGACGTATG TTTGACGGGA TTGAATTCCG 1680 

CGGATTCAGC CAACGTATGG TTGAAGAATT GGCAGAATTC TCAGGCGTTC CAGTATGGAA 1740 

CGGTCTAACT GACGAATGGC ACCCAACTCA AATGCTCGCT GACTACTTGA CTGTTCAAGA 1800 

AAACTTCGGT CGCTTGGAAG GCTTGACATT GGTATACTGT GGTGATGGAC GTAACAACGT 1860 

TGCCAACAGC TTGCTCGTAA CAGGTGCTAT CCTTGGTGTC AATGTTCACA TCTTCTCACC 1920 

AAAAGAACTC TTCCCAGAAA AAGAAATCGT TGAATTGGCA GAAGGATTTG CTAAAGAAAG 1980 

TGGCGCACAT GTTCTCATCA CTGAAGATGC TG ATGAAGCA GTTAAAGATG CAGACGTTCT 2040 

TTACACAGAC GTTTGGGTAT CAATGGGTGA AGAAGACAAA TTCGCAGAAC GTGTAGCTCT 2100 

TCTTAAACCT TACCAAGTCA ATATGGACTT AGTTAAAAAA GCAGGCAATG AAAACTTGAT 2160 

CTTCCTACAC TGCTTGCCAG CATTCCACGA TACTCACACT GTTTATGGTA AAGACGTTGC 2220 

TGAAAAATTT GGTGTAGAAG AAATGGAAGT AACAGACGAA GTCTTCCGCA GCAAGTACGC 2280 

TCGCCACTTC GATCAAGCAG AAAACCGTAT GCACACTATC AAAGCTGTTA TGGCTGCTAC 2340 

ACTTGGTAAC CTTTATATTC CTAAAGTATA ATTTTAGATA ATAAACCGTC TACCAACAGC 2400 
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TATGAGGGCT GCGACTAATA GCTTTAGTCC GGTCCTCTTT TATGTAATGG TAATCTATTA 2460 

TTTCTTATAA AATATGTGAA AAATCATTAA ATTGAAATCT AAACGCATTC TATTGAGTGT 2520 

GATAAAGGAG AATTTATGGC AAATCGTAAA ATTGTAGTAG CTTTGGGAGG AAATGCGATT 2580 

CTTTCTTCTG ACCCATCAGC AAAGGCTCAA CAAGAAGCTT TAGTTGAAAC AGCTAAGCAT 2640 

CTTGTAAAAT TGATTAAAAA TGGAGATGAT CTGATTATCA CTCACGGTAA TGGACCTCAA 2700 

GTTGGGAATC TCTTGCTCCA ACATTTGGCA TCAGACTCTG AAAAGAACCC TGCCTTCCCA 2760 

CTCGACTCAC TTGTCGCTAT GACAGAAGGT AGCATCGGTT TCTGGTTGAA AAATGCTTTG 2820 

CAAAATGCTC TCTTGGATGA AGGCATCGAA AAAAATGTTG CCTCTGTTGT AACGCAAGTT 2880 

GTCGTAGATA AAAATGATCC AGCTTTTGTT AACTTGAGTA AACCAATCGG TCCTTTCTAT 2940 

TCAGAAGAAG AAGCAAAAGC AGAAGCCGAA AAAAGCGGAG CGACTTTCAA GGAAGATGCT 3000 

GGCCGTGGCT GGCGTAAGGT CGTTGCCTCA CCAAAACCTG TTGACATCAA AGAAATTGAA 3060 

ACCATCCGTA CTCTTTTAAA TAATGGTCAA GTCGTCGTAG CTGCAGGTGG TGGCGGTATT 3120 

CCCGTCGTCA AAGAAAACAA TGGACATTTG ACTGGTGTCG AAGCGGTTAT TGATAAAGAC 3180 

TTCGCTTCCC AACGTTTGGC AGAATTGGTT GATGCAGACC TCTTCATCGT TTTGACAGGT 3240 

GTAGATTATG TATTTGTTAA CTACAACAAG CCAAACCAGG AAAAATTGGA ACATGTGAAT 3300 

GTTGCCCAGC TGGAAGAATA TATCAAACAA GATCAGTTTG CACCAGGTAG CATGCTTCCA 3360 

AAAGTAGAAG CAGCTATCGC TTTTGTCAAT GGTCGTCCAG AAGGAAAAGC AGTTATTACT 3420 

TCCCTTGAAA ATCTAGGCGC CTTGATTGAA TCTGAAAGCG GAACAATTAT TGAAAAAGGA 3480 

TAAGTTGTTT TACTAATAAG ATGTATTCTA TTTCTAGTAT CTTTATATCA AATTAGAAAT 3540 

TATTCTTGAA AACATGTACA ATATTTCAAA AGATACTAGT TTTAGACTTT AATATGGTAA 3600 

AACAAATATA AATAGAAAGC GTTTTCTTGA ATGTTTATTT AAGAAAGTAG TTGGTTTTTT 3660 

ACACTTTGTT AGACATCAGG AGGAAAAACA AATGAGTGAA AAAGCTAAAA AAGGGTTTAA 3720 

GATGCCTTCA TCTTACACCG TATTATTGAT AATCATTGCT ATTATGGCAG TGCTAACTTG 3780 

GTTTATCCCT GCGGGGGCCT TTATAGAAGG TATTTACGAG ACTCAGCCTC AAAATCCACA 384 0 

AGGGATTTGG GATGTCCTCA TGGCACCGAT TCGGGCTATG CTAGGTACTC ATCCAGAGGA 39Q0 

AGGTTCGCTC ATTAAAGAAA CGAGCGCAGC GATTGATGTA GCCTTCTTCA TCCTTATGGT 3960 

TGGTGGTTTC CTTGGCATTG TCAACAAAAC TGGTGCTCTT GACGTAGGGA TTGCCTCTAT 4020 

CGTGAAGAAG TATAAGGGCC GCGAAAAAAT GTTAATTTTG GTACTGATGC CTTTGTTTGC 4080 

CCTCGGTGGT ACAACTTATG GTATGGGTGA AGAAACAATG GCCTTCTATC CACTCCTTGT 4140 

GCCAGTTATG ATGGCCGTTG GTTTTGATAG CCTGACTGGT GTTGCAATTA TTTTGCTCGG 4200 
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TTCTCAAATC GGCTGTTTGG CATCTACTCT GAATCCATTT GCGACAGGTA TTGCTTCAGC 4260 

GACTGCGGGA GTTGGTACAG GGGACGGTAT CGTACTTCGT CTGATCTTCT GGGTTACCTT 4320 

GACTGCTCTT AGTACTTGGT TTGTTTACCG TTATGCGGAT AAGATTCAAA AAGATCCGAC 4380 

TAAGTCACTG GTTTATAGTA CTCGCAAAGA AGATTTGAAA CACTTTAACG TAGAAGAATC 4440 

TTCATCTGTA GAATCTACAC TTAGCAGCAA ACAAAAATCA GTTCTCTTCT TATTTGTGTT 4500 

GACATTCATC TTGATGGTAT TGAGCTTCAT TCCATGGACA GACCTTGGCG TTACCATTTT 4560 

TGATGACTTT AATACTTGGT TGACTGGTCT TCCAGTTATT GGTAATATTG TCGGTTCATC 4620 

TACTTCTGCA CTAGGTACTT GGTACTTCCC AGAAGGCGCA ATGCTCTTTG CCTTTATGGG 4680 

TATCCTGATT GGTGTTATTT ATGGTCTTAA AGAAGATAAG ATTATCTCTT CCTTCATGAA 4740 

TGGTGCTGCT GACTTGCTCA GTGTTGCCTT GATCGTAGCG ATTGCTCGTG GTATTCAAGT 4800 

TATCATGAAC GACGGTATGA TTACCGATAC AATCCTCAAC TGGGGTAAAG AAGGCTTGAG 4860 

CGGTCTATCT TCACAAGTCT TTATCGTTGT AACTTATATC TTCTATCTAC CTATGTCATT 4920 

CTTGATCCCA TCTTCATCTG GTCTTGCCAG CGCAACTATG GGTATCATGG CTCCACTTGG 4980 

AGAATTTGTA AATGTCCGTC CTAGCTTGAT TATCACTGCT TACCAATCTG CTTCAGGTGT 5040 

CTTGAACTTG ATTGCACCAA CATCTGGTAT TGTGATGGGA GCTCTTGCAC TTGGACGTAT 5100 

CAACATTGGT ACTTGGTGGA AATTCATGGG CAAACTCGTA GTGGCTATTA TTGTAGTGAC 5160 

CATCGCCCTT CTTCTCCTTG GAACCTTCCT TCCATTCCTA TAAAATAGTG AGTGAGGTGA 5220 

TTCCATGAAA ATAGATATAA CAAATCAAGT TAAAGATGAA TTTCTTATAT CATTAAAAAC 5280 

CTTGATTTCC TATCCTTCAG TACTCAATGA AGGAGAAAAT GGAACACCTT TTGGACAAGC 5340 

AATCCAAGAT GTCCTAGAAA AAACTTTAGA GATTTGTCGA GACATAGGTT TCACTACCTA 5400 

TCTTGACCCT AAAGGTTATT ACGGATATGC AGAAATCGGT CAGGGAGCAG AGCTTCTGGC 5460 

CATTCTCTGT CATTTGGATG TTGTTCCATC AGGTGATGAA GCAGATTGGC AGACACCGCC 5520 

ATTTGAAGCA ACTATCAAAG ACGGCTGGGT ATTCGGACGT GGTGTCCAAG ATGATAAAGG 5580 

CCCTTCGCTC GCAGCTCTCT ATGCAGTAAA AAGCTTGCTG GACCAAGGTA TTCAGTTCAA 5640 

AAAGCGCGTA CGCTTTATCT TTGGTACCGA TGAGGAAACC CTCTGGCGCT GCATGGCACG 5700 

CTACAATACC ATCGAAGAAC AGGCCAGTAT GGGCTTTGCA CCTGACTCAT CTTTTCCTCT 5760 

GACCTATGCT GAAAAAGGGC TTCTACAGGT CAAACTTCAT GGCCCTGGAT CGGATCAACT 5820 

AGAGCTTGAA GTAGGAGGCG CCTTTAACGT TGTACCAGAC AAGGCCAACT ACCAAGGTCT 5880 

CCTCTATGAA CAGGTTTGTA ACGGTCTCAA AGAAGCTGGT TATGATTACC AAACCACTGA 5940 
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ACAAACCGTA ACGGTTCTCG GAGTGCCAAA GCATGCTAAG GATGCTAGTC AAGGTATCAA 6000 

TGCTGTCATC CGACTAGCTA CCATTCTTGC TCCTCTCCAA GAACACCCTG CTCTCAGTTT 6060 

TCTTGCAACA CAAGCAGGTC AAGACGGCAC AGGAAGACAA ATCTTTGGTG ATATAGCAGA 6120 

TGAACCTTCT GGTCACCTAT CCTTTAATGT CGCAGGTCTC ATGATCAATC ATGAACGTTC 6180 

TGAAATCCGT ATTGACATTC GGACTCCTGT CTTAGCTGAC AAGGAAGAAC TAGTAGAGTT 6240 

GCTTACAAGA TGTGCACAAA ACTACCAACT CCGCTACGAA GAGTTTGACT ATCTAGCGCC 6300 

TCTATACGTC GCAGAAGACA GTAAACTCGT TAGCACACTG ATGCAAATCT ACCAAGAAAA 6360 

GACTGGCGAT AACAGTCCTG CTATTTCATC CGGTGGTGCC ACTTTTGCTC GCACCATGCC 6420 

AAATTGTGTA GCCTTCGGCG CCTTATTCCC AGGAGCGAAG CAGACAGAAC ATCAGGCAAA 6480 

TGAATGTGCC GTTCTAGAAG ATTTGTACCG TGCTATGGAT ATTTATGCCG AAGCCGTCTA 6540 

TCGACTTGCA ACTTAATCAG GCAACTGTTT CTACCAAAAA AAATCGACCG ATTAATGAAC 6600 

TGCACCCCAA AAGTTAGACA GAATAAATCT AACTTTTGGG GTGTTTTATT ATGAAATTGA 6660 

GTTATGAAGA TAAAGTTCAG ATCTATGAAC TAAGAAAGCA AGGACAAAGC TTCAAACAGC 6720 

TTTCAAAAAG ATTTGGTGTG GATGTTTCTG GTCTAAAGTC ATCTGAATCT TTGAGATGAG 6780 

CTTTATAAAT CGCTTTTTTC AGTTTTTGCA CTGGTGTTTC GATAAACTCA AACTTTTTAG 6840 

CCGTGGTATT GCCTGATTTT ATAGTATATT GAAACTAGAA TAGTACACCT CTCCTTCTAA 6900 

AACATTTTTA GAAATCGATT TGACTGTCCT GATCGATTTG TCCTGTTCTT ATTTCATTTT 6960 

ACTATATTTG AGCCACTTCG TCTTTAACGG CTTTATTCAT AAGCTCTTGT AATTTTTCTT 7020 

TACTATCAAT TACTTCTGAT TTTCCGTTGT AATTTATTGT AATAGGTTTT AACTTACCTA 7080 

ATTTCTCGAC ACGCTCATTA ATTTGATCTT TTTTGAAGGC TGCTTATGTT TTTCCTAAGA 7140 

TTTTTTCAAA AATATATTTA TCAGATAGCG GTTTGTCTTC TTCTTCAGCT TGGTTTTTGT 7200 

ATTAATTTGA AACATAAGGA ACAAATCCTT CATAGTAACC TAATGCTCCC ATAAGTTCAA 7260 

AAGCTTGTTT TCTAATTCAA ACCATTGCAA CTCAGATTTC AGCTTTTCAG ATAAATCCTG 7320 

CTCATCCAAA TAATGACTTG AAATTAGTGC TGAACTCGTT TCTGTATCCT GTACAGGCTG 7380 

AGCACCCATA CCAGCAAAAA ATAAACTCGT TCCTAGCAAG ACCGAACAAG CTCCTATTGC 7440 

ATATGGCCTC AAAGAAAAAC GCTGCTTTCT CTCAAATTGA AATTCTTTCA TCCCATCTCC 7500 

CATCATTCAT TATTACTGTA TATTTTGTAT ATCAGAAATA GTTTGTATTC ACAAATCTTT 7560 

CTAGTTATTC CCTTATCATT CCTAATTAAG GGAGATAACA TACAATAATT TTTAGTTAAA 7620 

TGTATATCGA TGTTTTTTGT TTTTCTTAAT AAACGCAATA CAAAAAGAGC CTGTTACCAA 7680 

GCTCTTTGTA CTCAATGAAA ATCAAAGAGC AAATTAGGAA ACTAGCCACA GGTTGCTCAA 7740 
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AACACCGTTT TGAGGTTGCA GATAGAACTG ACGAAgTCAG 


CTCAAAACAC 


TGTTTTGAGG 


7800 


TT/T* & *** ft T & r* 
1 luUAutA i Au 


AAL. i\jACUAA 


GTCAGTAACA 


TCTATACGGC 


AAGGCGACGC 


TGACGTGGTT 


7860 


rii/"* 7s 7v TV i\ mm 


TTCGAAGAGT 


ATTAGTCTAT 


TATTTCTTCT 


CAGCGCGAAG 


GGCTGACAAG 


7920 


ATTTGTGTTC 


GGATATCATC 


CACACCATTT 


GGAGTATTTG 


GTAAAAAGAT 


AGTTTGATTT 


7980 


CCTTTAGAGG 


CAAAGGTATT 


CAAGGTATCC 


AAATACTGGT 


TGGTCAAGAG 


GATAGACATG 


8040 


ATTTGTTCTT 


CTGTCATGCC 


AACATTGGCT 


TCCTTGAGTT 


CGGTGATAGA 


CTCTGCCAAT 


8100 


CCATCCACAA 


TCGCCTTACG 


TTGTTGGGCA 


ATCCCCACAC 


CATGAAGGCG 


GTCTTTTTCT 


8160 


GCTTCTGCTT 


CAGCTGCAGT 


GACAATTTTA 


ATCTTGTCAG 


CTTCCGCCAA 


TTCTTGTGCT 


8220 


GCGACCCGCT 


TACGTTGCGC 


CGCATTGATT 


TCATTCATGG 


ATTGCTTAAC 


TTCTGCATCT 


8280 


GGTTCGACCT 


TGGTAATCAA 


GGTTTTCACG 


ATAATGTAGC 


CGTAAGTGGT 


CATTTCTTCT 


8340 


GCTACTTGGT 


GTTGAACTTC 


AAGGGCAATC 


TCATCTTTTT 


TCTCAAACAA 


TTCATCCAAG 


8400 


GTTAATTTTG 


GAACAGAAGA 


GCGAAGAGCA 


TCTTCGATAT 


AAGATTTAAT 


CTGAGATTCT 


8460 


GGACGTATGA GTTTATAGTA AGCATCTGTC ACGCTCTGCT 


CGTTGACACG 


GTACTGAGTC 


8520 


GCTACATTCA 


TCATAACGAA 


CACATTGTCC 


TTGGTCTTAG 


TCTCAACCAC 


AATATCACTT 


8580 


TGCAACAAGC 


GCAACTGAAT 


CCGTGCTGCA 


ATCGAGTCAA 


TCCCAAAAGG 


CAAGCGAATA 


8640 


TGAATACCGC 


TATTAGCAAC 


CTTTTGGTAT 


TTCCCAAAGC 


GTTCAATAAT 


CGCCACCGAC 


8700 


TGCTGACGAA 


CCACATAAAC 


TGTACTCAGT 


GTGACTATCA 


CCAATAGGAG 


CACACAAACA 


8760 


ATCAGAAAAA 


TCATGAAAAA 


TATTGCCATA ATGGAACCTC 


CACAAGTATT 


TTTCTAGTAT 


8820 


TATAGCACAT 


TTAAAGAAGG 


CTGTGCCGTT 


TTTACTGCGA 


TTTTTCCTGA 


AATGTCAATA 


8880 


ATTAGAGGTG 


AATTGTCCTA 


TTGTCGTCCA ATCTCTTGCT 


AAAATAACTC 


TTTATAAAAG 


8940 


GCAATCGTTT 


CTTCTAAGGT 


TGGCATAAAT 


GGATTTCCTG 


GTGCGCAGGC 


ATCAATCAAG 


9000 


GCATTCTTAG 


AAAGGTATTC 


AAAGTCGAAA 


TCTTTTTCTT 


CAATACCAAG 


TTCAGTCAGT 


9060 


TTCTTAGGAA 


TACCTACTGT 


CTCAGAAAGC 


TTCTCAATCT 


CAGCAATCGC 


ATAATCGGCA 


9120 


CATTCTTGAT 


CTGATTTACC 


TTCTACATGA 


AGTCCCAAGG 


CTTTGGCAAC 


ATTGCGGAAA 


9180 


GCTTCTGGTA 


CACGTTTAGC 


ATTTTCACGT 


TCTATAACTG 


GTAGCAACAT 


GGCACAGCAC 


9240 


ACGG 












9244 



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8898 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

GATCTGAACT TTATCATCAT AACTTAATTT CATAATAAAA ACACCCCAAA AGTTAGATTT 60 

TTTCTGTCTA ACTTTTGGGG TGTAGTTCAG TCATTGGACT GACGTTTTTT TGTATGCTTA 120 

TTTTGATTTG ATGTAGTTGA TACCATCTGC TTTTGGTGCG ACTGCTTTTC CAAAGAAGGC 180 

TGCTAAGACA AGAATTGTCA AAACATAAGG TGCAATTTGA AGATAAACCG CTGGCACTCC 240 

TTGTAGGAAC GGCAATTGAG AACCGATAAC AGCCAAACTT TGTGAAAGTC CAAAGAAGAG 300 

ACTAGAAAGC ATAGCACCGA TTGGATTCCA TTTCCCAAAG ATCATCGCAG CAAGGGCGAT 360 

AAATCCAGGT CCAACAATAG TTGTCACTGA GAAGTTAACT GAGATTGATT GCGCATAAAT 420 

CGCTCCGCCA ATTCCACCTA GAAAACCTGA AATAATAACC CCTAAATATC TCATCTTGTA 480 

GACGTTGATT CCCAAGGTAT CCGCTGCTTG AGGATGTTCA CCGACAGAGC GGAGACGAAG 540 

ACCAAATTGA GTCTTAAAGA GAATAAACCA AGCAAGGAAT GAGAAGGCAA TCGCCAGATA 600 

ACCAAGTAGA CTAGTTGACT TGAAGAAGAT ATCACCAATC ACTGGGATAT TTGCCAAGAC 660 

TGGGAAATCA AAGCGTCCAA AAGTTTGACT TAGGTTGTCG GTTTGTCCTT TGTTATAAAG 720 

AACTTTAACT AAGAAAACAG CCAAGGCAGG CGCCATCAAG TTCAATACCG TACCGCTGAC 780 

AACATGGTCT GCACGGAAAT GAACCGTCGC TGCTGCGTGG ATGATAGAGA AAACACTACC 840 

AACCAATCCT GCTACAAGCA AGGATAGCCA TGGAGTTGCT GCTCCAAATT GTTCTGCAAA 900 

TTCAAGGTTA AAGACAACTC CAGAAAAGGC ACCCATAACC ATAATTCCTT CAAGGCCAAC 960 

GTTTACCACA CCACCACGTT CAGAGAAAAC ACCACCGATA CTTGTAAAGA TGAGAGGTGC 1020 

TGAGTAAATC AGCATAGAAG ACACCAAGAG GGGGAGCAAG GTTATAATAG ACATCTTTAC 1080 

TTACCTCCTT TAACTTGTTT TTTCGGTTTG ACAAAGCGTT CGATAAGGTA ATGAACACTG 1140 

ACAAAGAAGA TAATAGACGC TGTTACAATG CTGACAAGCT CAGATGGTAC CTGCGCCGCA 1200 

TTCATACCAG GAGCCCCAAC TTGGAGAACG CCAAATAGGA AGGCTGCAAA GAGTATACCA 1260 

ATTGGTGAGT TGGCCGCAAG CAAACTAACC GCCATTCCGT TAAATCCGAT AGCTAATGAC 1320 

GAACCTTGAA CATAGACGTT CTGGAAGGTT CCCAAACCTT CAACAGCTCC ACCAAGACCT 1380 

GCCAAGGCAC CTGAAATAAT CATAGATAGG ATAATAGTCC GCTTGGCAGA AATACCAGCA 1440 

TATTCTGAAG CATGTGGATT AAGACCAACT GCACGGATTT CAAAACCAAG AGTTGTTTTC 1500 

TTGAGCATGA ACCAAATAAC TGCAACGGCA ATGATGGCAA AGAAAATACC AATATTCATC 1560 

CGTGAGTTAC CAGTCAACTC AGCCAACCAA GGTGTCTGAT AGGTTGCATT AGCCCCAACA 1620 
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CGAATGGTCG AATCTGTACT TTGCATGAAG TCTTTAGGGA AAGCATGGAT AAAGGCATTC 1680 

CCTACATACA AGACAATGTA GTTCATCATG ATGGTTACAA TAACCTCTGA CGTCCCTAGA 1740 

TAGGCCCTAA GAATACCTGG AATCGCTCCG ACAATCCCAC CAGCAATCAA GGCAATCACG 1800 

ATGGTTGCTA GAATCATCAA GGGACGGGGC ATATCTGGAT GCGACAGGGC AAACCAACCA 1860 

CTGAGAATCC AACCTGCCAA AGCCTGACCA GGAAGTCCGA CGTTAAAGAA ACCAGCTCGA 1920 

CTGGCAACGG CAAAACCAAG ACCAATCAAG ACCAGAGGAC CCATAGCACG GAAGATTTCT 1980 

CCAATCCCAC GCAGACTGCC AAAGGCTGTA TAGAACAATT CTTCGTAGCC CCAAATAGCA 2040 

TCATAACCGA AGATCCACAT GACAATGGCT CCGAGTAAAA TTCCTAGGAA TACAGAAATC 2100 

AAGGGAACCG AAATTTGTTG TAATTTTTTA GACATCACTC TTCTCCTTTC CCAAGTTTCC 2160 

ACCAGCCATC AAGACACCAA GTTCTTGTTT ATTGGTTGTT TCTGGTGATA CAATACCTTG 2220 

AATCTTACCA TCGTGGATAA CGGCAATACG GTCTGAGACG TTTAAAATCT CATCCAATTC 2280 

AAAGCTGACA ACAAGGACAG CCTTGCCATT ATCACGCTCT TCAATCAAGC GTTTGTGGAT 2340 

ATACTCAATG GCACCGACAT CCAACCCACG AGTTGGCTGG CTAACGATAA GGAGATCAGG 2400 

ATCTCGATCA ATTTCACGAG CAATAATTGC TTTTTGTTGA TTTCCTCCTG AGAGTGCAGC 2460 

TGCAGGAACT AATTCACTGG CAGCGCGAAC ATCAAACTCT TCCATCAGCT TTTTAGCATA 2520 

AGAAGTAATA TTTGAATAAT TCAAAATTCC ATTTTTACTA TGTGGTTCTT TATAGTAGGT 2580 

TTGAAGGGCA ATATTTTCAG ATATCATCAT TTCCAAAATC AAGCCATCAC GGTGACGGTC 2640 

TTCTGGAACG TGCCCAACAC TTAGTTCTGT AATCTGACGT GGGTGCAAGC CTACAATTGA 2700 

ATCTCCTTTT AGCTCAATGC TACCAGATTC AACCTTACGA AGACCTGTAA TGGCTTGAAT 2760 

CAGTTCAGAC TGACCATTTC CATCAATCCC CGCAATACCA ACAATCTCTC CAGCACGAAC 2820 

ATCCAAGGAC AGATTTTTAA CAGCTGGAAC ACCACGGTTT TCATTGACCA CCAAATCTTT 2880 

GATAGACAAA ACCACTTCTT TTGGTTTAGA GGCTTGCTTC TCTGTTTTAA AGGAAACAGA 2940 

ACGTCCTACC ATCATTTCCG CCAAATCAGC ATTGGTAGCC CCTGCAATTT CAACGGTTTC 3000 

AATTGATTTC CCACGACGGA TAACTGTAAC ACGGTCAGAA ACTGCTCGAA TTTCATCCAA 3060 

TTTGTGGGTA ATCAAGATAA TTGATTTTCC TTCTTTGACA AGATTTTTCA TAATAGCCAT 3120 

CAACTCATCA ATTTCTGATG GAGTCAAAAC AGCCGTTGGT TCGTCAAAGA TAAGGATATC 3180 

AGCCCCCCGA TAAAGTGTTT TTAAAATTTC TACACGTTGT TGGGCTCCAA CTGAGATATC 3240 

TGCTACCTTG GCAGAAGGGT CAACAGCTAA GCCATAACGT TCAGAAAGAG CCTTGATTTC 3300 

TTTGCTAGCT CCAGCGATAT CTAGCACACC ATTTTTAGTC AATTCACTAC CTAAAATGAT 3360 
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GTTTTCAGCC ACTGTGAAGG CTTCAACCAA CATAAAGTGC TGGTGAACCA TCCCGATTCC 3420 

CAAGCTAGCT GCTTTAGATG GGGAGTCGAG ATTGACAACT TGACCGTTGA CCGCGATTTC 3480 

ACCACTAGTT GGTTCAAGAA GGCCTGCTAA CATGTTCATT AGCGTGGACT TACCAGCCCC 3540 

ATTTTCTCCT AAAAGTGCAT GAATTTCACC TTTTCGTAGG TGCAAGTTGA TTTTGTCGTT 3600 

GGCAACAAAT CCACCAAACA CCTTGGTAAT ATCACGCATC TCAATGACAT TTTCGTGTGC 3660 

CATGTGCTCT TCCTTTCAGA GTCTTATTTT ATTTCAATAA AACTTGCTAG TTTGTCTAGT 3720 

AGCAAGCTTT ACTTAGACAA AATGACTTTG TCTCAACTCT TAAAAAAGCG GCCCTTGGCC 3780 

GCTTCCTAAG AAATGACTTC CATCCATTAT TTTTCAGGAA CTTTTACGCT TCCATCAAGG 3840 

ATTTTAGCTT TTGCATCTTC GACAGCTTTT TTACCTTCTT CTGAAAGGTT TGTTACTGCC 3900 

AAGTCAACCC CTTTATCCTT CAATGAGTAA ACGATCACTT GACCGCCAGG GAATTCTCCT 3960 

CTTTCTGCCT TGTTAGAAAT ATCTTTTACA GTTGTACCAA CTTGTTTCAA AGTAGATACA 4020 

AGAACAAAGT TTGATTCTTT GCCATCTTTA GAAGTGTATT TACCTTCTGC TTCTTGGTCA 4080 

CGATCAACAC CGATAACCCA AACTTTTTCA TTTTCAGGAC GGCTTTCGTT GAGAGATTTT 4140 

GCCTCTGCAA AGACACCTGC ACCTGTACCA CCAGCTACTT GGTAAACAAT ATCTGCACCG 4200 

GCTGCGTATT GTGCGGCTGC AATTGTTTTA CCTTTAGCCG CATCACCAAA TGAACCAGCG 4260 

TAGTCAACTT GGACTTTGAT AGATGGGTCT ACTGACGCAA CACCAGCCTT GAATCCTGCT 4320 

TCAAAACGAG AGATAACTTC AGATTCGATA CCACCTACAA AACCAACTTG TTTTGTCTTA 4380 

GTTGTTTTTG CTGCAGCCAC ACCTGCAAGg TAACCTGACT CATTATCAGC GAAAGTTACG 4440 

CTCGCAACAT TCTTTTGGTC TTTAATCACA TCATCAATCA AGACATAGTT CAAGTCAGTG 4500 

TGTTCTTTTG CTGCATCTTT AACTGCATTA TTAAGGGCAA AACCAACACC GAAGATTAGG 4560 

TTGTAACTTC CAGCCGCTTG TTGCAAGTTG TTAGCGTAGT CAGCTTCACT TGTTGATTGG 4620 

AAGTAAGTGA AACCGTTATC TTTTGAAAGA TTGTGTTCTT TACCCCAAGC CTGCAAACCT 4680 

TCCCAAGCTG ATTGGTTGAA TGATTTGTCA TCAACACCAC CAGTATCAGT GACGATTGCT 4740 

GCTTTTGTCT TCACATCAGA AGATGAAGCT GCGTTACGAG AAGAGCGGTT ACCACATGCA 4800 

GCAAGTCCAA CTGCTGCCAC TGCAACTAGG CCAAGACCTA GCCATTGTTT CTTGTTCATT 4860 

ACTGAACCTC CTAAATAAGA TGTGCAACGA TGTTGCAAGT ATGGATTGGT TGGCCACAAG 4920 

GACCGTGCCA CTCAGAGAGC GACTCAGACT AGTTTAAGTC TGTAAAAGAG TATGGAAGTA 4980 

ATTCCCCGAC CGTCATCTCG ACCGTCGATT TATCTTTTGC GACTAAGGTC ACTTTTAGAT 5040 

CTTGTTCAAA AAATTCAGCC ATCACTTGGC GACAAGCACC ACATGGCGAG ATCGGTTTTT 5100 

CAGTTTGACC ATAGACAATC AATTCTGAAA ATTCTCTTTG GCCTTCAGAT ATAGCCTTAA 5160 
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AAATAGCTGT TCTCTCACCG CAATTGGTCA AAGGATAGCT AGCATTTTCA ATATTCACTC 5220 

CCGTGTAAAC ACTTCCGTCT TTAGCTACTA AAACTGCTCC GATAGGAAAG TGAGAATAGG 5280 

GGACATAGGC ATGTTTGCTG GTTTCAATTG CCAGTTCAAT CAACTCAGTA GTCGCCATCT 5340 

GCCAATTCTC CTTTTAAAAT AGCTACCCCA GCTGACGTTC CGATACGGGT CGCACCTGCT 5400 

TCGACAAAGG CAAGAGCATC TGCATAAGAA CGAGCTCCAC CGGCGGCCTT GACACCCATA 5460 

TCAGATCCAA CTGTTTCACG CATTAATGTA ACATCTGCTA TCGTAGCACC ACCAGTTGAA 5520 

AAGCCAGTAG ATGTTTTGAC AAAGTCAGCC CCAGCTTTTT GGGCCAATTG GCAAACAACA 5580 

ACTTTTTCTT GGTCTGTCAG AAGGCAAGCT TCAATAATGA CTTTCACTAA CTTATCACCA 5640 

CTTGCTTCCA CTACTGCGCG AATATCTGAC TCAACCAAGG CTAAATTACC TGATTTGAGA 5700 

GCTCCAACAT TGATCACCAT ATCAATCTCA TCTGCACCAT TTTGGATAGC TTCTTTTGTC 5760 

TCAAATGCTT TCACGGCTGA AGTTGTTGCT CCCAAAGGGA AACCTACTAC TGTGCAAACC 5820 

TTAACATCTG TGCCTTCAAG TCCTTTTTTA GCATGTTCAA CCCAGGTCGG ATTAACGCAA 5880 

ACACTGGCAA AGTCATACTC TCTAGCCTCA GACAACAAAC TATCAATTTG TTTTTTCTTT 5940 

GCATCTTGTT TTAAAAGCGT ATGATCTATA TATTTATTTA ATTTCATTTC GGTTTTCCCT 6000 

CCATTTAGGA GATGATTTCT ACAATTTCAC GGATTTTTTT CACTTCATCA CTTATTTTAA 6060 

CACATTTTTG GAAATCTGTA ACTAGTTGAG GTGGAATTTT TTCATTTGTG TATACTTTTG 6120 

CAACAATTTC ACCCTTTTGA ACGGAGTCTC CAATCTTCTT TTCAAAAACA ATTCCTGTTT 6180 

CATAGTCCAA GGCATCAGAC TTAACTGCAC GACCAGCACC CAGCCTCATG GCATAAAGAC 6240 

CAAAGTCCAT AGCTGGAAGA GCTGAAATGA CACCCGTTTC CTGAGCAGGG ATTTCCACCA 6300 

CATGAGCTAC ATTTACAGGA CGATAGAGGT CTTCCAAGTC TCCACCTTGG GCTTGCACCA 6360 

TTTCCTCAAA CTTAGCCAGT GCTTGACCAT TCTCAAGATG TTGGTGAACT TCTTCAACAG 6420 

TTTTGTTAAC ATTTGCCAAA CCAAGCATAA TTTGAGCCAA TTCACAAATA AAGTGGGTAA 6480 

TATCCTGACG TCCTTGACCT TGCAAAATCT CCAATGCTTC AAGGATTTCC AGACGATTTC 6540 

CAATCGCTCG TCCCAAAGGC TGGCTCATAT CCGTAATCAC TGCTACTGTC TTCCGTCCAA 6600 

CAACCTTACC AAGATCTACC ATAGTTTGAG CCAACTCACG CGCCTCATCA ACCGTCTTCA 6660 

TGAAGGCACC CTCACCGACA GTCACGTCTA GCAAAATAGC ATCCGCCCCT GCCGCAATTT 6720 

TCTTGCTCAT CACCGAACTC GCAATCAAAG GAATCGTGTC GACAGTTGCG GTCACATCAC 6780 

GAAGGGCATA GAGAAGCTTA TCTGCTTTGA CCAGCTGGTC TGATTGCCCA ATGACAGATA 6840 

CTCCAATATC CTGAACCTGA CGAATAAAAT CCTCTTGACT ACGTTCTACT TGATAGCCCT 6900 
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TAATGGACTC CAATTTATCA ATTGTTCCGC CTGTATGGCC AAGACCACGA CCACTCATTT 6960 

TTGCTACAGG CACACCGAAG CTAGCAACAA GAGGAGCTAA AATCAAGGTT ACCTTATCGC 7020 

CGACACCACC AGTAGAATGC TTGTCAACTT TCACACCATC AATGGCTGAC AGGTCAAACT 7080 

CTTGCCCAGT CTTAACCATA TTCATCGTTA AATCAGAGAT TTCTCGAGTC GTCATTCCTT 7140 

TAAAATAAAC AGCCATAGCA AAGGCAGACA TCTGATAATC AGGAACAGTT CCTGATACAT 7200 

AGCCTTCTAT CAGCCATTCA ATTTCACTTG AAGTCAGTTC TTGACCGTCT CGTTTTTTTT 7260 

GGATTAAATC AACTGCTCTC ATTCTTTCAC ACTTCTAAGG ATATAGTATC CCTTGTCTTT 7320 

TTTAAGGATT TCACAATTGC CAAACACATC TTCCATCTTA GACTTGGCAC TTGGAGCTCC 7380 

TTGTTTTTTC TGGATGACGA TGGTCAAATC TCCACCAATT TCCAAGAAAT CTTTACTTTT 7440 

CTCGATGATT TCATGAACGA CTTGCTTGCC CGCACGGATA GGAGGATTGG AAATGACATG 7500 

GTCAAATCGC CCTTGAACTC TTGCATAAAT ATTAGATTGA AATATCGTCG CTTTTGCATT 7560 

ATTTTTTTCA GCATTTCTCT GAGCTAAATC CAGGGCACGA GTGTTAATAT CAACCATGGT 7620 

CGCCTGAACT CCGTAAACCT TGACCAAGGA CAAACCTAAT GGACCATAAC CACAGCCTAC 7680 

ATCTAGGACT GTCTCTCCTT GGTTGACATC CAGACACTTG AGCAAGAGTT GACTTCCAAA 7740 

GTCAACCATT TTCTTGCTAA AAACACCCGC ATCTGTCAAA AAAGTCATTT TTTCTCCCAA 7800 

CAAGTCCACT CTCAACTCAT GAATGTCGTG AGCAGCGTCA GGATTTTCTG CATAGTACAT 7860 

TTTACTCATG ACACTATTTT ACCATAATTT GACTCAAATT GTAAATCGTT TACAAATTGA 7920 

TAATAAAACG AAAAAGACCG AAGAAAGCAA GTCACGAAGC CATTTTCTTC AATCTCTTTC 7980 

AACACTTATA AATAATAAAC CATTTAGAAC TATAAATATC ACAGTCCAGA TAAAAACAAA B040 

AAGTTTATCA TCTATAATCA GGCAGATTAT TATTTCTATT GCTTAACCTT AAAATACTTT 8100 

ATTATCAACA AAATTCCTAA CAAAATGTTT AGATAAAAGC CCAACTGATA CGTTTATGTC 8160 

AGGATTTCCA AACTTGTCCA AAGTCGTATC AAATCTTCTA GTGACATGTG GAAGAAATAA 8220 

CCCTCTGTCG CAATCCGTAG GACTAAAAAG CAATAACTAC CCGCAGCAAT CCATTTCGTC 8280 

CATCGTTTTT TAGTAAGAAA GCAATTAAGA ACGAACAAAT AAAGACAGCT GTTACAATAG 8340 

CATGTTCCAT CAAAAAAGTA AAACCGTAAT AGGTTTCCAC AAAGCATCTA CCATTATCTG 8400 

CATTGGTTCC TTTTATAAAA GGTAAAGCAA AACTTAAAAT AAAACAGAGT TCCAATATGT 8460 

AACGTTTTAA GATTTTCATA GTACACCTCC TATAAGTTGT GAACTAAAAA GCCCCCTTTA 8520 

TAAGCTTATA AATCAGTAGA ATCTATCTCC TATTTCATCA ATAAATTGAT CACTTATACT 8580 

ATATACCATT GACTTACCAC ATTCAAGAAA CCGCTTTATT TTTTTAGCTT TTTATGGTAT 8640 

GATAGACAAA ATATCTAGGG GAAAACAAAT GACCAACGAA TTTTTACATT TTGAAAAAAT 8700 
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CAGCCGCCAG ACTTGGCAAT CTTTACATCG AAAGACAACA CCTCCTTTGA CAGAAGAAGA 87 60 

ATTGGAATCT ATCAAGAGTT TTAATGACCA AATCAGTCTC CAAGACGTTA CAGATATCTA 8820 

TCTCCCCTTG GCTCATTTGA TTCAGATTTA CAAGCGAACT AAGGAAGATT TAGCCTTTTC 8880 

AAAAGGAATT TTCCTCCA 8898 



(2) INFORMATION FOR SEQ ID NO: 70: 

( i > SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 13168 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

TATCTTAACG aGGATTGGGT TTATCGTCAG TCTTATTGCC CTAATTGTGG GAACAATCCC 60 

TTAAATCATT TTGAAAATAA TCGGCCTGTA GCAGATTTTT ACTGTAATCA TTGTAGTGAG 120 

GAGTTTGAAC TAAAGAGCAA AAAAGGAAAT TTTTCATCAA CAATCAATGA TGGTGCTTAT 180 

GCAACGATGA TGAAGCGTGT GCAGGCAGAT AATAATCCTA ATTTCTTTTT TTTAACTTAC 240 

ACAAAAAATT TTGAGGTAAA TAACTTTCTT GTCCTTCCGA AGCAATTTGT TACACCGAAA 300 

TCGATTATTC AAAGAAAACC ACTTGCACCA ACTGCTAGAC GAGCAGGTTG GATTGGTTGT 360 

AACATTGATT TATCACAAGT ACCTTCTAAA GGAAGGATAT TTCTTGTGCA AGATGGACAA 420 

GTTAGAGATC CAGAAAAAGT TACAAAAGAA TTTAAGCAAG GTTTATTTTT AAGGAAGAGC 480 

TCTCTGTCAT CAAGAGGTTG GACAATAGAA ATTCTAAATT GTATAGATAA GATAGAGGGT 540 

TCAGAATTTA CCCTTGAAGA TATGTATCGT TTTGAAAGTG ACCTAAAAAA TATCTTTGTT 600 

AAGAACAATC ATATCAAAGA AAAGATTAGG CAACAGCTTC AAATATTAAG AGACAAAGAA 660 

ATAATAGAAT TTAAAGGTAG AGGAAAGTAT CGGAAATTAT GAAAACGAAA CAACTTGTTG 720 

CATCAGAAGA GGTGTATGAT TTCTTAAAAG TCATCTGGCC TGATTATGAA ACTGAAAGCC 780 

GTTACGATAA CCTAAGTTTA ATCGTCTGTA CCTTATCAGA TCCCGATTGT GTGAGATGGT 840 

TATCTGAAAA TATGAAATTT GGTGACGAAA AACAACTAGC TTTGATGAAG GAAAAATATG 900 

GGTGGGAAGT AGGAGATAAA TTGCCAGAGT GGCTACATAG CTCCTATCAT AGATTATTGT 960 

TAATAGGTGA ATTATTGGAA AGCAATCTAA AACTGAAAAA GTATACAGTA GAAATTACAG 1020 

AAACTTTATC ACGTTTAGTA AGTATAGAGG CTGAAAATCC AGATGAAGCC GAACGACTTG 1080 

TAAGAGAAAA GTATAAGAGT TGTGAAATTG TTCTTGATGC AGATGATTTT CAGGACTATG 1140 
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ACACTAGCAT ATATGAATAG GTAGATGTTT TTATTTTGTC AACAAAAAAG AGGCTCGCAC 1200 

CTCTTTTTCT TATTTCTTTT TATGATTTAA TACGGCATTG AGGACAATAG CGAGTAGGCT 1260 

GGCTACGACG ATTCCGTTTG AGAAGAACAT TTGGAAGGCT GTCGGCATGC TGACAAAGAG 1320 

ATTACTGTTG TTGAGACCGA CACCTGCAGC GATTGAAACA GCTGCGATAA GGAAGTTGTG 1380 

TTCATTGTTA GCAAAGTCAA CACGGGCGAG GATTTGCATC CCTTGAATTG ATACAAAACC 1440 

AAACATTACC AGCATGGCAC CACCGAGGAC GGAGCTTGGA ATGATTTGGG CAAGGGCGCC 1500 

AAACTTAGGA AGCAGTCCAA GGAGAACCAG GAAACCAGCT GCGTAGTAGA TTGGCAGGCG 1560 

TTTTTTGATG CCTGACAATT TAACCAAACC AACGTTTTGT GAAAATCCGG TGTAAGGGAA 1620 

GGTGTTAAAG ATTCCTCCGA GAAGTACGGC CAAACCTTCT GCGCGGTATC CGTTGCGAAG 1680 

GCGCGTGCTG TCGATTGGAT CCTTTGTGAT ATCAGACAAG GCCAGATAAA CAGCAGTTGA 1740 

CTCAACCATA GACACCGTTG CGATGATACA CATCATGACA ATAGATGAGA TTTCAAAGGT 1800 

TGGCATCCCA AAGTAGAGTG GAGTTGGGAC ATGGACAAGT GGAGCTACCG CAACAGGAGA 1860 

GAAGTCCACC AAGCCCATAG TAGCAGCAAT GGCAGTTCCA ACAACCAGAC CAATCAAAAT 1920 

AGAGATAGAC TTGATAAATC CTTTGGTAAA GATGTPGATC AAGAGGATAA TCAGAACAGT 1980 

AATAGCTGCA AGCAAGAGAC TTTGACCAGT TGGCTCTGGA ACGTTATTTC CCATATTTCC 2040 

AATAGCGACA GGGATCAAGG TTAAACCAAT CGTGGTAATA ACAGATCCTG TTACGATAGA 2100 

TGGGAAGAGA TTGGCTACTT TTGAGAAGAT GCCTGAAACA AGAACCACGT AAATCCCAGA 2160 

TGCGATAAGG GCACCAAACA TAGCGCCACT ACCATGGCTT TGCCCAATCA TAATCAAGGG 2220 

AGCGACCGAC TGGAATGCAA CTCCAAGAAC GACTGGGAGT CCAATCCCAA AGTATTTGTT 2280 

GAGTTGGAGT TGGAGGAAGG TTGCCACCCC ACACATGAAG ATATCTGTAG AAATCAGGTA 2340 

GGTCAACTGC TCAGCTGAAT AGCCAAGGGC TGTCGCAATC ATGATGGGAA CCAGGATAGA 2400 

TCCTGAGTAC ATGGCTAGTA AGTGCTGCAA GCCAAGAACG GCTGCTTGCG AGTGTTTTTC 2460 

TTGAGTTTGC ATTAGAGATC TGCCTCCTTA AATACGACTT GACCATTTTC AAAACAATCC 2520 

AAACGAGCAA GTGATAGGAC AGGGTAGCCT GCTTTTTCAA GCAAATCACG ACCATCTTGG 2580 

AAGGATTTCT CAATCACGAT ACCGATAGCT TGGACTGTGG CACCGGCCTG TTCGATGATT 2640 

TGAATCAAGC CTTTAGCAGC TTGGCCATTA GCAAGGAAAT CGTCGATAAT CAAAACCTTG 2700 

TCCTCTGGTG AGAGGAATTT TTCAGCGATA GAAACGGTGC TGGTCACCTG CTTGGTAAAG 2760 

GAGTAGACTT GAGCAGTTAA GATGCCTTCG TTCATGGTGA TGTTCTTAGC TTTTTTGGCG 2820 

AAAATCATGG GAACGTTTAA GGCTTCAGCT GTAAAAACGG CTGGGGCAAT ACCCGACGCT 2880 

TCAATGGTTA CGACCTTGGT AATGCCAGTA GTAGCAAATT TTTCCGCAAA AACCTTACCA 2940 
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ATCTCTCGCA TCAAGCTAAA GTCAACTTGG TGGGTTAAAA AGGAATCTAC CTTGAGGATG 3000 

TTATCACCCA AGATATGCCC ATCCTTGAGG ATGCGCTCTT CTAATAATTT CATAAGACCT 3060 

CCTAAAGTCT AAAAGTTAAT TTACTTGTTG TTTAAATATT TCTATAGTGA TCCCTTTTGC 3120 

TAATACTATA TATTTGATAA AACTATTACG AGCGAAGCGA GTCTTATCAA ATATTTCCCG 3180 

TTGTAGTGGT ATCATAGACA ATAATCTTGT TATTGTCTAT GACGGGATTT TTGAGAGTAA 3240 

AATAGTTCGG GGAACTATTT TAGCCTAAGC CTAGAAATGA AAGAGCTAGG GGCTCAAAAA 3300 

TTAGGGATGA AATTCCCTGG ATTCCTGAAA TTATTCACAG GATAATTTCA CCTCCCGTCC 3360 

GCACTAATTA AGGGAAATAT TAAAAAAAGA CCTACTTAAT CTCTAAGTAA GTCCCCTAAA 3420 

TAGACATGGC AAAAACGGCC ATATCTCACT GCTGACTTAC TTATTGTTAG GTGTTCCGGC 3480 

ACCTTGTAGA AACGTCGTGC CAATTCACGA CATAAACAAG TAAAACGATA TTCAATTTTA 3540 

AATAGGCTTG AGCCAATGTT TTTATTTTAC ACTAAATAAC TTTAGAAATC AACTATTTTG 3600 

TTAGTGTTTT GGTTTAAAAA ACGAACAAAA AGAAGAGAGG GTGAACAAAA ACTCCATTGT 3660 

AAGCTAACAG TTATACTAAA TGAAAATCAA AGAGCAAACT AGGAAGCTAT CCACAACCTC 3720 

AAAACACTGT TTTGAGGTTG TGGATAGAAT TGACAGAGCC AGTATCATAT ACCTACGGTA 3780 

AGGCGACGTT GACGTGGCTT GAAGAGATTT TCGAAGAGTA TTAGAAGATT TTTCCATCAT 3840 

AAAAGGCATA CTATCAAGCT TTTAGACACC TGACAATATG CCTTTTTCTA ACTTTAAAGA 3900 

CTTTTCCCAA TTTTTATTAT TCTACTCGCT AAATCTTAAA AAATAGCCAT CTGGATCCAA 3960 

AACTGCAAAT TTATGAGGAT AGATATAGGG ATCACTGACA CGAAACTTTC TTTTGGTCAA 4020 

GGGACGATAA ATAGGATAGT TTGCCTTCAT CACTCTTTAA TAGAGTTTTG AAACATCCTT 4080 

TATGCCAAAG GAGAGATTGA CTCCACGACC AAAGGGATAG GTCAGTTCAG CTAGTTGATC 4140 

CTTTGTTCCC TCCTCTAACA TTAGTTGACA CTCTTCAAGA GAAAGAGAAA GTTTTCTTCT 4200 

GGACGTTGGT ATTCAATCCT AAAACCCAGT AAACCACAGT AGAAGGACCG GGACTGTTCG 42 60 

ATATTCGATA CAAGCAACTC GGGAATGACC GCATTGTAGT CCATATAGAA AATCCTTACA 4320 

AGTCAATTTC CAAGACAATC GGTGTATGGT CTTGGCGAGC ACCTGAGTCA ATCATATCAG 4380 

ATTTAGTGAC CTTGTCAGCG ATACGGTTAC TTGTGAGCCA GTAGTCGATT CTCCAGCCTG 4440 

TATTGTTGAT TTTAGAAGTT TTGCTGCGTT GTGCCCACCA AGTGTAGCGT TCAGGAACAT 4500 

CGCCATGAAC ATGGCGGAAG GTGTCTGTAA ATCCAGTTGC CAAAAGGTTG GTAAATCCAG 4560 

CACGTTCCTC GTCAGTAAAT CCAGGTGAAC GGCGGTTGCT AGCAGGATTT GCAAGGTCGA 4620 

TTTCATTGTG GGCTACGTTG TAGTCACCGG TCGCAAGGAC TGGTTTTTCT TTGTCTAGTT 4680 1 
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CAGCCAAATA CTCAGCATAT TTGGCATCCC AGACTTGGCG TTCTTCCAAG CGTTTGAGAC 4740 

CGTCACCAGC GTTTGGAGTG TAAACTTGGG TTACGAAAAA TGCATCAAAT TCTAGAGTGA 4800 

TGATACGACC TTCCAAGTCC ATGGTAGAAG GGGCACCGAT TTCTGGGAAG CTGATAGTAG 4860 

GTGTAAGTTC TTTCTTATAA AGGAACATGG TTCCAGCATA GCCTTTACGG GCAGGCTCTT 4920 

GGGAAGAGCG CCACGTGTTT TCGTAGCCTG GGAAGAGTTC TTCTAAAATT TCCACGTGTT 4980 

TCTTTGTAGG TCCTTTGGCA GAAAGCTTGG TTTCTTGGAT AGCAATGATA TCAGCATTTT 5040 

CAGCGACCAA GGTTTGTAGG ACTTCTTGGG ACAATTTGGC ACGAGCTGAG TCACTAGTTA 5100 

GGGCAGCGTT TAGGGAATCA ATATTCCATG AGATAAGTTT CATAAAGTTA CCTTTTTCAT 5160 

TCAGATTATA GATTTTATTA TACCAAAAAA AGATCTATTT CCCCAACGTA TGGTTTGAAA 5220 

AATTACTCTC TTTCGTTTAT AATTAAGAAT GATTTTATGA AAGGGAGTGA AAATACATGA 5280 

AATTCTACTC TTATGACTAT GTACTCAGCC AAATCGGTCA GCAAAATGGT ATCATGGTTG 5340 

GCTTTGGGAT TGTTCTATTA GCTGTGACAG TTTTTTTTGC TTTCAAGGCA TACCATAATA 5400 

AAAAGGGAAG CGAATTTCGT GAGTTGGTCA TGATTTCAGA TCTGGCCTTA TTTAGCTCTG 54 60 

CTTTTGGTCA GCATCACGAC TTATCAAAAC AATCAAGTTT CTAACAATAA ATTTCAAACT 5520 

TCACTTCATT TCATCGAGGT TGTTTCCAAA GATTTGTGAG TAGACAAGTC AGAAGTCTAT 5580 

GTTAATACTT CCACAAACAC AGATGGCGCA CTTATCAAGG TGGGAGATCG CTATTATCGT 5640 

GCCCTAAATG GAAGTGAGCC AGACAAGTAC CTGTTAGAGA AAGTCGAATT GTATAAGACA 5700 

GACGCAATTG AACTGGTGGA TGTGAACAAA TGACACTTAA TTATATCGAA ATTTTAATCA 57 60 

AACTGGTCTT GACTCTCAAA TAGCTCAACA ACAATGTTCA CTTTGTGAAA CGTTTGATTG 5820 

ATGGTAAGCC AACTCTCCTT ATCAAAAATG GGAATATTGA CCCAGAAGCC TGTCGTTCAG 5880 

TTGGTTTGTC TGCATCGGAT GTATCCCTCA AACTTCGTAG CCAAGGGATT TTCCAGATGA 5940 

AGCAAGTCAA ACGAGCTGTG CAAGAGCAAA ATGGGCAACT CATCGTTGTG CAAATGGGAG 6000 

ATGAAAATCC TAAGTATCCA GTTGTGACTG ACGGTGTGAT TCAAGTAGAT GTCTTGGAAT 6060 

CGATTGGTCG TAGCGAAGAG TGGTTGCTTG ATAACCTCAG TAAACAAGGG CATGACAATG 6120 

TAGCCAATAT CTTTATTGCT GAATATGACA AGGGTGCTGT TACAGTCGTA ACTTATGAAT 6180 

AAGAAAAACC TGGGGTCTTG TACTCTTCGA AAATCTCTTC AAACCGCGTC AACGTCGCCT 6240 

TGCCGTATGT AGGTTACTGA CTTCGTCAGT TCTATCTACA ACCTCAAAGC AGTGCTTTGA 6300 

GCAGCCTGCG GCTAGTTTCC TAGTTTGCTC TTTGATTTTC ATTGAGTATT GGCCTCAGGT 6360 

TTCCATTTGC AATCAGAAAG GGATTTTATG TCCATTATTC AAAAACTTTG GTGGTTTTTC 6420 

AAGTTAGAAA AACGCCGTTA TCTAGTCGGA ATTGTGGCCC TGATCTTGGT TTCCGTCCTC 6480 
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AATCTCATTC CTCCTATGGT TATGGGGCGG GTCATTGATG CCATCACATC GGGGCAATTA 6540 

ACCCAGCAGG ACCTCCTTCT TAGCCTATTT TACTTGCTAC TTGCAGCCTT TGGTATGTAC 6600 

TATTTGCGCT ATGTGTGGCG TATGTATATC CTTGGGACCT CTTATTGCTT GGGACAGATC 6660 

ATGCGGTCTC GCTTGTTTAA GCATTTCACA AAAATGTCGT CAGCCTTTTA TCAAACCTAT 6720 

CGGACGGGTG ATCTGATGGC ACACGCAACC AATGATATCA ATGCCTTGAC TCGTTTAGCA 6780 

GGTGGCGGTG TCATGTCTGC GGTGGATGCC TCTATCACGG CTCTGGTGAC TTTGTTGACC 6840 

ATGCTCTTTA GCATCTCATG GCAGATGACT CTTGTTGCCA TTCTCCCCCT ACCTTTCATG 6900 

GCCTATACGA CTAGTCGCCT AGGGAGAAAG ACTCATAAGG CCTTTGGCGA ATCCCAAGCT 6960 

GCTTTTTCTG AACTCAATAA CAAGGTACAG GAGTCCGTAT CAGGTATCAA AGTGACCAAG 7020 

TCTTTCGGTT ATCAGGCAGA CGAGTTGAAG TCTTTTCAGG CAGTCAATGA ATTAACCTTC 7080 

CAAAAGAACC TGCAAACCAT GAAATATGAT AGTCTCTTTG ACCCTATGGT TCTCTTGTTT 7140 

GTTGGTTCGT CCTATGTTTT AACGCTTTTG GTTGGCTCCT TGATGGTTCA GGAAGGGCAG 7200 

ATTACAGTTG GGAATCTAGT CACCTTTATC AGCTATTTGG ATATGCTGGT CTGGCCTCTT 7260 

CTGGCCATCG GTTTCCTCTT TAATACTACT CAGCGAGGGA AGGTTTCTTA CCAGCGGATT 7320 

GAAAATCTTT TGTCTCAGGA ATCTCCTGTA CAAGACCCTG AGTTTCCTCT GGATGGTATT 7380 

GAAAATGGGC GTTTGGAGTA TGCCATTGAC AGCTTTGCTT TTGAAAATGA GGAAACACTG 7440 

ACGGATATTC ACTTTAGTTT GGCAAAAGGG CAAACACTGG GCTTGGTTGG GCAGACAGGC 7500 

TCTGGGAAAA CGTCCTTAAT CAAGCTCCTC TTGCGTGAAT ACGATGTGGA TAAGGGTGCC 7560 

ATTTATCTAA ACGGTCACGA TATTCGGGAC TATCGTCTGA CAGACCTTCG CAGTCTCATG 7620 

GGCTATGTTC CTCAGGACCA GTTTCTTTTT GCGACTTCAA TCCTAGACAA TATCCGCTTT 7680 

GGCAATCCTA ACTTGCCCCT TTCAGCGGTC GAGGAAGCTA CTAAGCTAGC CCGGGTTTAC 7740 

CAAGATATTG TAGACATGCC TCAAGGATTT GATACGCTGA TTGGTGAAAA AGGAGTCACT 7800 

CTTTCTGGTG GTCAAAAGCA ACGGTTGGCT ATGAGTCGGG CTATGATTTT AGACCCTGAT 7860 

ATCTTGATTT TGGATGATTC CTTATCCGCC GTAGATGCCA AGACAGAGTA TGCGATTATC 7920 

GACAACCTCA AGGAGATGCG AAAGGACAAG ACAACCATTA TCACTGCCCA TCGCCTCAGT 7980 

GCTGTTGTCC ATGCAGATTT TATTTTAGTT CTACAAAATG GTCAAATTAT CGAACGAGGC 8040 

ACGCACGAAG ACTTGCTAGC TTTGGATGGC TGGTATGCCC AAACCTACCA GTCTCAGCAG 8100 

TTGGAAATGA AAGGAGAAGA AGATGCAGAA TAAACAAGAA CAATGGACTG TATTGAAGCG 8160 

CTTGATGTCT TATCTCAAGC CTTATGGACT CCTGACCTTT TTGGCACTCA GTTTTCTCCT 8220 
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AGCGACGACG GTCATTAAAA GTGTCATACC CCTCGTGGCT TCCCACTTTA TCGACCAGTA 8280 

TCTCAGCAAT CTTAACCAAC TAGCCGTTAC CGTTTTGCTG GTCTACTATG GTCTCTACAT 8340 

CCTACAAACT GTAGTTCAGT ATGTCGGCAA TCTTCTCTTT GCGCGCGTGT CTTACAGTAT 8400 

TGTTAGGGAT ATTCGTCGGG ATGCCTTTGC CAATATGGAG AAACTGGGCA TGTCTTACTT 8460 

TGACAAGACG CCAGCAGGTT CTATCGTTTC TCGTTTGACC AACGATACCG AGACGATTAG 8520 

TGATATGTTT TCTGGGATTT TATCCAGCTT TATCTCAGCA GTTTTTATCT TTCTGACAAC 8580 

CCTTTATACC ATGTTGGTGC TGGATTTTCG TTTGACGGCT TTAGTCTTGC TCTTTCTTCC 8640 

TTTGATTTTC CTTTTGGTCA ATCTCTATCG AAAAAAGTCA GTGAAAATCA TCGAGAAAAC 8700 

CAGAAGTCTC TTGTCAGATA TCAATAGTAA GCTGGCAGAG AATATCGAGG GAATCAGGAT 8760 

TATTCAGGCC TTTAATCAAG AGAAGCGCCT GCAGGCAGAA TTTGATGAAA TCAACCAAGA 8820 

ACACTTGGTC TACGCCAACC GTTCTGTAGC CTTGGATGCC CTCTTTTTGA GACCTGCCAT 8880 

GAGTTTGCTG AAACTTCTAG GCTATGCAGT CTTGATGGCC TACTTTGGCT ACCGTGGTTT 8940 

TTCTATCGGG ATAACGGTCG GGACCATGTA TGCCTTTATC CAGTACATCA ACCGCCTTTT 9000 

TGACCCCTTG ATTGAGGTGA CGCAAAACTT TTCAACTCTG CAAACGGCTA TGGTTTCTGC 9060 

AGGTCGTGTC TTTGCCCTGA TAGACGAGAG GACCTATGAA CCTCTTCAAG AAAATGGGCA 9120 

AGCCAAAGTC CAAGAAGGCA ATATCCGTTT TGAACATGTG TGTTTCTCAT ATGACGGTAA 9180 

ACATCCGATT CTGGATGACA TTTCTTTCTC TGTTAATAAG GGTGAAACCA TTGCCTTTGT 9240 

AGGTCATACA GGTTCAGGGA AATCGTCTAT TATCAATGTC CTCATGCGCT TTTATGAATT 9300 

CCAGTCAGGG AGAGTTCTCT TGGATGATGT GGATATCAGG GATTTCAGTC AAGAAGAGCT 9360 

GAGAAAAAAC ATCGGTTTGG TCTTGCAGGA ACCCTTCCTC TATCATGGAA CTATTAAGTC 9420 

CAATATCGCC ATGTACCAAG AAACCAGTGA TGAGCAGGTT CAGGCTGCGG CAGCCTTTGT 9480 

GGATGCAGAT TCCTTTATTC AAGAACTTCC TCAGGGGTAC GACTCCCCTG TTTCCGAGCG 9540 

TGGTTCGAGC TTCTCTACTG GGCAACGCCA GCTTCTTGCC TTTGCTAGAA CAGTCGCCAG 9600 

CCAGCCTAAA ATCCTGATTT TGGATGAAGC GACAGCCAAT ATTGACTCTG AAACAGAAAG 9660 

CTTGGTTCAA GCTTCTCTGG CGAAGATGAG ACAGGGCCGA ACAACTATTG CTATCGCTCA 9720 

CCGCCTTTCT ACTATTCAAG ATGCCAACTG CATCTATGTC TTGGATAAGG GACGCATTAT 9780 

CGAGAGTGGA ACCCATGAGG AACTCTTGGC TCTGGGAGGA ACCTATCACA AGATGTATAG 9840 

TTTGCAGGCA GGGGCCATGG CCGATACTCT TTGAAAATCT CTTTAAACCA TGTCAGCTTT 9900 

ATCTGCAATC TCAAAGCTGT ACTTTGATTT TCATTGAGTA CTAGAAGGAA ATCCTTCAAA 9960 

TTACAGATTT CTTTCACCGC CTTTTCCATT TTGTGGTATA ATGAAAAATG TTGACAAATA 10020 
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GTATAATAAA 


AACAAAGGAG 


AACAGCATGC 


TGAAATGGGA 


AGACTTGCCT 


GTGGAAATGA 


10080 


AATCAAGCGA 


GGTTGAGTCT 


TACTACCAGC 


TTGTCTCTAA 


AAGGAAGGGT 


TCGCTGATTT 


10140 


TCAAGCGTTG 


CTTGGACTGG 


GTTTTGGCCT 


TGGTCTTACT 


GGTTCTGACC 


TCTCCCATCT 


10200 


TTCTCATCTT 


GAGCATTTGG 


ATCAAGTTGG 


ATAGCAAAGG 


GCCAGTGATT 


TACAAGCAAG 


10260 


AGCGTGTGAC 


CCAGTACAAC 


CGTCGGTTCA 


AGATTTGGAA 


GTTTCGTACC 


ATGGTGACGG 


10320 


ATGCGGATAA 


AAAAGGAAGT 


CTGGTGACTT 


CTGCTAACGA 


TAGCCGCATT 


ACCAAGGTTG 


10380 


GAAATTTCAT 


CCGACGTGTC 


CGTTTGGACG 


AACTGCCTCA 


GTTGGTCAAT 


GTCCTTAAAG 


10440 


GTGAGATGTC 


CTTTGTCGGT 


ACACGACCTG 


AAGTGCCACG 


TTATACAGAG 


CAGTATAGCC 


10500 


CTGAAATGAT 


GGCAACCTTG 


CTCTTGCAAG 


CAGGGATTAC 


CTCTCCAGCC 


AGCATCAACT 


10560 


ACAAGGATGA 


GGACACAATT 


ATCAGTCAAA 


TGACGGAGAA 


AGGTCTGTCA 


GTTGATCAGG 


10620 


CCTATGTGGA 


GCATGTTCTT 


CCTGAAAAGA 


TGCGCTATAA 


CCTCGCCTAT 


CTCCGAGAGT 


10680 


TTAGTTTCTT 


TGGGGACATC 


AAAATCATGT 


TTCAAACCGT 


GTTTGAGGTA 


CTAAAATAAA 


10740 


GTAGTCATAA 


GAAAATGAGT 


ACAGATAAAA 


GGAGCAAATC 


AATGCCAAAT 


TACAATATTC 


10800 


CATTTTCACC 


GCCTGATATC 


ACAGAAGCAG 


AAATTACTGA 


AGTAGTGGAT 


ACCCTGCGTT 


10860 


CTGGTTGGAT 


CACAACAGGT 


CCTAAAACAA 


AAGAACTGGA 


GCGCCGCTTG 


TCTCTTTACA 


10920 


CACAGACACC 


TAAGACTGTT 


TGTCTCAACT 


CTGCGACAGC 


CGCTCTGGAG 


TTGATTTTAC 


10980 


GCGTTTTGGA 


AGTGGGACCT 


GGTGATGAAG 


TCATCGTTCC 


AGCCATGACC 


TATACGGCTT 


11040 


CATGTAGTGT 


CATTACGCAC 


GTGGGAGCAA 


CCCCTGTCAT 


GGTGGATATC 


CAAGCAGATA 


11100 


CGTTTGAGAT 


GGACTATGAC 


CTGCTTGAGC 


AAGCTATCAC 


TGAGAAAACT 


AAGGTGATTA 


11160 


TTCCAGTAGA 


GCTCGCAGGG 


ATTGTTTGCG 


ATTATGACCG 


TTTGTTCCAA 


GTCGTGGAGA 


11220 


AAAAACGTGA 


CTTCTTTACC 


GCTTCAAGCA 


AGTGGCAAAA 


GGCCTTTAAC 


CGTATTGTCA 


11280 


TTGTCTCTGA 


TAGTGCCCAC 


GCTTTGGGAT 


CTATTTATAA 


AGGACAACCT 


TCTGGTTCTA 


11340 


TCGCTGACTT 


TACTTCCTTC 


TCATTCCATG 


CAGTTAAGAA 


CTTTACAACG 


GCAGAAGGTG 


11400 


GAAGTGCGAC 


TTGGAAAGCC 


AATCCAGTGA 


TTGATGACGA 


AGAGATGTAC 


AAGGAATTCC 


11460 


AAATCCTTTC 


CCTTCACGGG 


CAAACTAAGG 


ATGCTCTTGC 


CAAGATGCAA 


CTGGGGTCAT 


11520 


GGGAATACGA 


TATCGTTACA 


CCAGCCTATA 


AGTGCAACAT 


GACCGATATC 


ATGGCTTCAC 


11580 


TTGGTTTGGT 


ACAATTGGAC 


CGCTATCCAA 


GTTTGTTGCA 


ACGCCGTAAG 


GACATTGTGG 


11640 


ACCGCTATGA 


TAGTGGTTTT 


GCAGGTTCTC 


GCATCCATCC 


TTTGGCACAC 


AAGACTGAAA 


11700 


CTGTCGAATC 


TTCACGCCAC 


CTCTACATCA 


CCCGTGTAGA 


AGGAGCAAGC 


CTAGAAGAAC 


11760 
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GCAACCTCAT 


CATCCAAGAA 


TTGGCTAAAG 


CAGGAATTGC 


AAGTAATGTT 


CACTACAAAC 


11820 


CGCTTCCTCT 


CTTGACAGCC 


TATAAGAATC 


TTGGATTTGA 


TATGACGAAC 


TATCCTAAGG 


11880 


CCTATGCCTT 


CTTTGAGAAT 


GAAATTACCC 


TCCCTCTTCA 


TACTAAATTA 


AGCGATGAAG 


11940 


AAGTAGACTA 


TATCATTGAG 


ACTTTCAAAA 


CAGTTTCTGA 


AAAAGTGCTA 


ACTTTATCAA 


12000 


AAAAATGACA 


AACTACAGTC 


AAGCGAAAGT 


GATCCTGCCC 


CTAAAAAGTC 


TAATTGAGTG 


12060 


TAAAAACTGT 


TGTTTTCAAT 


TGATAATAGT 


TTACACCTGT 


AGTTGAGGCC 


CCTTTCTCCT 


12120 


CAGAGAGAGA 


ATTTTTATAG 


GATTTTCCTT 


TCTTGTGGGA 


GTCCCGTGGT 


TTGAAATAAG 


12180 


ATGTGAGCAA 


TTTAGTGTAG 


CATTTAGAAT 


CCTTACTAGA 


CATCATTTAG 


AAAATCTAGT 


12240 


GTCTTGTTCT 


AGTTTTCAAT 


TCACCCTATT 


TTTTGAAAGA 


CGTGAGTTTC 


CATGAGTGAG 


12300 


ATTGTGGAAA 


CTCGCGTCTT 


TTTTTGTTTT 


CAGAATATTG 


TTCAAAATTT 


TGTGCCTGTC 


12360 


TTTCATGTTC 


TAGTCATTCT 


TTTGCATGAT 


AGAATTTATA 


GCATGTTGAT 


ATTATAATAA 


12420 


TACAAATATT 


CTATATGTTT 


AGTGATGCTT 


GCTATACATT 


ATTAGATCTC 


CTGCGAGACA 


12480 


ATCTATAAAA 


CACTTGTCTA 


CGATTACGTA 


TATGCCCTAT 


TCCAGTATTT 


TAGAAGCACT 


12540 


GCATCTATTT 


TTATCGAGGT 


TAAATCTAGC 


TTTTATAGAA 


GGTCTATTTA 


AGAAATATAT 


12600 


TGTAGTGTTT 


TAGTTTCAAT 


CCGCCATATG 


AGCGATATTC 


AGGTAAATAT 


CCCTGGCGAA 


12660 


TGCTTGTATG 


ACAAGGTATT 


TGTTCTTTCA 


TTTATAATTT 


ACAACATATC 


AACAAATTTA 


12720 


AATATAGTAA 


ATGGGATATT 


TTATATTCAA 


GCTAAGAAAG 


AlAuLAi\.AL 


TTTTGAATGG 


12780 


AAGGCTAAAG 


AGCAAACTAG 


GAAGTTGGCC 


ATAGATAGCT 


CAAAACCCTG 


CTTTGAGGTT 


12840 


GTAGATATAG 


TAAAATGAAA 


TGAGAATAGG 


ACAAATTGAT 


CGGGACAGTC 


AAATCGATTT 


12900 


CTAACAATGT 


TTTAGAAGTA 


GAGGTGTACT 


ATTTTAGTTT 


CAGTCTACTA 


TAGAACTGAC 


12960 


CAAGTCAGTA 


ACCTAGACTT 


AGGGCAAGGC 


GGCACTGACC 


TAGTTTGAAG 


AGATTTCCGA 


13020 


AGAGTATAAA 


TTTTAATATT 


TTCTTGTGTT 


ATTCCTTGAC 


AATTCAATTT 


GGAAAATATA 


13080 


TGATAAAGAT 


AATGACAGCG 


GTGTCATTCT 


ATCTATTTTA 


AGAAAAGTAA 


TAATCAATTG 


13140 


TTAAAAATAG 


TAAAAAAATT 


GGAGGTTCTG 


ATGAAATATT 


TTGTTCCG 




13188 


(2) INFORMATION FOR SEQ ID NO: 71: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32768 base pairs 

(B) TYPE: nucleic acid 
(C> STRAND EDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
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AACGAGTGCA TCAGTCTCAG CAAGCACCAG TGCGTCGGCC TCAGCAAGCA CCAGCGCGTC 60 

TGAATCCGCA TCAACCAGTG CCTCAGCTTC AGCAAGTACC TCAGCATCTG AATCAGCATC 120 

AACAAGTGCA TCGGCTTCAG CAAGCACAAG TGCTTCAGCC TCAGCAAGTA TCTCAGCGTC 180 

TGAATCGGCA TCAACGAGTG CGTCCGCTTC AGCAAGTACT AGCGCCTCAG CATCAGCGTC 240 

AACAAGTGCT TCGGCTTCAG CGTCAACGAG TGCGTCTGAG TCAGCATCAA CGAGTACGTC 300 

AGCCTCAGCA AGCACATCAG CTTCTGAATC TGCATCAACC AGTGCGTCAG CCTCAGCATC 360 

GACAAGCGCC TCAGCTTCAG CAAGTACCAG TGCGTCAGCC TCAGCAAGTA CCAGTGCTTC 420 

AGCCTCAGCG TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG CATCAACCAG 480 

TGCGTCAGCC TCAGCAAGTA CTAGCGCCTC AGCCTCAGCA TCAACGAGTG CGTCCGCTTC 540 

AGCAAGTACT AGTGCATCAG CATCAGCATC AACGAGTGCA TCGGCTTCAG CAAGTACCAG 600 

CGCCTCAGCT TCAGCAAGCA CCAGTGCGTC AGCCTCAGCA AGTACCAGCG CCTCAGCCTC 660 

AGCAAGCACC AGTGCCTCAG CTTCAGCAAG TACCAGTGCG TCAGCCTCAG CGTCGACAAG 720 

TGCGTCGGCT TCAGCAAGTA CCTCAGCGTC TGAATCAGCA TCAACGAGTG CATCAGCTTC 780 

AGCATCAACA AGTGCTTCAG CTTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG 840 

TGCGTCCGCT TCAGCAAGTA CTAGCGCCTC AGCATCAGCG TCAACAAGTG CTTCGGCTTC 900 

AGCGTCAACG AGTGCGTCTG AGTCAGCATC AACGAGTACG TCAGCCTCAG CAAGCACATC 960 

AGCTTCTGAA TCTGCATCAA CCAGTGCGTC AGCCTCAGCA TCGACAAGCG CCTCAGCTTC 1020 

AGCAAGTACC AGTGCGTCAG CCTCAGCAAG TACCAGTGCT TCAGCCTCAG CGTCGACAAG 1080 

TGCGTCGGCC TCAACCAGTG CATCTGAATC GGCATCAACC AGTGCGTCAg CCTCAGCAAG .1140 

TACTAGCGCC TCAGCCTCAG CATCAACGAG TGCGTCCGCT TCAGCAAGTA CTAGTGCATC 1200 

AGCATCAGCA TCAACGAGTG CATCGGCTTC AGCAAGTACC AGCGCCTCAG CTTCAGCAAG 1260 

CACCAGTGCG TCAGnCTCAG CAAGTACCAG CGCCTCAGCC TCAGCAAGCA CCAGTGCCTC 1320 

AGCTTCAGCA AGTACCAGTG CGTCAgCCTC AGCGTCGACA AGTGCGTCGG CTTCAGCAAG 1380 

TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT TCAGCATCAA CAAGTGCTTC 1440 

AGCTTCAGCA AGTACCAGTG CGTCGGCTTC AGCATCAACG AGTGCTTCAG TCTCAGCGTC 1500 

AACCAGTGCC TCTGAATCAG CATCAACAAG TGCCTCGGCT TCAGCAAGCA CCAGTGCGTC 1560 

GGCTTCAGCA AGTACTAGTG CATCGGCTTC AGCATCGACA AGTGCGTCTG AATCGGCATC 1620 

AACGAGTGCT TCGGCTTCAG CATCAACGAG TGCGTCAGCC TCAGCAAGCA CATCAGCTTC 1680 

TGAATCTGCA TCAACCAGTG CGTCCGCTTC AGCGTCAACC AGTGCGTCGG CTTCAGCGTC 1740 
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GACAAGTGCT TCGGCTTCAG CATCAACGAG TGCGTCGGCC TCAGCAAGCG CAAGTACCTC 1800 

AGCGTCAGct TCCGCCTCAA CCAGTGCGTC GGCTTCAGCA AGCACAAGTG CGTCAGCCTC 1860 

AGCAAGTATC TCAGCGTCTG AATCGGCATC AACGAGTGCG TCTGAGTCAG CATCAACGAG 1920 

TACGTCAGCC TCAGCAAGCA CATCAGCTTC TGAATCTGCA TCAACCAGTG CGTCAGCCTC 1980 

AGCATCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCT TCAGCCTCAG CGTCGACAAG 2040 

TGCGTCGGCC TCAACCAGTG CATCTGAATC GGCATCAACC AGTGCGTCAG CCTCAGCAAG 2100 

TACTAGTGCA TCAGCTTCAG CATCAACGAG TGCATCGGCT TCAGCATCAA CCAGTGCCTC 2160 

GGCTTCAGCG TCAACCAGTG CGTCAGCTTC AGCAAGTACC AGTGCTTCAG TCTCAGCATC 2220 

AACAAGTGCT TCAGCCTCAG CATCGACAAG TGCCTCGGCT TCAGCAAGCA CATCAGCATC 2280 

TGAATCAGCG TCAACCAGTG CTTCGGCTTC AGCAAGTACC AGTGCTTCAG CTTCAGCATC 2340 

AACCAGCGCC TCGGCCTCAG CAAGCACCTC AGCTTCTGAA TCGGCCTCAA CCAGCGCCTC 2400 

GGCCTCAGCA AGCACCTCAG CTTCTGAATC GGCCTCAACC AGCGCCTCAG CCTCAGCATC 2460 

AACGAGTGCT TCGGCTTCAG CAAGCACAAG CGCCTCGGGT TCAGCATCAA CGAGTACGTC 2520 

AGCTTCAGCG TCAACCAGTG CTTCAGCCTC AGCATCAACA AGTGCGTCAG CCTCAGCAAG 2580 

TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCTGAG TCAGCATCAA CGAGTACGTC 2640 

AGCCTCAGCA AGCACCTCAG CTTCTGAATC GGCCTCAACC AGTGCGTCAG CCTCAGCATC 2700 

GACAAGCGCC TCAGCTTCAG CAAGTACCAG TGCTTCAGCC TCAGCGTCGA CAAGTGCGTC 2760 

GGCCTCAACC AGTGCATCTG AATCGGCATC AACCAGTGCG TCAGCCTCAG CAAGTACTAG 2820 

TGCATCGGCT TCAGCATCAA CCAGTGCCTC GGCTTCAGCG TCAACCAGTG CGTCAGCTTC 2880 

AGCAAGTACC AGTGCTTCAG TCTCAGCATC AACAAGTGCT TCAGCCTCAG CATCGACAAG 2940 

TGCCTCGGCT TCAGCAAGCA CATCAGCATC TGAATCAGCG TCGACAAGCG CCTCAGCTTC 3000 

AGCAAGTACC AGTGCGTCAG CCTCAGCGTC GACAAGTGCG TCAGCCTCAG CAAGTACTAG 3060 

TGCATCAGCT TCAGCATCAA CGAGTGCATC GGCTTCGGCG TCAACCAGTG CATCAGAGTC 3120 

AGCAAGTACC AGTGCGTCAg CTTCCGCATC AACAAGTGCC TCGGCTTCAG CAAGCACCAG 3180 

TGCGTCGGCT TCAGCAAGTA CTAGCGCCTC AGCCTCAGCC TCAACCAGTG CGTCAGCCTC 3240 

AGCAAGTATC TCAGCGTCTG AATCGGCATC AACGAGTGCG TCCGCTTCAG CAAGTACTAG 3300 

CGCCTCAGCC TCAGCGTCAA CAAGTGCATC GGCTTCAGCG TCAACGAGTG CGTCTGAATC 3360 

GGCATCAACG AGTGCGTCCG CTTCAGCAAG TACTAGCGCC TCAGCCTCAG CGTCAACAAG 3420 

TGCATCGGCT TCAGCATCAA CGAGTGCGTC CGCTTCAGCA AGTACTAGCG CCTCAGCGTC 3480 

AGCGTCAACA AGTGCATCGG CTTCAGCGTC AACGAGTGCG TCTGAGTCAG CATCAACGAG 3540 
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TGCGTCAGCC TCAGCAAGCA CATCAGCTTC TGAATCTGCA TCAACCAGTG CGTCAGCCTC 3600 

AGCATCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCG TCAGCCTCAG CGTCGACAAG 3660 

TGCGTCGGCT TCAGCAAGTA CCAGTGCGTC AGCCTCAGCA AGTACCAGTG CGTCAGCCTC 3720 

AGCGTCGACA AGTGCGTCGG CCTCAACCAG TGCATCTGAA TCGGCATCAA CCAGTGCGTC 3780 

AGCCTCAGCA AGTACTAGTG CATCAGCTTC AGCATCAACG AGTGCATCGG CTTCAGCATC 3840 

AACCAGTGCA TCAGAGTCAG CAAGTACCAG TGCGTCAGCT TCCGCATCAA CAAGTGCCTC 3900 

GGCTTCAGCA AGTACTAGCG CCTCAGCCTC AGCGTCAACA AGTGCTTCAG CTTCCGCGTC 3960 

AACCAGCGCC TCGGCCTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CAAGTGCCTC 4020 

GGCTTCAGCA TCAACGAGTG CATCAGTCTC AGCAAGCACC AGTGCGTCGG CCTCAGCAAG 4080 

CACCAGCGCG TCTGAATCCG CATCAACCAG TGCCTCAGCT TCAGCAAGTA CCTCAGCATC 4140 

TGAATCAGCA TCAACAAGTG CCTCGGCTTC AGCAAGCACA AGTGCTTCAG CCTCAGCAAG 4200 

TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCCGCT TCAGCAAGTA CTAGCGCCTC 4260 

AGCATCAGCG TCAACAAGTG CTTCGGCTTC AGCGTCAACG AGTGCGTCTG AGTCAGCATC 4320 

AACGAGTACG TCAGCCTCAG CAAGCACATC AGCTTCTGAA TCTGCATCAA CCAGTGCGTC 4380 

AGCCTCAGCA TCGACAAGCG CCTCAGCTTC AGCAAGTACC AGTGCGTCAG -CCTCAGCAAG 4440 

TACCAGTGCT TCAGCCTCAG CGTCGACAAG TGCGTCGGCC TCAACCAGTG CATCTGAATC 4500 

GGCATCAACC AGTGCGTCAG CCTCAGCAAG TACTAGCGCC TCAGCCTCAG CATCAACGAG 4560 

TGCGTCCGCT TCAGCAAGTA CTAGTGCATC AGCTTCAGCA AGTACTAGCG CCTCAGCCTC 4620 

AGCGTCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCG TCAGCCTCAG CGTCGACAAG 4680 

TGCGTCGGCT TCAGCAAGTA CCTCAGCGTC TGAATCAGCA TCAACAAGTG CGTCGGCTTC 4740 

AGCATCAACG AGTGCATCAG CTTCAGCATC AACAAGTGCT TCAGCTTCAG CAAGTACCAG 4800 

TGCGTCGGCT TCAGCATCAA CGAGTGCTTC AGTCTCAGCG TCAACCAGTG CCTCTGAATC 4860 

CGCATCAACA AGTGCCTCGG CTTCAGCAAG CACCAGTGCT TCGGCTTCAG CGTCAACGAG 4920 

TGCGTCTGAG TCAGCATCAA CGAGTGCGTC AGCCTCAGCA AGCACATCAG CTTCTGAATC 4980 

TGCATCAACC AGTGCGTCAG CTTCCGCATC AACAAGCGCC TCGGCCTCAG CAAGTACAAG 5040 

TGCTTCAGCC TCAGCATCAA CCAGTGCATC AGCTTCAGCC TCAACAAGTG CTTCAGCCTC 5100 

AGCGTCAACC AGTGCCTCGG CTTCAGCAAG TACCAGTGCG TCAGCTTCAG CAAGCACAAG 5160 

TGCGTCAGCT TCAGCATCAA CGAGTGCTTC GGCTTCGGCA TCAACAAGTG CCTCAGCATC 5220 

AGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA TCAGCATCAG CATCAACCAG 5280 
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TGCATCAGCC TCAGCAAGTA TCTCAGCGTC TGAATCGGCA TCAACGAGTG CATCAGCATC 5340 

AGCATCAACG AGTGCATCGG CTTCAGCGTC AACCAGTGCA TCAGTCTCAG CAAGCACCAG 5400 

TGCGTCGGCT TCAGCATCAA CGAGTGCCTC AGCCTCAGCA AGTATCTCAG CGTCTGAATC 5460 

GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA TCGGCTTCAG CAAGCACCAG 5520 

TGCGTCGGCT TCAGCATCAA CCAGTGCCTC AGCCTCAGCA AGTATCTCAG CGTCTGAATC 5580 

GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA TCAGCmTCAG CATCAACGAG 5640 

TGCATCGGCT TCAGCAAGTA CCAGCGCCTC AGCTTCAGCA AGCACCAGTG CGTCAGCCTC 5700 

AGCAAGTACC AGCGCCTCAG CCTCAGCAAG CACCAGTGCC TCAGCTTCAG CAAGTACCAG 5760 

TGCGTCAGcT CAGCATCAAC AAGTGCTTCA GCTTCGGCCT CAACAAGTGC GTCAGCTTCA 5820 

GCATCAACGA GTGCGTCGGC TTCAGCAAGC ACCAGTGCCT CGGCCTCAGC AAGCACCAGT 5880 

GCTTCAGCTT CAGCATCAAC AAGTGCGTCA GCTTCAGCAA GTACATCAGT TTCAAATTCA 5940 

GCAAACCATT CGAACTCACA AGTTGGAAAT ACTTCTGGAT CGACAGGTAA ATCCCAAAAA 6000 

GAATTGCCTA ATACAGGTAC TGAGTCGTCA ATTGGATCTG TGTTACTTGG AGTTCTAGCA 6060 

GCTGTTACAG GTATTGGATT GGTTGCGAAA CGCCGTAAAC GTGATGAAGA AGAGTAAGAC 6120 

AACCTGTAAA GTTAGGCTAA ACTAACTCGC GCACATAAAT CAAGGAGAAA ATTGCTAGTG 6180 

GATGATAAAA TAACAGTCAT TGTACCAGTA TACAATGTGG AAAACTATCT GAGGAAGTGC 6240 

CTAGATAGTA TTATTACTCA AACATATAAA AATATTGAGA TTGTTGTCGT TAATGATGGT 6300 

TCTACGGATG CTTCAGGTGA AATTTGTAAA GAATTTTCAG AAATGGATCA CCGAATTCTC 6360 

TATATAGAAC AAGAAAATGC TGGTCTTTCT GCCGCACGAA ACACCGGTCT GAATAATATG 6420 

TCCGGAAATT ATGTGACCTT TGTGGACTCG GATGATTGGA TTGAGCAAGA TTATGTAGAA 6480 

ACTCTATATA AAAAAATAGT AGAGTATCAG GCTGATATTG CAGTTGGTAA TTATTATTCT 6540 

TTCAACGAAA GTGAAGGAAT GTTCTACTTT CATATATTGG GAGACTCCTA TTATGAGAAA 6600 

GTATATGATA ATGTTTCTAT CTTTGAGAAC TTGTATGAAA CTCAAGAAAT GAAGAGTTTT 6660 

GCTTTGATAT CTGCTTGGGG TAAACTCTAT AAGGCAAGAT TGTTTGAGCA GTTGCGCTTT 6720 

GACATAGGTA AATTAGGAGA AGATGGTTAC CTCAATCAAA AGGTATATTT ATTATCAGAA 6780 

AAGGTAATTT ATTTAAATAA AAGTCTTTAT GCTTATCGGA TTAGAAAAGG TAGTTTATCA 6840 

AGAGTTTGGA CAGAAAAGTG GATGCACGCT TTAGTTGATG CTATGTCTGA ACGTATTACG 6900 

CTACTAGCTA ATATGGGTTA TCCTCTAGAG AAACACTTGG CAGTTTATCG TCAGATGTTG 6960 

GAAGTCAGTC TCGCCAACGG TCAAGCTAGT GGTTTATCTG ACACAGCAAC GTATAAAGAG 7020 

TTTGAAATGA AACAAAGGCT TTTAAATCAG CTATCGAGAC AAGAGGAAAG TGAAAAGAAA 7080 
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GCCATTGTCC TCGCAGCAAA CTATGGCTAT GTAGACCAAG TTTTAACGAC AATCAAGTCT 7140 

ATTTGTTATC ATAATCGTTC GATTCGTTTT TATCTGATTC ATAGCGATTT TCCAAATGAA 7200 

TGGATTAAGC AATTAAATAA GCGCTTAGAG AAGTTTGACT CAGAAATTAT TAATTGTCGG 7260 

GTAACTTCTG AGCAAATTTC ATGTTATAAA TCGGATATTA GTTACACAGT CTTTTTACGC 7320 

TATTTCATAG CTGATTTCGT GCAAGAAGAC AAGGCCCTCT ACTTGGACTG TGATCTAGTT 7380 

GTAACGAAAA ATCTGGATGA CTTGTTTGCT ACAGACTTAC AAGATTATCC TTTGGCTGCT 7440 

GTTAGAGATT TTGGGGGCAG AGCTTATTTT GGTCAAGAAA TCTTTAATGC CGGTGTTCTC 7500 

TTGGTAAACA ATGCTTTTTG GAAAAAAGAG AATATGACCC AAAAATTAAT TGATGTAACC 7560 

AATGAATGGC ATGATAAGGT GGATCAGGCA GATCAGAGCA TCTTGAATAT GCTTTTTGAA 7620 

CATAAATGGT TGGAATTGGA CTTTGATTAT AATCATATTG TCATTCATAA ACAGTTTGCT 7680 

GATTATCAAT TGCCTGAGGG TCAGGATTAT CCTGCTATTA TTCACTATCT TTCTCATCGG 7740 

AAACCGTGGA AAGATTTGGC GGCCCAAACC TATCGTGAAG TTTGGTGGTA CTATCATGGG 7800 

CTTGAATGGA CAGAATTGGG ACAAAACCAT CATTTACATC CATTACAAAG ATCTCACATC 7860 

TATCCAATAA AGGAACCTTT CACTTGTCTA ATCTATACTG CCTCAGACCA TATTGAACAA 7920 

ATTGAGACAT TGGTTCAATC CTTGCCTGAT ATTCAGTTTA AGATAGCAGC TAGAGTAATA 7980 

GTTAGTGATC GATTGGCTCA GATGACAATT TATCCAAACG TGACTATATT TAACGGAATT 8040 

CACTATTTGG TAGATGTCGA TAATGAATTG GTAGAAACCA GTCAAGTACT TTTAGATATT 8100 

AATCATGGCG AAAAGACAGA AGAAATTCTC GATCAATTTG CTAATCTTGG CAAGCCTATC 8160 

TTATCCTTTG AAAATACTAA AACCTATGAA GTAGGTCAGG AGGCATATGC TGTTGACCAA 8220 

GTTCAAGCAA TGATTGAAAA ATTGAGAGAA ATAAGCAAAT GAAGAAAAAT CATTTAGTAG 8280 

GAGATGCTCT GATTTTGACG GTTAGTGATC AGATTGAAGA GTTGGATTAT TTTTTATAAA 8340 

ATTTCTCCGT TCATCATATA TGAAAGTTGT TCAAACATCA GAGTGCTTTA TAAAATATAA 8400 

ATAGACCTAA AGATATTTAA TATGAACTGC ACCCCAAAAG TTAGACAGAA AAAATCTAAC 8460 

TTTTTGGsGT CAGTACAATA TTAGGGTGTG ATTAATTATC TTTTTAGGTG AAAATGATTC 8520 

TATATTATAG CTGTTTGATA CGAAATTTAT TATAAGGAAA TTATGTTAAT GAATACAAAA 8580 

TCTATAGTTT TTAATGCAGA TAATGATTAT GTAGATAAAT TAGAAACTGC AATTAAATCT 8640 

ATTTGTTGTT ATAATAATTG TTTAAAATTT TATGTATTTA ATGATGATAT TGCGTCAGAG 8700 

TGGTTTTTGA TGATGAATAA GCGATTGAAG ACTATACAAT CTGAAATCGT TAATGTAAAG 8760 

ATTGTAGATC ATGTTCTTAA AAAGTTTCAT TTACCGTTAA AGAATTTAAG TTATGCCACT 8820 
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TTCTTTCGTT ATTTTATACC TAATTTTGTC AAAGAAAGTC GTGCTTTATA CCTAGATTCT 8880 

GACATCATTG TTACAGGAAG TTTAGACTAT TTATTTGATA TAGAACTAGA TGGTTATGCC 8940 

TTGGCAGCAG TAGAAGATTC TTTTGGTGAT GTTCCTTCTA CCAATTTTAA CTCCGGAATG 9000 

TTATTAGTTA ATGTAGATAC TTGGAGAGAT GAAGATGCTT GTTCGAAACT GTTAGAACTG 9060 

ACCAATCAAT ATCATGAAAC AGCATATGGA GATCAAGGAA TTTTAAATAT GTTATTCCAT 9120 

GATAGATGGA AAAGATTAGA CCGAAATTTT AATTTTATGG TGGGGATGGA TAGCGTCGCA 9180 

CACATAGAAG GAAATCATAA ATGGTATGAG ATTTCTGAGT TGAAAAATGG AGATTTACCT 9240 

AGTGTTATAC ATTATACTGG GGTAAAACCT TGGGAAATAA TTTCCAATAA TCGCTTTAGA 9300 

GAAGTTTGGT GGTTTTATAA TCTGTTAGAA TGGTCTGATA TTTTATTGAG AAAAGACATT 9360 

ATTAGTCGTA GTTTCGAAGA ACTTGTATAC AGTCCTAAAG CTCATACAGC AATTTTTACA 9420 

GCTAGTTGTG AGATGGAGCA TGTAGAATAT TTGATAGAAA ATTTACCAGA GGTACATTTT 9480 

TCTATACTAG CACATACATA TTTTGCGTCT AGTGTCGTTG CTTTATTAAG ATATAGCAAT 9540 

GTTACGATTT ATCCTTGTTT TTCTCCATTT GATTATCGAA AAATTTTGGA TAATTTAGAT 9600 

TTTTATTTAG ATATTAATCA TTATAAAGAA GTGGATAATA TTGTATCCGT TGTTCAACAA 9660 

CTATCTAAAC CAATTTTTAC CTTTGAAAAT ACTAGTCATG ATATAGGCAA TCAAACTAAT 9720 

ATATTTTCTT CAACCGAACC AAACAAAATG GTAGAGGCTA TTAGACAATT TATAGGAGAA 9780 

TAAGTTTATG GCAGACGAAC TAATTAGTAT TGTAGTTCCA ATCTACAACG TTGAGAATTA 9840 

TTTGCGAATG TGTTTGGATA GCATTCAGAA TCAGACGTAT CAAAATTTTG AGTGTTTATT 9900 

AATCAATGAT GGCTCTCCAG ATCATTCATC CAAAATATGT GAAGAATTTG TAGAGAAAGA 9960 

TTCTCGTTTC AAATATTTTG AGAAAGCAAA CGGCGGTCTT TCATCAGCTC GTAACCTAGG 10020 

TATTGAATGT TCGGGGGGGG GCGTACATTA CTTTTGTAGA CTCTGATGAT TGGTTGGAAC 10080 

ATGATGCTTT AGACCGATTA TATGGTGCTT TGAAAAAGGA AAACGCAGAT ATTAGTATCG 10140 

GGCGTTATAA TTCTTATGAT GAAACACGCT ATGTGTATAT GACTTATGTT ACGGATCCAG 10200 

ATGATTCTCT AGAAGTGATA GAAGGTAAAG CAATTATGGA TAGGGAAGGT GTCGAAGAAG 10260 

TCAGAAATGG GAACTGGACT GTAGCTGTCT TGAAGTTATT CAAGAGAGAG TTACTACAAG 10320 

ATTTACCATT TCCTATAGGA AAAATTGCAG AGGATACTTA CTGGACATGG AAGGTACTTC 10380 

TAAGAGCTTC GAGGATAGTC TATTTGAATC GTTGTGTTTA CTGGTACCGT GTTGGTTTAT 10440 

CTGATACTTT ATCGAATACA TGGAGTGAAA AGCGTATGTA TGATGAAATT GGGGCTAGGG 10500 

AAGAAAAGAT AGCTATTTTA GCAAGTTCAG ACTATGACTT GACCAATCAT ATTTTGATTT 10560 

ATAAAAATAG ATTACAAAGA GTGATAGCAA AATTAGAAGA ACAAAATATG CAGTTCACAG 10620 
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AGATTTACAG AAGAATGATG GAAAAATTGT CTTTACTTCC GTAGATAGTA ATAAAAAATG 10680 

AGATAGCGTA ATATGAAACT ACATTTAACA AATTTATACG GCATGGCTGG TGATAGTACG 10740 

GTTATCTTAG CTCAAAATGC TGTTCAAAAG ATAGCTAGTC AACTGGGATT TAGAGAGGTT 10800 

GGTATTTATT TTTACAACAT TGCTTCAGAT AGTCCTTCTG AAATGAATAA GCGTCTGGAT 10860 

GGTATTATGG CCAGTATCTC TATTGGGGAT ATTTTAGTCT TTCAGTCTCC AACCTGGAAT 10920 

GGTTTTGAAT TTGATCGTCT CTTGTTTGAT AAGCTAAAGG ATATGCAGGT GAAAATTATT 10980 

TGCTTTATCC ATGATGTTGT TCCCCTCATG TTTGATAGTA ACTATTATCT CATGAAAGAT 11040 

TATCTGTATA TGTATAATCT ATCAGATGTT TTGATAGTGC CGTCAGAGAG AATGAAAACA 11100 

CGCCTGATGG AAGAAGGATT GACGACTAAG AAGATTCTTG TTCAAGGGAT GTGGGATCAT 11160 

CCTCATGATT TATCCTTATA CACCCCTGCT TTTAAAAAAG AACTTTTTTT TGCTGGAAGT 11220 

TTAGAGCGTT TTCCAGACTT ACAAAATTGG TCTCAAGATA CGCCTTTGAG AGTATTTTCA 11280 

AATAAAGGGG AAGCTAGTTC TAGTGCTAGA AGTCTCAGCA TCGAAGGATG GAAAAAAGAT 11340 

GAGGAATTGT TGCTAGAATT ATCAAAGGGT GGATTTGGCC TTGTCTGGGG AACCcATCAA 11400 

AATGAGGGAG AAAGTAACCA ATACTATACC TTGAATATAT CTCATAAGGT GAGTACCTAT 114 60 

CTAACAGCGG GCATTCCAGT CATTGTACCA AGTAGCTTGT CAACTGCTAA ATTTATAGTA 11520 

GATCAAGGCT TGGGCTTTAT GGCGGATAGT CTGGAAGAGG TTCATGAGAT AGTTGATAAA 11580 

ATGAATCTAC AAGAATATCA AGAAATGACG AATCGTATCA AGACCTTTAG CTATTTGTTA 11640 

AAAGAGGGCT ATTTCACTAA AAAGTTATTG GTAGATGCAA TCTATCACTT GGGAATTGAT 11700 

TAAGGGAATG AAATGAACAA AACAATTGTA CTAGCAGGGG ATCGCAATTA CACCAGGCAG 11760. 

TTAGAAACAA CGATAAAATC TATTTTATAC CACAATCGAG ATGTTAAGAT TTATATTTTG 11820 

AATCAAGATA TCATGCCAGA TTGGTTTCGC AAACCACGAA AAATAGCTCG CATGTTAGGT 11880 

AGTGAGATTA TCGATGTTAA ACTACCTGAA CAAACTGTCT TTCAAGATTG GGAAAAGCAA 11940 

GATCACATTA GTAGCATTAC TTATGCTAGA TATTTTATTG CAGATTATAT CCAAGAAGAT 12000 

AAGGTTTTAT ATTTAGACAG TGATTTGATT GTAAATACTT CTTTAGAGAA ATTATTTAGT 12060 

ATTTGTTTAG AAGAAAAATC ACTCGCAGCA GTTAAAGATA CAGATGGAAT TACATTTAAT 12120 

GCAGGTGTTT TATTAATCAA CAATAAAAAA TGGCGTCAAG AGAAATTAAA AGAACGACTA 12180 

ATTGAACAGA GCATTGTTAC AATGAAGGAA GTTGAAGAAG GCCGTTTCGA GCATTTTAAT 12240 

GGTGATCAAA CGATTTTTAA TCAGGTCTTG CAAGATGATT GGTTAGAACT AGGTCGAGCT 12300 

TATAATTTAC AAGTAGGGCA TGATATTGTG GCTTTGTATA ACAATTGGCA GGAACATCTG 12360 
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GCTTTTAATG ATAAACCAGT GGTGATTCAT TTTACGACCT ACAGAAAACC CTGGACTACC 12420 

TTGACAGCCA ATCGTTATCG TGATTTATGG TGGGAATTCC ATGATTTGGA GTGGAGTCAG 12480 

ATTTTACAAC ACCATATGGG AGAATTTGAA CTAATATCGC CTCTAGATAA GGAATTTTCT 12540 

TGCTTAACCT TAACGAATTC CCAAGATTTA GAAGGAATAG AAGAGCTAGT TACAGCTCTA 12600 

CCTGAGGTGG TATTTCATAT CGCAGCTTGG ACGGATATGG GAGATAAATT AAAAAAATTA 12660 

GCTGTATATA ATAATGTGAG ATTGCATCCA CAAATTGTTC CACCGGTCTT AGATAAGCTG 12720 

AAAAAGTCAA CAAATCTATA TTTGGATATC AATCATGGTA GTGCAGATGA GAACTTTTTA 12780 

AAATCTTTGC AAGAACAAGA AAAAACGCTA CTAGCTTTTC AATCGACTCA GCACGGAGAG 12840 

TTAGGACAAA TCGTTTTCGA AAATGGGAAA GTTTCCTTTA TGATTGATAC GATTAAAGAT 12900 

TTTAAGAAAA ACGGACATCT TACCTGTTTT CGACAACTTC CAAGTTTAAC TTGTTTAACG 12960 

TTTACGGCTT CTCAGTATAT CGAACAATTG GATTACTTGG CTGGACAGTT GCCAAATGTT 13020 

GTTTTTCAAA TTGCTGCTTG GACAGCTATG GGGCCAAAAT TATATGATTT GTCTAATCGT 13080 

TATCCTAATA TTCAGCTCTA TCCGGCAATT TCTAGAGATA AGCTAGACGA GTTGAAGGAG 13140 

AAGATGGATG CTTATTTAGA TATCAACCTA CTGACTTCAA CATCCGATAT CGTTGCAGAA 13200 

ATGGCTCATC TATCTAAACC TATACTAGCC TTTTATAAAT CTCAAAATGG GAATAATGGC 13260 

CAAAGGTTGT ATTCAAGTGA ACATCCTGAA CGAATGTTGG CTGATTTGCA AAAATTGATA 13320 

ACTAAGGATA TGCTAGAAAA ACCGCTTGAT ATAATCCAGG TGAAAGGGAT AGATGAAACC 13380 

TTGGATTATA TTATTGAACA CAACTCTTCT TTAGTTCGTT TTGGAGATGG GGAAATCAAT 13440 

ATGCTTGCAG GGCATTCAAT TCCCTACCAG GATTATGATG AAGAGTTGGT TTCAATCATG 13500 

AGGGACATTA TCGGCCAAGA AAGTCGAGAA GATTTAGTAG TGTGCCTTCC TGATGCTTTT 13560 

ACAGATCGTT TTAGGTTTAC ATCGTGGGCG ATTCCATTTT GGAAAGATCA CATGGATCAT 13620 

TATATGGATT TTTACAGAGA GTTATGCAGT GATTCATGGT ATGGCTCAAC CTTTGTATCT 13680 

CGCCCTTATA TCGATTTTGA AGACAAGAGT CAAGCTAAAG CTCAATTTGA AAAATTGAAA 13740 

AGCATTTGGG AAAACCGTGA CTTACTGATA GTCGAAGGTG CGACTTCTCG TTCAGGTGTC 13800 

GGAAATGATT TATTCGATGA GGCAAATTCT ATTAAGCGAA TTATCTGTCC TTCTCATAGT 13860 

GCCTTTTCTA GAGTTCATGA ACTTGAACAA GAAATTGAAA AGTATGCTGG TGGTCGCTTG 13920 

ATTTTATGTA TGCTTGGACC TACAGCAAAA GTTCTGAGTT ATAATCTATG CCAGATGGGC 13980 

TATCAAGTTT TGGATGTAGG CCATATTGAC TCAGAGTATG AATGGATGAA AATGGGAGCT 14040 

AAAACTAAGG TTAAATTTTC TCATAAACAT ACTGCAGAAC ATAATTTCGA CCAAGATATT 14100 

GAATTTATTG ATGATGAAAC CTATAACAGT CAGATTGTTG CACGAATATT AAACTAGACT 14160 
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ATTTAAAATA AATGATAAGG ATTTAAAATG AGAAATACCA AACGCGCTGT AGTATTTGCA 14220 

GGTGATTACG CTTATATTCG ACAAATCGAA ACGGCGATGA AGTCACTCTG TAGACACAAT 14280 

AGTCATTTGA AAATTTATCT GCTAAATCAG GACATTCCTC AGGAATGGTT TAGTCAAATA 14340 

AGAATATATT TACAAGAGAT GGGGGGCGAC TTGATTGACT GCAAGTTAAT TGGCTCACAG 14400 

TTTCAAATGA ATTGGTCTAA TAAATTACCT CATATCAATC ATATGACATT TGCACGCTAT 14460 

TTTATTCCAG ATTTTGTAAC AGAAGATAAA GTTCTCTATC TAGATAGTGA TTTGATTGTG 14520 

ACTGGTGATT TGACCGATTT GTTTGAATTA GACTTAGGTG AAAATTATTT GGCAGCAGCT 14580 

CGTTCTTGCT TTGGAGCAGG AGTCGGCTTC AATGCTGGTG TTCTCTTGAT TAACAACAAA 14640 

AAATGGGGAT CTGAAACTAT TCGACAAAAA TTGATTGACT TAACAGAAAA AGAACATGAG 14700 

AATGTGGAAG AAGGAGACCA GTCAATTTTG AATATGTTGT TTAAAGATCA ATATAGTTCC 147 60 

CTTGAAGATC AATATAATTT TCAAATAGGA TATGATTATG GGGCGGCAAC CTTTAAACAT 14820 

CAATTCATTT TTGATATTCC GCTCGAACCA CTGCCACTAA TTTTACACTA TATTTCTCAG 14880 

GATAAGCCTT GGAATCAATT TTCTGTTGGA CGTCTAAGAG AAGTTTGGTG GGAATACTCT 14940 

TTGATGGATT GGTCTGTTAT TTTAAATGAA TGGTTTTCAA AGAGTGTGAA GTACCCTAGT 15000 

AAATCACAAA TATTTAAGTT GCAATGTGTT AATTTAACGA ATTCTTGGTG TGTCGAGAAA 15060 

ATCGATTATT TGGCGGAGCA ATTGCCAGAA GTTCATTTTC ATATTGTTGC TTATACAAAT 15120 

ATGGCAAATG AACTACTAGC TTTAACGCGT TTTCCTAATG TTACCGTATA TCCAAATTCC 15180 

TTACCAATGT TATTGGAACA AATAGTAATA GCTTCAGATT TGTATTTGGA TTTGAATCAT 15240 

GATCGAAAAT TAGAAGATGC ATATGAGTTT GTGCTTAAGT ACAAAAAACC AATGATAGCT 15300 

TTCGACAATA CTTGCTCTGA AAATCTTTCT GAGATTTCAT ATGAAGGTAT CTATCCAAGC 15360 

TCCATTCCGA AAAAAATGGT TGCAGCAATC AGATCTTACA TGAGGTAGAG AACAGTATGA 15420 

GAAAATCAAT AGTATTAGCG GCAGATAATG CCTATCTTAT TCCTTTAGAG ACGACTATAA 15480 

AGTCTGTATT GTATCACAAT AGAGATGTTG ATTTTTATAT TCTCAACAGT GATATAGCTC 15540 

CTGAATGGTT TAAATTATTG GGGAGAAAAA TGGAAGTTGT GAATTCTACA ATTCGCAGTG 15600 

TACACATTGA TAAAGAACTT TTTGAAAGCT ATAAAACAGG ACCTCATATA AATTATGCTT 15660 

CTTACTTTAG ATTTTTTGCG ACAGAAGTGG TTGAATCTGA TAGGGTATTG TATCTGGATT 15720 

CCGATATCAT TGTAACTGGG GAACTAGCTA CTTTGTTTGA GATAGATCTC AAAGGATATT 15780 

CAATTGGTGC TGTTGATGAT GTCTATGCCT ATGAAGGACG AAAATCTGGA TTTAATACTG 15840 

GTATGTTACT AATGGATGTT GCAAAGTGGA AAGAACATTC TATTGTCAAT AGTTTATTGG 15900 
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AATTAGCGGC CGAGCAGAAT CAAGTTGTTC ATCTTGGGGA TCAGAGTATT TTAAATATTT 15960 

ATTTTGAGGA TAATTGGCTA GCCTTAGATA AAACATATAA TTATATGGTG GGTATTGATA 16020 

TTTATCACCT TGCTCAAGAA TGTGAACGTC TAGATGACAA TCCACCTACA ATTGTTCACT 16080 

ATGCTAGTCA TGATAAACCT TGGAATACAT ATAGTATATC TAGACTACGT GAATTATGGT 16140 

GGGTTTATAG AGATTTGGAT TGGTCAGAGA TTGCTTTTCA ACGTTCCGAT TTAAATTATT 16200 

TTGAAAGAAG CAATCAGTCT AAAAAACAAG TGATGCTTGT GACATGGAGT GCAGATATAA 16260 

AACATTTAGA GTATTTAGTA CAACGGTTAC CTGATTGGCA TTTTCATTTG GCTGCACCGT 16320 

GTGATTGTTC TGAGGAGCTG ACCTCTCTAT CACAGTATAC GAATGTAACA GTATATCAAA 16380 

ATGTATTACA TAGTAGAATT GATTGGCTAT TGGACGATTC TATAGTTTAT TTAGATATTA 16440 

ATACAGGTGG AGAGGTTTTT AATGTAGTTA CAAGGGCACA AGAAAGTGGC AAGAAAATCT 16500 

TCGCTTTTGA TATCACACGT AAAAGTATGG ATGATGGACT CTATGACGGT ATTTTTTCTG 16560 

TGGAGAGACC AGATGATTTA GTGGATAGAA TGAAGAATAT AGAGATAGAG TAATGAGTGA 16620 

ATTAATTAGT GTTGTGGTAC CGATATACAA TACGGGAAAA TATTTAGTGG AGTGTGTCGA 16680 

GCATATTCTG AAGCAAACCT ATCAAAATAT AGAAATTATT TTAGTTGATG ACGGTTCTAC 16740 

GGATAATTCT GGGGAAATTT GTGATGCTTT TATGATGCAA GATAATCGTG TGCGAGTATT 16800 

GCATCAAGAA AATAAGGGGG GGGCAGCACA AGCTAAAAAT ATGGGGATTA GTGTAGCTAA 16860 

GGGAGAGTAC ATCACGATTG TTGATTCAGA TGATATCGTA AAAGAAAATA TGATTGAAAC 16920 

TCTTTATCAG CAAGTCCAAG AAAAGGATGC AGATGTTGTT ATAGGGAATT ACTATAATTA 16980 

TGACGAAAGT GACGGGAATT TTTATTTTTA TGTAACAGGG CAAGATTTTT GCGTCGAAGA 17040 

ATTAGCTATA CAAGAAATTA TGAACCGTCA AGCAGGAGAT TGGAAATTCA ATAGCTCGGC 17100 

CTTTATATTG CCGACATTTA AGTTGATTAA AAAAGAATTA TTCAATGAAG TTCACTTTTC 17160 

AAATGGTCGC CGCTTTGATG ATGAAGCAAC TATGCATCGC TTTTATCTTT TAGCCTCTAA 17220 

AATCGTCTTT ATAAACGATA ATCTCTATCT GTATAGAAGA CGTTCAGGAA GCATCATGAG 17280 

AACGGAATTT GATCTTTCCT GGGCAAGAGA TATTGTTGAA GTGTTTTCTA AGAAAATATC 17340 

GGATTGTGTC TTGGCTGGTT TGGATGTCTC CGTTCTGCGT ATTCGATTTG TCAATCTTTT 17400 

AAAAGATTAT AAGCAAACTT TAGAATACCA TCAATTAACA GATACTGAGG AATATAAAGA 17460 

TATTTGTTTC AGATTAAAGT TGTTTTTTGA TGCAGAACAA AGAAATGGTA AAAGTTGAAA 17 520 

TAAAAGAATT GTTATTTACC ATATCACAAA CAATGAAGGT GAGGGGAGTG TTTTATGACT 17580 

AAGATTTATT CGTCAATAGC AGTAAAAAAA GGACTATTTA CCTCATTTCT ACTGTTTATC 17640 

TATGTATTGG GAAGTCGTAT TATTCTCCCT TTTGTTGACC TAAATACTAA AGATTTTTTA 17700 
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GGAGGTTCAA CAGCCTATCT AGCCTTCTCA GCCGCCCTAA CAGGTGGGAA TCTAAGAAGT 17760 

TTATCAATTT TTTCTGTTGG ATTATCCCCT TGGATGTCCG CCATGATTTT ATGGCAGATG 17820 

TTTTCTTTTT CTAAACGGTT GGGTTTAACA TCTACGTCTA TAGAAATACA AGATCGCCGT 17880 

AAAATGTACC TGACCTTGCT AATTGCTGTG ATTCAATCCT TGGCAGTTAG CTTGAGACTG 17940 

CCAGTACAAT CCTCCTATTC TGCAATATTG GTTGTTCTAA TGAATACAAT ATTGCTGATA 18000 

GCAGGAACAT TTTTTCTTGT TTGGTTGTCA GATTTAAATG CGAGTATGGG GATTGGAGGT 18060 

TCTATTGTAA TCCTCCTATC CAGTATGGTT TTAAATATTC CTCAGGATGT TTTGGAAACA 18120 

TTTCAGACAG TACACATTCC AACAGGGATT ATTGTGTTAC TTGCTTTATT AACCCTTGTC 18180 

TTTTCTTATT TACTTGCCCT TATGTATCGA GCTCGCTATT TGGTTCCTGT TAATAAAATT 18240 

GGCTTACACA ATCGATTTAA ACGCTATTCT TATCTCGAAA TCATGTTGAA TCCTGCAGGT 18300 

GGGATGCCTT ATATGTATGT GATGAGTTTT CTTAGTGTAC CAGCTTATTT GTTCATCTTG 18360 

TTGGGATTTA TTTTCCCTAA TCATTCAGGG TTAGCGGCTT TATCAAAGGA ATTTATGGTT 18420 

GGAAAGCCTT TGTGGGTCTA TGTTTATATT TCGGTCTTAT TTTTATTTAG TATCATTTTT 18480 

GCTTTTGTTA CGATGAATGG AGAAGAGATT GCAGACCGTA TGAAAAAATC TGGAGAATAC 18540 

ATTTATGGTA TTTATCCAGG TGCGGATACT AGTCGATTTA TTAATCGATT GGTCCTTCGT 18600 

TTCTCAGTCA TAGGTGGTCT CTTTAATGTG ATTATGGCAG GTGGTCCCAT GCTTTTTGTT 18660 

TTGTTTGATG AAAAGTTATT ACGATTGGCA ATGATTCCAG GCTTATTTAT GATGTTCGGG 18720 

GGCATGATTT TTACGATTAG AGACGAGGTC AAGGCTTTAA GGCTAAATGA GACCTATAGA 18780 

CCTTTGATTT AGGAGACTTT TATGTATTAT TTTATTCCAG CTTGGTATGG GTCAGAAAGA 18840 

ACATGGCATG CAGATATCAC TCCATGGTAT TTTTCTCATT TTCGTCTAGA GTTTGATGAT 18900 

ACCTTTCACC AGATTCGGCT CTTTCAAGAG CAAGATATAG ATTCTCGTCT ATTAGTATTA 18960 

GCTTACCAGC CTCATCTACG TTATTTTTTA TATAGACATG GTGTGTTAGA AATGGATACT 19020 

TATTCCGTTT TTGATGTTAT GCAAGATTTT CATAATCTCC ATACCCAAGT TTTAAGCATT 19080 

AGAGATATTG AGTGGGATGA TGACTGTGAA TTTATTTATA GTCCCTTTAC GATTATCGTT 19140 

CAAAAAAATG GGAAGAAATT TGCTAAGGTT GAACATGGAG TTGAAGGCTT CATCAGTGAT 19200 

ATACAGTATT TTGAACCAAA TGGTCAAATA CATATGCACC ATATCGTGGA TGATCGTGGG 19260 

TTTGTATCGA GCATTATCTT TTTTGAAGAT GGGCAAGCAG CCTATCAAGA ATATCTGAAC 19320 

CTCAAGGGAG AGTGGCAATT CAGAGAGCGT TTAAAAGAAG GAGGACAGGT AGAAGTCAAT 19380 

CCAATTTTGG GTTATCGCTT TAAAATGCTT ACCTATCAAA ATATGGGAGA TCTGGTGGCA 19440 
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GAATTTTTTG AGAATTATCT GCAAACGTAT GTGAAGGATC AGGATATTTT TATGCTTCCT 19500 

TCTCATTCTC ATCATGACCA GTTGGTACTA GATCGTTTAC CTAGTACTAA TCCTAAACTG 19560 

TTGAGTCTGT TCATTGGACG TAATCCTCAA GATACCTTTA GGGATTTAGA TGTAACTTTT 1962 0 

GAAAAATCGG ATTTGATTTT GGTGGATAGA GAGGATAGTT TACGATTGTT GCAGGAGTTG 19680 

TATCCTGAAC GAATGCATCA ATGTTATCAT TTATCATCTT TTGACACCCG ATTACGATTG 19740 

GGACGAAGCC AAACTAAGAA AGAATCCATC ATTTATTTTC AACTGGATTT TGAGCAGGGG 19800 

ATTGATAATC AAGCTCTGCT TCAAGTCTTG TCCTTTGTCG CTGAAAATAA GGATACTGAG 19860 

GTGATTTTTG GAGCCTTTGC TGCTAGTCAG GAGCAAATGA ATGAGGTTGA AGGGATTGTT 19920 

GAGTCTTTCA TCCAAGAAAA CATTCAATCC GAAAATCTGG GAAAGGCGAT TGATTATGGT 19980 

GATGCAGAAA ATCCTCTGGA AGAAAATCAA CACCAGGACT TACGCTTACA GTTTGTTAAC 20040 

TTGAATGATG AGTTAGATTT GATAAAAACA CTAGAATTTG TCCGTTTGAT TGTGGATTTA 20100 

AATAGACATC CTCATCTCTA CACACAGATT GCTGGGATTA GTGCAGGAAT TCCTCAAATC 20160 

AACCTAGTTG AAACCGTCTA TGTTGAACAT TTAAAAAATG GTTATTTGTT AGCAGATGTT 20220 

ACAGAATTTT CTAAGGCTGC ACATTATTAC ACAGATAGGT TGAAGGAGTG GAATGAGTCC 20280 

TTGATATATT CAATTGATAA GATTAAGGAG CACACAGGAC AACAATTTCT TGGAAAATTA 20340 

GAGAAATGGA TAGAGGAGGT TAAAAATGTC AAAGGAACTT AATATTTTAC AGATAGGACT 20400 

TGCCAATTGG GAAAATCACT ATGACATACC TGAAAATATG AGTTGGTATT ATTTTTACCC 20460 

AAACTCATCA AAAGCCCTTC GTGAAATAAT TGAAAAAGAG GATATTAACC GTTTTCATGC 20520 

AGTTTTAATA GAAGATGGTC AGTATTCCAG AGACTTATTT TCCTATGTAA AATATTTTGA 20580 

ACCTTATACT TTATTTTATA ACCAGAATCT ACAAATAAAT GATAGAGAGG TTGTGGATTT 20640 

TCTAAAAAAA CGATGTGCAC AAGCAATTGA TTTTTTAAGT CCCCAACAAC TAATCAATGA 20700 

TTTAAGTAAA TCTCTTTTTG GCGGTGGGTA TGGTGATAAA CTCTTTCCTC CGACGATACA 20760 

AGTCAATCCA AATTTTACAG GAGCTATTTC TTATCAAGGA TTGGATTATG TCAGTTTGGA 20820 

AGGTGAGTTT GGGCAAGATT TTGCCCAGCT TGCCTATTGG GCTTATAATA TTATGGTGCA 20880 

AAAAACACTC CCTATTGAGT TGTGGCTTGA ATATGAGAAG GAAGGCAATT GTGACTTTCG 20940 

TTTAGTAATC CGTAAAATGT GGAGTGGGTC TGTTGATGAT TTCTTTGAAG AAGTAATAGT 21000 

ATCTGAAAAA GACTTGGAGC AAGCACTTTT TATGGATAGT CGAGACGGAG ACTACTTTCT 21060 

CTCGATATCT GTTGAAGCAA GAGGTCGTGG AACTATCAAA CTAGGTAATC TTCACCAACG 21120 

ATGGAGTCGA AAACAATTTG GTAAGTTTGT ACTTGGTGGA AATATCCTAC ATGATTCCAA 21180 

GCGTGATGAA ATAAACTATT TCTTCCATCC AGGTGATTTT AAACCGCCTT TGACTGTCTA 21240 
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TTTTGCAGGT TATCGACCTG CAGAAGGATT CGAGGGTTAC TTTATGATGA AAACTCTTGG 21300 

ATGTCCCTTC ATTTTATTTT CTGATCCACG TTTAGAGGGG GGAGCTTTTT ATCTCGGAAC 21360 

GGATGAGCTA GAGGGAAAAG TAAAGGATAC GATCACTCAC TATCTTGATT ATTTAGGCTT 21420 

TGATCATAAG GATTTGATTT TATCAGGTCT TTCTATGGGA ACGTTTCCGG CTCTCTATTA 21480 

TGGTGCTTCT TTTGAACCCC ATGCCATCAT AGTTGGTAAG CCCTTGGCTA ATTTAGGAAC 21540 

TATAGCTAGT CGTGGACGTT TGGACGCACC GGGTGTCTCT AACTTAGCTT TTGATTGTTT 21600 

AATTCATCAT ACAGGTGGGA CAAGTTCTCA AGATATGACG GAGTTGGATC AGGGTTTTTG 21660 

GAAAATTTTT AAACAAGCAA ATTTTTCAAA GACAACCTTT GGTTTATCCT ATATGAAAGA 21720 

TGAAGAAATG GATCCACAAG CCTATGAACA ATTAGTGTCT TATCTGTGTA ATACAGGTGC 21780 

GAAGATTTTA TCTAAAGGAA CTGCTGGACG ACACAATGAT GATACAGATA CCAATATTTC 21840 

TTGGTTTTTG CACTTTTATA GAATGGTCTT AGAGACTGGT TTTGGAAGGG AGAAAAGATG 21900 

ATTATTACAC AGAGACAGTC TATTCATTGG GGAGAAGTTG GTGGGACTTA TATGTATGGA 21960 

ACAACTGTAT CTTATTACCC TGACAAAAGT GTTCGTCTGT ATAATCCTCT ATTGCCATCT 22020 

GGTGAGATTC TAAAGACTTG GTTTTCTAGT GTCAATTACC AGGCTGCACG AACCCAACCT 22080 

CAGCTTCCCT TATTAAAAAG AAAGCAGGAG TATCAACTAT CACTGGTTTT TGACTGTCAG 22140 

CCTGAAAATG GAGTTTATAC CAAGATAACT TTTTTTGACC GCTATGGTGA TATTTTAGAA 22200 

AAAAAGGTAG AAAAAGTGAA AGATTTCATA TTTACTTATC CAGAAGATAG TTATACTTAT 22260 

CGAGTTTCTC TTTTAAGTGC TGGATTTGAG TCCTTAACTT TTTATCATTT TTCTATCAAG 22320 

GAGATCAGAA GTGTTTAGAC GTTTAGGTCA AGATTTCCAG CTTAGGAAAG TGAAAAAGAT 22380 

TTTAAAGCAG ATTAATGCCC TGAAAGGCAA GATGTCCTCT CTTTCGGATC AAGAATTAGT 22440 

AGCTAAAACA GTAGAGTTTC GTCAGCGTCT TTCCGAGGGA GAAAGTCTAG ACGATATTTT 22500 

GGTTGAAGCT TTTGCTGTGG TGCGTGAAGC AGATAAGCGG ATTTTAGGGA TGTTTCCTTA 22560 

TGATGTTCAA GTCATGGGAG CTATTGTCAT GCACTATGGA AATGTTGCTG AGATGAATAC 22620 

GGGGGAAGGT AAGACCTTGA CAGCTACCAT GCCTGTCTAT TTGAACGCTT TTTCAGGAGA 22680 

AGGAGTGATG GTTGTGACTC CTAATGAGTA TTTATCAAAG CGTGATGCCG AGGAAATGGG 22740 

TCAAGTTTAT CGTTTTCTAG GATTGACCAT TGGTGTACCA TTTACGGAAG ATCCAAAGAA 22800 

GGAGATGAAA GCTGAAGAAA AGAAG CTTAT CTATGCTTCG GATATCATCT ACACAACCAA 22860 

TAGTAATTTA GGTTTTGATT ATCTAAATGA TAACCTAGCC TCGAATGAAG AAGGTAAGTT 22920 

TTTACGACCG TTTAACTATG TGATTATTGA TGAAATTGAT GATATCTTGC TTGATAGTGC 22980 
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TATCGTGCTG GTGTTTTTCG GAAATTAAAT CTGTCCTCTA AGTTTATCTT TACAGATATG 24840 

ATTTTAGCCG ATAATATTCA GCACTTAACA GCCAATATTG GTTTTGATGA TAATCAGGTT 24900 

ATCTGGCTTT ATAATCATTT CACAGATATC AAAATTGCAC CTACTAGCGT GACAGTGGAT 24960 

GATGTCTTGG CTTACTTTGG TGGTGAAGAA AGTCACAGAG AAAAAAATGG CAAGGTTTTA 25020 

CGTGTATTCT TTTTTGACCA AGATAAGTTT GTAACCTGTT ATTTGGTTGA TGAGAACAAG 25080 

GACTTGGTTC AACATGCCGA GTATGTTTTT AAGGGAAACC TGATTCGGAA GGATTACTTT 25140 

TCTTATACGC GTTATTGTAG CGAGTATTTT GCTCCCAAGG ACAATGTTGC AGTCTTATAC 25200 

CAACGAACTT TTTATAATGA AGACGGGACT CCAGTCTATG ATATCTTGAT GAATCAAGGG 25260 

AAGGAAGAAG TTTATCATTT CAAGGATAAG ATTTTCTATG GAAAGCAAGC TTTTGTGCGT 25320 

GCCTTTATGA AATCTTTGAA TTTGAATAAG TCTGATTTGG TCATTCTCGA TAGGGAGACA 25380 

GGTATTGGAC AGGTTGTGTT TGAGGAAGCA CAGACAGCAC ATCTAGCGGT AGTTGTTCAT 25440 

GCGGAGCATT ATAGTGAAAA TGCTACAAAT GAGGACTATA TCCTTTGGAA TAACTATTAT 25500 

GACTATCAGT TTACCAATGC AGATAAGGTT GACTTCTTTA TCGTGTCTAC TGATAGACAA 25560 

AATGAAGTTC TACAAGAGCA ATTTGCCAAA TATACTCAGC ATCAGCCAAA GATTGTTACC 25620 

ATTCCTGTAG GCAGTATTGA TTCCTTGACA GATTCAAGTC AAGGGCGCAA ACCATTTTCA 25680 

TTGATTACGG CTTCACGTCT TGCCAAAGAA AAGCACATTG ATTGGCTTGT GAAAGCTGTG 25740 

ATTGAAGCTC ATAAGGAGTT ACCGGAACTA ACCTTTGATA TCTATGGTAG TGGTGGAGAA 25800 

GATTCTCTGC TTAGAGAAAT TATTGCAAAT CATCAGGCAG AGGACTATAT CCAACTCAAG 25860 

GGGCATGCGG AACTTTCGCA GATTTATAGC CAGTATGAGG TCTACTTAAC GGCTTCTACC 25920 

AGCGAAGGAT TTGGTCTGAC CTTGATGGAA GCTATTGGTT CAGGTCTACC TCTAATTGGT 25980 

TTTGATGTGC CTTATGGTAA TCAGACCTTT ATAGAGGATG GGCAAAATGG TTATTTGATT 26040 

CCAAGTTCAT CTGACCATGT AGAAGACCAA ATCAAGCAAG CTTATGCCGC TAAGATTTGT 26100 

CAATTGTATC AAGAAAATCG TTTGGAAGCT ATGCGTGCCT ATTCTTACCA AATTGCAGAA 26160 

GGCTTCTTGA CCAAAGAAAT TTTAGAAAAG TGGAAGAAAA CAGTAGAGGA GGTGCTCCAT 26220 

GATTGAACTT TATGATAGTT ACAGTCAAGA AAGTCGAGAT TTACATGAAA GTCTAGGCGC 26280 

TACTGGTCTT TCTCAACTTG GAGTGGTCAT CGATGCAGAT GGTTTTCTGC CTGATGGTCT 26340 

GCTTTCTCCT TTTACCTATT ATCTAGGTTA CGAGGATGGA AAACCTCTCT ATTTTAATCA 26400 

AGTTCCCGTT TCAGATTTTT GGGAAATTTT AGGAGATAAT CAGTCTGCTT GTATTGAAGA 26460 

TGTGACGCAG GAGAGGGCTG TCATTCATTA TGCTGATGGA ATGCAGGCTC GCTTGGTTAA 26520 
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ACAGGTAGAC TGGAAAGACC TAGAAGGTCG AGTACGTCAG GTTGACCACT ACAATCGCTT 26580 

CGGAGCTTGT TTTGCTACAA CGACTTATAG CGCAGATAGC GAGCCGATTA TGACAGTTTA 26640 

CCAAGATGTC AATGGTCAAC AAGTTTTACT GGAAAACCAT GTGACGGGTG ATATCTTATT 26700 

GACTTTGCCA GGTCAGTCCA TGCGTTACTT TGCAAATAAA GTTGAATTTA TCACCTTCTT 26760 

TTTGCAAGAT TTGGAAATAG ATACCAGTCA GCTTATCTTT AATACTCTAG CGACTCCTTT 26820 

CTTGGTTTCC TTCCATCATC CAGATAAATC TGGCTCGGAT GTCTTGGTAT GGCAGGAACC 26880 

TCTCTATGAT GCCATTCCAG GTAATATGCA GTTGATTTTG GAAAGTGATA ATGTGCGTAC 26940 

TAAGAAGATC ATCATTCCAA ATAAGGCGAC TTATGAGCGC GCTTTAGAGT TAACTGACGA 27000 

GAAATACCAT GATCAGTTTG TGCACTTGGG TTATCATTAC CAGTTCAAAC GTGATAATTT 27060 

CCTAAGACGA GATGCCTTAA TCTTGACCAA TTCAGATCAG ATTGAGCAAG TAGAAGCAAT 27120 

CGCAGGAGCC TTGCCTGATG TCACTTTCCG TATTGCAGCG GTGACAGAGA TGTCTTCTAA 27180 

GCTCTTAGAC ATGCTTTGCT ATCCTAATGT GGCCCTTTAC CAGAACGCTA GTCCACAGAA 27240 

GATTCAGGAG CTGTATCAAC TGTCGGATAT TTACTTGGAT ATAAACCACA GTAATGAGTT 27300 

GCTACAGGCA GTGCGTCAGG CCTTTGAGCA CAATCTCTTG ATTCTTGGCT TTAATCAGAC 27360 

GGTGCACAAT AGACTTTATA TCGCTCCAGA CCATCTATTT GAAAGTAGTG AAGTTGCTGC 27420 

TTTGGTTGAG ACCATTAAAT TGGCCCTTTC AGATGTTGAT CAAATGCGTC AGGCACTTGG 27480 

CAAACAAGGC CAACATGCAA ATTATGTTGA CTTGGTGAGA TATCAGGAAA CCATGCAAAC 27540 

TGTTTTAGGA GGCTAACATG TCAGAGGAAG ATTTATTTTA CAAAGACGTT GAAGGCCGCA 27600 

TGGAAGAGTT GAAACAAAAA CCCATCAAGA AGGAAAAAGA AACCCGAGGG GAAAAGATTA 27660 

GTAAGACTTT TTCACTTTTA CTGGGTTTGA TGATTCTGAT TGGTTTGCTC TTTACTTTGC 27720 

TGGGAATTTT GAGGTAGATC TATGATTGAA ATACTAATTG TTTTAGCTAT TATCCTATCT 27780 

CTTGCTTTGA TTGTATTGGT AACTATACAA CCCCGTCAAA ATCAACTATT TTCCATGGAT 27840 

GCCACTAGTA ATATTGGTAA ACCAAGCTAC TGGCAGAGCA ACACCTTGGT CAAGGTGCTC 27900 

ACTTTATTGG TGAGTTTGGC TTTATTTATT CTACTATTAA CCTTTATGGT GATTACTTAT 27960 

AAATAAAAGA AAACTTCAGA TATTCACCTT TTGTGGATTG GTCTGAAGTT TTCTTTTTTA 28020 

TACTCAATGA AAATCAAAGA GCAAACTAGG AAGCTAGGCG CAGGCTgCTC AAAACACCGT 28080 

TTTGAGGTTG TAGATATAAC TGACGAAGTC AGCTCAAAAC ACCGTTTTGA GGTTGTAGAT 28140 

ATAACTGACG AAGTCAGCTC AAAACACCGT TTTGAGGTTG TGGATAGAAC TGACGAAGTC 28200 

AGCTCAAAAC ACCGTTTTGA GGTTGTGGAT AGAACTGACG AAGTCAGCTC AAAACACCGT 28260 

TTTGAGGTTG TGGATAGAAC TGACGAAGTC AGCTCAAAAC ACCGTTTTGA GGTTGTGGAT 28320 
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AGAACTGACG AAGctCAGTA ACATATATAC AGCAAGGCGA CGCTGACGTG GTTTGAAGAG 28380 

TATTACTGTC TATATTTTTG GTAAAAATCA ACTTTTACTT GGATGAAGGT TTTGGCTTCA 28440 

CGTAGGAGTT GAAGAAGGGT GGCGCGGGTT TCAAATTCTT CTCTTGTCTT GGGCAGACTG 28500 

CGGTTCCGGA AGACTTCCAG ATAACGTTCA ATTTCATCTA GCAAATCAGA AGCAGGATTG 28560 

GTCTGGCTCA GTTGACCTGC AATTTTTGAA AAGAGTTGCG CTAAGATCAG GCTTTCACTG 28620 

GCGGCAAGGT GACAAGTGTT AATCTGTTGG GCCATGTTTC TCAGGATACG ACTTTGTCGC 28680 

TGTCTCATCT CAAAGTAGTG GATATGGTAG TCTGTCTGGT GAAAGAGGTG GTCAGAGTGA 28740 

TCCAAATAGA CCAGTCTGAG GGCTTCTTTC AAAAGCGTGT CTAATTCTGC TACCAGCTGT 28800 

GCTCGGTTGC GTCCGTCTCC TCTGGATAAA TAGTATTTGA AGCGCTGGAG GATATCTTTT 28860 

AACTTTTCTT CCACCAGCGT GTGGTAGTGC TGGATTTCCT CTTCTCGTGA AGGCATATAG 28920 

AGATTAACAA GCAAGGCAAA TCCTGTACCA ATAGCAAAGA GAAGGAATTC ATTGACTAGA 28980 

AGGTCTGGAG AGGTTGACTC TTGAACCAAG AGATGGCTAA CCAAAACAGT GCTTGGTGTG 29040 

ATG CCAATTT CCCAGCCCAT CTTGTAGGCT AAAGGAACGT AGAAGGCCAG ATAGAGGCCG 29100 

AGACTCCAGA TATGAAATCC GCTCAAGTGA AAAGCTAGAA CACCGATAGC CAGAGCTAGA 29160 

AGCATAGAAA AAAGACGATT GCGAGCCAGT TTTAAAGTAC TTCTACGCGT ATCAGATAGG 29220 

CTCAAGAGAG CGATAATTCC AGCCGAAACT GCTGACGAAA GATTGAGAAA ATAAGCAAGC 29280 

AGGCAGGCAA GACAGGTAGC TAAGATGAGC TTGGTCGTAC GTTGGCTAAT AGACATAAGA 29340 

ATTTCCTAAT AAGTTAGAAT AAAAGCGTAA AAGACAAGAC ATGAGCAGGC TTGCCTTGAT 29400 

GAGTTATTTT TTACGGGTTG CTGCGTATTC GGCAACGGCG GTAAAGAGGA CATCTGTAGA 29460 

AGAGTTAAGG GCTGTTTCAC ATGAGTCTTG GATGACACCA ATCACAAAAC CAACCCCAAC 29520 

AATTTGTATG GCAATATCGT TAGAAATACC GAAAAGGCTA CAAGCAACTG GGATAAGAAG 29580 

GAGGGAACCT CCGGCAATAC CTGAAGCATC ACAGGATGAG ATAGCTGCTA CCACACTGAG 29640 

GACAAAGGCT GTGGCAAAGT CAACAGGAAT TCCAAGAGTG TTAACTGCAG CAAGGGTCAA 29700 

AAGGTTAATG GTAATCGCTA CTCCAGCCAT ATTGATAGTA GAACCGAGTG GGATAGAAAC 29760 

AGAATAGGTA TCTGGGTTGA GTCCAAGGTC ATGGCAGAGT TTCATGTTGA CAGGAATGTT 29820 

AGTCGCAGAA CTACGAGTGA AAAAGGCTGT CACACCGCTG ACACGGAGGC AGTTCCAAAC 29880 

TAGAGGGTAA GGATTGCGTC TCATAAAGAA GAAGGCAATC AAAGGGTTGA CCACAGGGGC 29940 
AACAAAAAGC ATAGTCGTTA CTAATAGAAC CAATAAAATA CCGTAGTTGG CAAGGCTTCC - 30000 

GACTCCCTTG TCAGAAATGG TTTTAAAAAC AAGACCAAGG ATTCCAAATG GAGCCAGATT 30060 
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GATGATCCAT TCGACAATTT TAGAAGTCAC GTCAGCGATA GTTTTTAGCA ATTCTTGACT 30120 

ATTTTTACTG GCTTCTCTCA TAGCGATTCC AAAAATGACT GCCCAAGATA AGATTCTAAT 30180 

ATAGTTAGCA GTAAGCAGGG CGTTGACTGG GTTGTCAACC AGTTTGAGCA AGAGGTTGCT 30240 

GAGAACCTGC CCAATCCCAT CTGGTGGTGC AATTTCAGTA TTGGCACTAT TTGGGGTAAT 30300 

TTCAATAGGG ACGATGAAAT TTGCTAGTAC AGCTACAAGA GCAGCGGCGA AAGTCCCTAT 30360 

CATAGGATAT ACAAGAAAAC AACAGTTTTC ATATTGCTAT CTTGTCCCTT TTGATGTTGG 3 0420 

GAAAGGGCAT TGGCAACGAG AGCAAAGACT AGGATAGGAG CAACAGCTTT TAGACCTCCA 30480 

ACGAATAAAT CCTCGAGTAG CCCAATCCCT GAGAGATTAG GAAGGGTCAG TCCTAGGATT 30540 

CCCCACAAAG CATACCAATC AAGATACGCT TGACAAGGCT TGCCTTATTC CAAGCATGAA 30600 

TGATTCTTTT CATAATAATC TCCTTTTTGT GTAGTGATTA TGATTATAGT ATAAATGATA 30660 

GACAAAATCA AGAATTTTCT GTCTATTTTT TGAATATTTA TGGAGAATGA GACTGATGAA 30720 

AATATGGTAT AATGAAATAA AGGAGTTTTA TATGCAAAAA TTTATTCAGG CTTATATTGA 30780 

AAAGCTAGAT GTGACAACCA TTATCGAGAA TATTCTAACC AAGGTCATTT CTCTTTTACT 30840 

GCTTTTAATT GTATTTTATA TTGCTAAAAA AATGCTTCAT ACCATGGTGC AGAGAATTGT 30900 

CAAACCTTCT CTAAAAATGT CTCGTCATGA TGTTGGACGC CAAAAAACCA TCTCACGTTT 30960 

ACTAGAAAAT GTGTTTAATT ATACGCTATA TTTCTTTTTA CTCTACTGCA TTTTGTCGAT 31020 

TTTAGGTTTG CCAGTTTCTA GTTTGCTGGC TGGAGCTGGT ATTGCTGGGG TAGCGATTGG 31080 

TATGGGAGCC CAAGGCTTTC TGTCTGATGT CATCAATGGC TTTTTCATCC TCTTTGAACG 31140 

TCAACTGGAT GTGGGAGATG AGGTCGTTCT GACAAATGGA CCGATTACTG TATCGGGTAA 31200 

GGTTGTCAGT GTGGGAATTC GTACGACACA GCTTCGTAGC GAGGAGCAAG CCCTTCACTT 31260 

TGTCCCTAAC CGAAATATCA CAGTTGTTAG CAATTTCTCA CGCACAGACT AGACCTGTTA 31320 

TTTTAAGTAA TTTGTGGTAC AATAGAGGGA GTTTAATAAG GAGAAAAGAT GGTTTTAGAA 31380 

AAGCAGTTGG GCAATGGTTG TACCTGGATA GACCTAGACC TAGGAAAGTT GAATAAACTA 31440 

GAAGACCTTT CTGAAATTTA CGGTTTGGAC AAGGAAACCA TTGAATACGC ACTGGATAGA 31500 

AACGAGCGCG CCCACATGGA CTACCACCGT GAAAGTGAGA CGGTTACCTT TATCTATAAT 31560 

GTCTTAGACG TAAAAAAGGA CAAGGCCTAC TATGAGACTT TTCCCATGAC CTTTATTGTC 31620 

GAGCATCGTC GCCTGATTAC CATTAGTAAT ACCAAGAACG CCTATGTCAT TGAACAGATG 31680 

ACTCGTTATC TGGAGAACCA TGACACGCTT TCGATTTATA AGTTTCTCTT TGCCAGTCTG 31740 

GAAATCATCA GCAATGCCTA CTATCCTGTC ATTGAGCAGA TGGACAAGAG TAGGGATGAG 31800 

GTCAATGACC TCTTGCGCCA GCGAACTACC AAGAAAAACC TCTTTGTCCT GTCTGATTTG 31860 
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GAGACTGGTA TGGTTTATCT GACGGCAGCT GCCAAACAAA ATCGGATTTT GTTAGAGCAT 31920 

ATTCAAGGTC ATGCCTTGTA TCGTAGTTTT GATGAGATTG AGAGAGAACA <3TrPGATGAT 31980 

GCCATGATTG AGGCTCATCA GCTGGTATCC ATGACAGACC TAATCTCTCA GATTTTACAG 32040 

CAGCTTTCAG CCTCTTACAA CAATATTCTA AACAATAATC TGAATGACAA TTTGACAACC 32100 

TTGACTATCA TTTCAGTCTT GCTAGCTGTT TTGGCAGTCG TGACAGGCTT TTTCGGAATG 32160 

AATGTTCCCT TACCTTTAAC AGATGAGCCC CATGCTTGGC TCTATATCAG TTTGGCTAGT 32220 

GCAGGTTTGT GGATTGTTTT ATCCTTGTTA CTAAGGAAAA TTGCGAAAAA AAGTTAAGAA 32280 

AAGGAGCCAG AATGGCGATT GAAAATTATA TACCAGATTT TGCTGTGGAA GCAGTCTATG 32340 

ATCTGACAGT CCCAAGCCTG CAGGCGCAGG GAATAAAGGC TGTTTTGGTC GATTTGGATA 32400 

ATACCCTCAT TGCTTGGAAC AACCCTGATG GAACGCCAGA GATGAAGCAA TGGCTACATG 32460 

ACCTTCGGGA CGCGGGTATT GGCATTATCG TAGTGTCAAA TAACACCAAA AAACGCGTTC 32520 

AACGAGCAGT TGAGAAATTT GGGATTGATT ACGTTTACTG GGCCTTGAAG CCCTTCACAT 32580 

TTGGTATTGA CCGTGCTATG AAGGAATTCC ACTATGACAA AAAGGAAGTG GTCATGGTTG 32640 

GTGACCAACT CATGACAGAT ATACGAGCAG CCCACCGTGC AGGGATTCGG TCAATTTTAG 32700 

TCAAACCCTT GGTCCAACAT GACTCAATCA AAACGCAGAT TAACCGAACT CGTGAGCGTC 32760 

GTGTTATG 32768 



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 14872 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

CCAGTCACAA AGAAATTGAG CGCGTTCAGc TGAGGATGCA CTATGATGCA AGCTACATTT 60 

CATTTGATGG GATATTAAGA AAGGAGATTT TCATGACACT TTTAGATGTA AAACACGTTC 120 

AAAAAATTTA TAAAACACGT TTTCAGGGCA ACCAAGTAGA AGCCCTCAAG GATATTCACT 180 

TTACCGTAGA AAAGGGTGAC TACGTTGCCA TCATGGGTGA GTCTGGTTCT GGTAAATCAA 240 

CTCTTCTCAA TATTCTAGCT ATGTTGGATA AACCAAGTCG TGGTCAGGTT TACTTGAATG 300 

GAACTGACAC CGCAACTATT AAAAATTCAC AGGCTTCTAG TTTCCGGCGT GAAAAGCTAG 360 

GATTTGTCTT CCAAGACTTT AACTTGCTAG ATACTCTGTC TGTTAAGGAC AATATCTTGC 420 
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TTCCGCTTGT CTTGTCAAGA AGACCTATAA 
CTGAGAATCT GGGTATTAAC CAATTGCAAG 
AGAAACAGCG TGTAGCAGTA GCCCGCGCCA 
ACGAGCCAAC AGGAGCCCTT GATTCCAAGT 
AAATCAATGA GCGTGGGCAA ACCATCCTCA 
GGGCCAAGCG TGTTCTCTTT ATCAAAGACG 
AGAAGACAGA GCGTCAGATG TTCCAAGAAA 
AGGTGAATTA GTATGTTTCG ATTAACCAAT 
CGCAAACTCT ACTATCCCTT TGCACTGGCT 
TTTTACTCCC TAACCTTCAA TCCAAAGATT 
GCAACACTTG GATTTGGTAT GTTTGTCGTT 
CCAATAGTTT TGTCATGAAA AACCGTTCCA 
TGGAGAAGCG CCATCTAATC AGTATGACCT 
CTGTTGGAGC GGGTATCGGT ATTGGAGCCT 
TCAAACTAAT GAAACTGAAG GTTGAGCTGG 
CAGTACTTGT TGTCTTTGGA TTGATTTTCC 
TCGCCCGTAT GAATGCCCTC CAGCTCTCGC 
GCTTCCTACC TCTCCAAACG ATTCTTGGTT 
CCCTTACGGT AACCGATCCT CTTACAGCCC 
TTATCTTTGG TACTTATCTA TTGTTTAATG 
AGAAAAACAA GAAATACTAT TACCAACCTA 
TCCGTATGAA GAAAAATGCG GTTGGACTAG 
TGGTAACCAT GTCAGCAGCG ACAAGCATTT 
TAAATCCTCA TGATTTTGGG GTTTCAGGGC 
TCTTGAGCCA GTTTGCAAGT GACAAAGGTT 
ACAGTAACTT TGGTATTGCA AATCAAGAAG 
AAAACCGTGT CCAACCCACA ACAGTTTTCA 
TGACTGGTCA AAAACTGTCT CTATCAGGAA 
GACTGAAAGG ACAGAAAGCT CTAACTCTAA 
TTAATAAAGA TTTCATTGTG AACCATGTTC 



594 

CGGAGATGAT GAAGAAATTG 
AGAAGTACCC TTACGAGATT 
TCATCACAGA ACCTGAAATT 
CATCTGCAGC CTTACTTGAT 
TGGTAACCCA CTCAACAGCA 
GCATTCTTTA CAACCAAATC 
TCTCTGATAC CTTGACTGTC 
AAGTTAGCGG TATCGAACTT 
GTTCTCTTGG CAGTCACCAT 
GCGGAAATCC GTGGAGGAAC 
ACCCTTGCGT CAcCATTATC 
AGGAACTGGG TATATATGGC 
TTAAGGAGTT AGTGGTATTT 
TGTTTGACAA GTTAATTTTC 
TTGCTACCTT CCAAATGAAT 
TAGGCCTCAT GTTCCTGAAT 
GTGAGAAAGC AAGCGGAGAG 
CCATAAGTTT AGGGATTGGC 
TAACAACTTT CTTCCTAGCT 
CAGGGATTAC AGTCTTCCTA 
ATAACCTCAT ATCTGTTTCC 
CAACCATCGC TATTTTGTCA 
TCAATTCCGC AGAAAGCTTT 
AAAATGTTGA AAAAGAAGAT 
ATAGTGTCAA AGAGAAAGAA 
GAACCAAGTT AACTATTTTT 
TGGTATTTGA CCAAAAAGAT 
ATGAGGTCGG TCTCTTTGCC 
ATGATCATCA ATTTTCTGTC 
CAAATAAGTT TAATATCTTG 



GTGGTGACAG 480 

TCTGGTGGTC 540 

CTCCTTGCGG 600 

GTCTTTAATG 660 

GCTGCTAGCA 720 

TACCGTGGAG 780 

ATGGCAAGCG 840 

GATTAAAAAC 900 

CACCTATCTC 960 

CACCATTCAA 1020 

GTCCTCTATG 1080 

ATGTTAGGCT 1140 

GGGATTCTAA 1200 

GCTTTCCTGC 1260 

GTTGTCATTG 1320 

GCTCTTCGAA 1380 

AAAAGAGGTC 1440 

TATTATCTTG 1500 

GTTTTGCTGG 1560 

CAAATCTTAA 1620 

AACTTGATTT 1680 

ACAATGGTTT 1740 

AAAAAAGTTC 1800 

TTGGACAAAC 1860 

GTACTTCGTT 1920 

GAAAAAGGAC 1980 

TATGAAAATA 2040 

AAAAATGACG 2100 

AAAGAAGAAT 2160 

ACTACTGATT 2220 
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ACAATTACCT TGTTGTTCCT GATTTACAAG CCTTTTTGGA TCAATTCCCA GATTCGGCTA 2280 

TCTATAATCA GTTTTACGGT GGTATGAATG TAAATGTCAG TGAAGAAGAA CAACTCAAGG 2340 

TCGCTGAGGA GTATGAAAAC TACCTCAATC AATTTAATGC TCAATTAGAC ACAGAAGGTA 2400 

GCTATGTTTA TGGTAGCAAT CTAGCAGATG CTAGTTCTCA GATGAGTGCC CTCTTTGGTG 2460 

GTGTCTTCTT TATCGGTATT TTCCTATCCA TTATCTTTAT GGTCGGAACT GTTCTGGTCA 2520 

TCTACTACAA ACAAATTTCT GAAGGCTACG AAGACCGTGA ACGCTTTATT ATCTTGCAGA 2580 

AAGTCGGTTT GGACCAAAAG CAAATCAAGC AAACCATCAA CAAACAGGTT TTAACTGTTT 2640 

TCTTCCTTCC TTTGCTCTTT GCCTTCATAC ATCTCGCCTT TGCCTACCAT ATGCTTAGCC 2700 

TGATTTTAAA AGTGATTGGT GTACTGGATA CGACTATGAT GTTGATTGTG ACCTTGTCTA 2760 

TCTGCGCTAT CTTCCTCATC GCCTATGTGC TGATTTTCAT GATTACTTCA AGAAGTTATC 2820 

GCAAGATTGT GCAAATGTAA AAAAGATACC TCGACTTCAA AATCGAGGTA TTTCTTGTAT 2880 

TCTAAATGCT GAAAAGTTGT CCGAGCAGGA AGGTAACTCC CATGGTCAAG AGACCAATAG 2940 

CAAGGTTCCG AATCATAGCT GTTTTGGTTG GGGCTTTTCC AAGTCTAGCA CTTGTGTAAC 3000 

CAGTGAGAAG AAGGGCCACA CCGACAATAA GGACGGTAGC AGGGATGCGG TAATCACTTG 3060 

GAAAAATGGT CACTGACAGC ATTGGAGGCA AACTTCTAAG GAAAAAGGCA ACGAAGCTAG 3120 

AAATGGCAGC GTGCCAAGGA TTGGTAAATT CTTCATACTC AATCCCATAT TTTTCCTCTA 3180 

CCAGAGCCTT GAGTGGATTT TTAAGAAAGA TCTTATTGGT CAAGAGTTGG GCAGAAGTTT 3240 

TGAATTCTCC ATTTTGGATA TAAGCAGCAT AGAGGGATTT TTTGGCTAGT TCCCTATCTT 3300 

GGTCTAGCAA GAGTTTTTCT CGCGAAACGG CAGCTTCCTC GGTATCTTTT GGAGTTGAAA 3360 

CGGATACATA TTCTCCACCA GCCATTGAAA AGGCACCAGC TAAGATAGCC GTAAAACCTG 3420 

ATAAAAAGAT AATCCAGATA TTGGTCGTGG CACTGGCAAC TCCGATAACC ACACCAGCAA 3480 

TGGAAATAAT TCCATCGTTA GCATCAAGAA CACCCGCACG CAGGATATTT AAACGACCTG 3540 

CAAAATTTGA ATCAATTTCG TGATTTGTTT CTGACGCTAA ATTTCAAGTT CAAGTTAGCC 3600 

ATCAAGAAGT CTTCTCTGGG TGACTTGTAG TCCAAGCATT TTTTAGGATA GTTGTTAATC 3660 

CACTTTTCGA TGAATGCGAC TTCTTTGGGA GTCATTTTCT TGGTTCCCTT AGGTAACCAT 3720 

CTACGAATGA GCCTGTTGTG ATTCTCATTA GTTCCCCTTT CCCAAGAGGC ATAGGGATGT 3780 

GCATAATAAA TGTGCTCCTC AGAAAATACA TTAGACAAGC GATTGAATTC CGTTCCATTA 3840 

TCTGCCGTGA TGGAAAGAAT CTTGTGTTGT TTTAAGATGA GTTTTAGAGC CTGATTGACC 3900 

ACATCAGCAC TTTTATTTGG AATCAATCGG ATGATCTGAT GTCTACTTTT TCGATCCGTC 3960 
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AAGACAAGCA AGCAGTAGTT TTTCGCTCTC GTAAGTAGAA CTGTATCAAT CTCATAATGC 4020 

CCATTCTCCA AGCGAAAATT GATAGCTTCA AGCCGCTGTT CGATGGATTG ACCAGCAGGT 4080 

TTAAAGTTGG TGCTGGCCTG TTTCTTAAGC GCTTTTCCTT TTCTAGGGTA AAGCAGATCC 4140 

TGTTTGCTTA ACCCCAATTT TCCATGATGA ATCCAATAGT AAATGGTTGA AATTCCCACG 4200 

TTAACCCCTT TAGCCATCAC CATCATTTCA GGCGAAAATT TTTGGTTATG ATAGTGGAGA 4260 

ATCTTTTCCT TTAGTTCCTT GGTCAAGCTT GATTTCTTGA CCGAGCGCTT GCGATTGTTT 4320 

TCATAAGACT GTTGAGCATA GTCGGCAGAA TAAACCTCTT TGAAGCGCCC TTTTCCAAGA 4380 

CATTGTCGGA CTGTCCCACG CTTGATTTCA GTGTGGATAG TTTGAGGAAC TTTTCCAAGC 4440 

AGAGAGGCAA TTTCTCTATT TGATTTCCCT TCTTTTTTCC ATCTTTCGAT TAAGCGACGG 4500 

CTATCGATTG TCAAATGTTC GCCTTTTGTA GTATAATGGT TTTGCATCTC TGTGCCTTTC 4560 

TTGTGTTTGT GGTTGAACAA CAAGTATAAC ACAGAGGTGT TTTCTTATGC CTACAAGAGC 4620 

TATCGGCTAG TTGAACCATC TAATTTTTAG GAGGGCTGGG TGGCTAACTT CATTATAGAA 4680 

CTTTCATTTA CGAACATATA GTAAAATGAA ACAAGAACAG AACAAATCGA TCAGGACAGT 4740 

AAAATCTATT TCTAACAATG TTTTAGAAGC AGAGGTGTAC TATTCTAGTT TCAATCTATT 4800 

ATATTTTTGT TTTTTATCAA AAAATACTTT ACAAGTTCTT AAAAACATGA TATAGTAATA 4860 

AAGCTTAGAA AATGAGATGA TGTTTTCTAG CAAATATAAA CCCGAGTAAA AAATGCCTAC 4920 

GGACAGGCAG GGTTGAATGC CGAAGCGTGG TTGAAAAGCC ACATTATTGA TAGGGTTAAA 4980 

AGCCTACTTT TATAAGTTGA TGTTAGGACA CTTGTCCTAA TTCATAAATT TTTAGTGTGG 5040 

TGAAAGCACA CGTCATCTTG TGAAACGATC AATAAAGTAC GTAATATTTG CTACTAGAGA 5100 

GTTAGGAAAC ATCGGGAACA GACATACTCA ACAGAAACCA AAATAAACAC GTCAGAAGAT 5160 

TGCAGAGCAG GTGAAAACCT GCTCTTTTTT CATGAGTCAA CCTTTAGTTC CTTAGTTTTC 5220 

ATAAGGTCCT AAAAATATTG AAAGGAGTAT GTTTTGAAAG AGTTAGATCA AAACCAAGCC 5280 

CCAATTTATG AGGCCTTGGT GAAGTTACGC AAGAAAAGGA TTGTTCCCTT TGATGTTCCA 5340 

GGTCACAAGC GTGGACGGGG AAATCCAGAA CTTGTCGAAC TCTTAGGAGA AAAATGTGTA 5400 

GGCATTGATG TCAATTCGAT GAAACCTTTG GATAATTTAG GCCATCCTAT TTCGATTATT 5460 

CGTGATGCAG AGGAGCTGGC TGCAGATGCT TTTGGAGCTA GCCATGCCTT TCTAATGATT 5520 

GGTGGAACAA CTTCATCGGT GCAGACTATG ATTCTGGCAA CCTGCAAGGC AGGAGATAAG 5580 

ATTATTCTGC CACGAAATGT CCATAAATCT GCTATCAATG CGTTGGTTCT ATGTGGTGCC 5640 

ATTCCCATCT ATATCGAGAT GAGTGTAGAT CCTAAGATTG GTATCGCTTT AGGTCTTGAA 5700 

AATGACCGAG TAGCACAGGC CATAAAGGAC CATCCAGATG CTAAGGCTAT CCTAATCAAC 5760 
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AATCCTACTT ACTACGGCAT CTGTTCAGAC CTAAAGGGGT TGACAGAAAT GGCTCATGAA 5820 

GCTGGCATGA TGGTTTTAGT AGATGAAGCC CACGGAGCGC ATTTGCATTT CACTGATAAA 5880 

CTTCCAATTT CTGCTATGGA TGCAGGGGCT GATATGGCAG CAGTTTCCAT GCATAAGTCT 5940 

GGTGGGAGTT TGACCCAAAG CTCCATTTTA CTTATCGGGG AGCAGATGAA TTCTGAATAC 6000 

GTTCGTCAGA TAATTAACCT GACCCAGTCT ACATCTGCCT CTTACTTGTT GATGGCTAGT 6060 

TTGGATATTT CACGTCGCAA CTTGGCCCTT CGTGGTAAAG AGTCGTTTGA GAAAGTCATT 6120 

GAGCTATCTG AGTATGCCCG CCGTGAAATC AATGCTATCG GTGGCTACTA TGCCTACTCA 6180 

AAAGAGTTAA TAGACGGTGT TTCGGTTTGC GATTTTGACG TAACTAAGCT GTCAGTTTAC 6240 

ACTCAGGGTA TTGGCTTAAC AGGTATCGAG GTTTATGACC TCTTGCGAGA CGAATACGAC 6300 

ATTCAGATCG AGTTTGGTGA TATCGGCAAT ATCTTGGCCT ATATTTCCAT CGGCGACCGC 6360 

ATCCAAGACA TCGAGCGCTT GGTTGGTGCT CTGGCTGATA TTAAGAGACT CTATTCAAGA 6420 

GATGGAAAAG ATTTGATAGC AGGAGAATAT ATTCAGCCCG AGTTAGTGCT GTCTCCGCAA 6480 

GAAGCCTTCT ATTCAGAAAG AAAAAGTTTA ACTTTGGATG ATTCTGTTGG ACAGGTCTGT 6540 

GGAGAATTTG TTATGTGTTA CCCTCCAGGT ATTCCTATCT TGGCTCCTGG TGAACGCATT 6600 

ACACGAGAAA TTGTCGACTA TATCCAATTC GCCAAGGAAC GTGGTTGCTC CCTCCAAGGG 6660 

ACGGAAGATC CAGAGGTCAA TCATATCAAC GTTATTAAGA GAAAGACAAA CTATAAGAAA 6720 

AGTCAATAGT TTTATCTAAA CTATTTCTTA TTTCAATTTG ATGATTTGGC GATGATTTTA 6780 

GAGCACGGCA AAAAGCCCTT GAATTAGAAG CGGTCAATCG CTTAATTTCT ATCAGCTTAT 6840 

CAAATCCTGC CTCAAGCCTT TTCTGAGGAT TAGGGTAGCG TGTCAAGAGT TGGTAGGTAT 6900 

ATTCTGAATG CTTTCCAACG ATTTTATCCA ACTCAGGAAA GATGATATCA AGACAACGAG 6960 

TGTATTGTAC TTTCCAATCA GACTGTTTTT TCTTGAGACG ATGAATATGT CTAGCCAGTA 7020 

TTTTTAGTTC TACTTGCCGA TTATCGTGTT GAAATTGTTC ACGATTGGGG TCAGAAAGAA 7080 

GTTTAAGAGC GATGCCATGA GCGTCTTTCT TATCCGTTTT AGTTTTGCGA AGTGATAATG 7140 

ATTTGGCAAA TTTCTTGATG AGCAAAGGAT TGTAGGTGTA AACTTTATAT CCTTGTTCAT 7200 

GCAGGAAGTT CAGTAGATTA AAGGCATAAT GTCCGGTATT TTCAAGAGCG ATGAGACAGT 7260 

CTTGGTTGAG CTGTCGAAGA GACAGATCTA AGAGTTCAAA ACCAGCTTTA TTATTTGAAA 7320 

AAGTGAGTGG TTTAAGAACA GTTTTTCCTG GAACATTCAA GGCTGTAACA TCGTGTTTAT 7380 

TTTTAGCGAC ATCAATGCCC ACATAAAGCA TGGGAGTATC TCCAGATATA GTATTTCAAG 7440 

TCTACTGGGT TATCCACGAA CTTTTTGCCT TGTTACCTTA GACGAGATAA AACGTCTATG 7500 
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CGTTATCAAA CTCATTACCA ATTGAAACAA AAAACTGTGG TTAGAGCCTT TCGGAAATCG 7560 

TCAAGCGATT GGAGGAAATG AACTAATCCA CAGTGGCTTA TTCCAAGTAT ACCACTTGGG 7620 

CTTTGGCAGT AGCTAACTGC GCTAAATATA ATATAAGGAG AAATAGATGG ATTTATGGTT 7680 

TTCTGAAGTT CATACTCCAG ATGTCAAATT GTCTCTGAGA ACAGCCAAGC AACTTTACGC 7740 

TGGAAAAAGT GAATGGCAGG ATATCGAAGT CTTGGATACG CCAGCTTTTG GGAAAATACT 7800 

GATTTTAAAT GGCCATGTCT TGTTCTCAGA TGCGGATGAT TTCGTCTACA ATGAAATGAC 7860 

CGTTCACGTT CCCATGGCTG TCCACCCAAA TCCAAAGAAA GTATTGGTTA TTGGGGGTGG 7920 

TGACGGCGGT GTTGCCCAAG TATTAACCCT CTATCCTGAA CTGGAGCAAA TTGATATTGT 7980 

GGAACCGGAT GAGATGTTGG TCGAGGTCTG TCGTGAGTAT TTCCCAGACT TTGCTGCAGG 8040 

GCTAGATGAT CCTCGTGTTA CCATTTACTA CCAAAATGGG CTACGCTTTT TGCGAAACTG 8100 

CGAAGATGAT TACGATATTA TCATCAACGA TGCGACAGAT CCATTTGGCC ATACGGAAGG 8160 

ACTCTTTACC AAGGAATTCT ACGGCAATAG TTATCGAGCT CTGAAGGAAG AOGGCATCAT 8220 

GATTTACCAG CATGGGAGTC CCTTCTTTGA CGAGGATGAG TCGGCCTGCC GAAGCATGCA 8280 

CCGCAAGGTC AATCAAGCCT TTCCAATCAG TCGGGTTTAT CAGGCCCATA TTCCAACTAG 8340 

CCCAGCTGGC TATTGGTTGT TTGGATTTGC ATCGAAAAAA TACCACCCTG TCAAAGATTT 8400 

TGACAAGGAA GGCTGGAAAA AACGCCAGCT TTTCACAGAA TACTACACTG CAAACTTACA 8460 

CGTGGGAGCC TTTATGTTGC CCAAGTATGT TGAGGACATT TTAGAAGAAG AGGAAGGAAA 8520 

AAAATGAGTC GTTTACTAGT TATTGGTTGT GGGGGCGTTG CCCAAGTTGC TATTTCAAAG 8580 

ATTTGTCAAG ATAGCGAAAC ATTTACAGAG ATTATGATTG CTAGCCGTAC CAAGTCAAAA 8640 

TGCGATGACT TGAAAGCGAA GCTAGAAGGC AAAACAAGTA CTAAAATTGA AACTGCAGCA 8700 

CTTGATGCTG ACAAGGTTGA AGAAGTGATT GCCCTGATTG AAAGCTACAA ACCAGAAGCT 8760 

GTTTTGAATG TAGCTCTGCC TTATCAAGAT TTAACCATTA TGGATGCTTG TTTGGCAACA 8820 

GGTGTTCACT ATATCGATAC AGCCAACTAC GAAGCAGAAG ACACAGAAGA CCCTGAGTGG 8880 

CGTGCTATCT ACGAAAAACG TTGTAAGGAA CTTGGTTTTA CAGCCTACTT TGACTACTCA 8940 

TGGCAGTGGG CTTATCAAGA GAAATTCAAA GAAGCAGGCT TGACTGCTCT TCTTGGTTCT 9000 

GGTTTTGACC CAGGTGTAAC TAGTGTCTTT TCAGCTTATG CCCTCAAACA CTATTTTGAT 9060 

GAAATCCATT ATATCGACAT TTTAGACTGT AATGGCGGTG ACCACGGTTA TCCATTTGCA 9120 

ACCAACTTTA ATCCAGAAAT TAATCTCCGT GAGGTTTCTG CGCCAGGTTC TTACTGGGAA 9180 

GATGGGAAAT GGGTCGAAGT CGAAGCTATG TCTATCAAGC GTGAGTATGA TTTCCCTCAA 9240 

GTTGGACAAA AAGATATGTA TCTCCTTCAC CATGAAGAAA TCGAATCATT GGCCAAGAAC 9300 
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ATTCCAGGTG TCAAACGCAT TCGTTTCTTT ATGACTTTTG GTCAATCTTA CTTGACGCAC 9360 

ATGAAATGTC TTGAAAATGT TGGACTCCTT CGTACGGATA CCATTAACTT TAACGGCCAA 9420 

GAAATTGTTC CAATTCAATT TTTGAAAGCC TTGCTTCCAG ATCCTGCCAG TCTTGGGCCA 9480 

CGTACAGTCG GAAAAACCAA TATTGGATGT ATCTTTACAG GTGTCAAAGA CGGTGTCAAA 9540 

AAGACTATCT ATATCTACAA TGTCTGCGAC CATCAGGAAT GTTACGCAGA GGTTGGTTCG 9600 

CAAGCTATTT CTTATACGAC AGGAGTTCCA GCCATGATTG GGACAAAATT AGTCATGAAC 9660 

GGAACTTGGA AACAAGCTGG AGTGTATAAC CTTGAGGAGT TAGATCCAGA TCCATTCATG 9720 

GAAGCTTTGA ATGAGTATGG TTTGCCATGG GTTGTGGTTG AAAATCCACA AATGGTGGAC 9780 

TAATGAAGTT AGAACAAGTA CCAACACCAG CCTATGTTAT TGACTTGGCC AAGTTAGAAG 9840 

CTAATTGCCG CATTCTACAA TATGTACAAG AAGAGGCCGG TTGCAAGGTC TTGCTTGCCC 9900 

AGAAGGCATA TTCCCTCTAC AAAACTTATC CCTTGATTAG CCAGTATCTA TCAGGTACGA 9960 

CAGCTAGTGG ACTCTATGAG GCCAAATTGG CAAGGGAAGA ATTTCCTGGT GAAGTCCATG 10020 

TATTTGCGCC TGCTTTCAAG GATGCAGACT TGGAGGAATT GCTAGAGATA ATGGACCATA 10080 

TAGTCTTTAA CTCAGAGAGA CAGTTGCGTA AACACGGTCC GCGTTGTCGA GAGGCTGGTG 10140 

TCAGTGTTGG TTTGCGCCTC AACCCTCAGT GTTCAACTCA AGGcAGATCA CGCGCTCTAT 10200 

GACCCTTGTG CACCAGGTTC TCGCTTTGGA GTTACTATAG ACAAGATTCC GAGTGATTTG 10260 

CTAGATTTGG TTGACGGACT TCATTTTCAT ACCCTTTGCG AGCAGGGAGC AGATGATTTA 10320 

CAAACAACTT TGAAAGCAGT AGAAGAACAG TTTGGTCCCT ACTTACATGA GGTAAAATGG 10380 

CTCAATATGG GTGGTGGTCA TCATATTACA AGAGAAGGTT ACGATGTGGA TTTGCTGATT 10440 

TCAGAAATCA AGCGTATCCG AAAAACTTAC AATCTTGAAA TCTATATCGA GCCTGGTGAA 10500 

GCCATTGCGC TTAATGCGGG TTATTTAGCA ACTGAGGTAT TAGATATTGT AGAAAACGGT 10560 

ATGGAAATCT TGGTTTTAGA CGCCTCTGCG ACCTGCCATA TGCCTGATGT ACTTGAGATG 10620 

CCCTATCGTC CACCTTTGAG AAATGGCTTT GAGTCACAGG AAAAAGCCCA TACCTACAGA 10680 

CTTTCTTCTA ATACCTGTCT GACGGGCGAT GTGATTGGTG ATTATAGTTT TGAAAATCCA 10740 

GTCCAAATCG GAGACAGACT TTATTTTCAA GACATGGCCA TTTATTCTTT TGTCAAAAAT 10800 

AATACCTTTA ATGGTATTGG ATTGCCAAGT CTCTATCTCA TGGACGAACA GGGAGACTGT 10860 

AGCTTACTCA AAGCTTTTGG CTATCAAGAC TTTAAAGGGA GATTATCATG ATGGACAGTC 10920 

CAAAAAAATT AGGCTATCAC ATGCCAGCAG AGTACGAACC CCATCATGGT ACCCTCATGA 10980 

TATGGCCGAC TCGACCAGGA TCATGGCCTT TTCAAGGAAA GGCTGCTAAA AGAGCATTTA 11040 
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CTCAGATTAT CGAGACCATA GCAGAAGGGG AAAGAGTCTA TCTTTTGGTG GAGCAGGCCT 11100 

ATCTATCTGA AGCCCAATCC TATCTTGGAG ACAAGGTTGT TTATTTAGAC ATTCCCACCA 11160 

ATGATGCCTG GGCGCGTGAT ACTGGCCCAA CCATTCTCGT CAATGATAAA GGTAAGAAAT 11220 

TAGCCGTGGA TTGGGCCTTC AATGCTTGGG GAGGCACCTA TGATGGTCTT TATCAAGATT 11280 

ATGAAGAGGA TGACCAAGTA GCCAGTCGTT TTGCTGAGGC CTTGGAAAGG CCTGTCTATG 11340 

ATGCTAAACC TTTTGTACTG GAAGGAGGCG CAATCCATAG CGATGGTCAA GGAACTATTC 11400 

TCGTAACTGA AAGTTGCTTG CTTAGTCCTG GTCGCAATCC TAACTTGACT AAAGAGGAGA 11460 

TTGAAAACAC ATTATTAGAA AGTCTTGGTG CTGAAAAAGT TATTTGGCTT CCTTATGGTA 11520 

TTTATCAGGA TGAAACCAAT GAACACGTCG ATAATGTTGC TGCCTTTGTT GGTCCTGCTG 11580 

AGCTTGTTTT GGCTTGGACA GATGACGAAA ATGATCCCCA GTATGCCATG TCAAAAGCAG 11640 

ATCTCGAACT CTTAGAACAG GAAACAGATG CAAAAGGTTG TCACTTCACC ATTCATAAAT 11700 

TGCCTATCCC TGCAGTTCGA CAAGTTGTGA CAGAAGAAGA TTTGCCAGGC TACATCTATG 11760 

AAGAAGGAGA AGAAAAGCGA TACGCAGGTG AACGACTAGC AGCTTCCTAC GTAAACTTTT 11820 

ATATCGCCAA CAAGGCTGTC TTGGTTCCAC AGTTTGAGGA TGTAAACGAC CAAGTGGCCT 11880 

TAGATATCCT CAGCAAGTGT TTCCCAGACC GTAAAGTTGT CGGAATACCA GCCAGAGATA 11940 

TTCTCTTAGG TGGTGGCAAT ATCCACTGTA TCACCCAACA AATTCCAGAA TAGGAGAAAA 12000 

AGATGAGAAA TGTAAGAGTT GCAACCATTC AGATGCAATG CGCTAAGGAT GTGGCAACAA 12060 

ATATCCAAAC CGCAGAGCGT TTAGTACGTC AGGCTGCTGA GCAAGGAGCC CAAATTATTC 12120 

TCTTGCCCGA GTTGTTTGAA CATCCCTATT TCTGTCAGGA ACGTCAGTAT GACTACTACC 12180 

AGTATGCCCA ATCTGTAGCG GAAAATACTG CCATTCAGCA TTTTAAGGTG ATTGCTAAGG 12240 

AACTACAAGT TGTTTTACCA ATCAGTTTCT ATGAAAAAGA TGGTAATGTC TTGTATAACT 12300 

CTATTGCCGT CATTGATGCA GATGGGGAAG TGCTGGGCGT TTATCGAAAG ACCCATATAC 12360 

CAGATGACCA TTATTATCAA GAAAAATTCT ATTTCACGCC TGGTAACACT GGTTTCAAGG 12420 

TCTGGAATAC TCGCTATGCT AAGATTGGTA TCGGTATCTG TTGGGATCAA TGGTTCCCTG 12480 

AAACAGCGCG CTGTCTTGCA TTGAATGGTG CTGAATTGCT CTTTTATCCT ACAGCTATCG 12540 

GTTCAGAGCC AATTTTGGAT ACAGATAGTT GTGGTCACTG GCAACGTACT ATGCAAGGGC 12600 

ACGCAGCAGC GAATATTGTT CCAGTCATCG CAGCCAATCG TTATGGTTTA GAGGAGGTTA 12660 

CTCCTAGTGA GGAAAATGGC GGACAGAGCT CCAGTCTTGA CTTCTACGGT TCCTCCTTTA 12720 

TGACGGATGA AACAGGAGCT ATTCTAGAAC GAGCTGAAAG ACAAGAAGAA GCTGTTCTGT 12780 

TAGCTACTTA TGACCTAGAC AAGGGAGCAA GTGAACGCCT AAACTGGGGC TTGTTTCGAG 12840 
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ATAGAAGACC AGAAATGTAT AGACAAATTA CAGATTAGTG TGGGAGAAAT GAGAGATTCA 12900 

TTCTGCTAGA CTAACTTCTT ATTAGTAACT ATAAGATACT ATGGCATCTA GTAAATCGAT 12960 

TTTTATGATT CGCTATTCTT GTCTATTGAT TAGTCCGTAT TTTAAAATAT TAGCAAAAAA 13020 

GCAAATAGCA GTAACTTCTG TCTATTTGCT TTTCTTTTTT ATAGAATATA TTTCTCAATA 13080 

GCACGCGCAA CGCCGTCTTC TTCGTTGCTT GAGGTAACGG CATCCGCAAG AGATTTGATA 13140 

TAATCGCTGG CATTTCCCAT TGCAATCCCA AGCCCTGCAA ACTGGAGCAT TTCGATATCG 13200 

TTATTAGCAT CGCCCATGGC CATAATCTCT GAGGAATCAA TCTTCAAAAT CTCAGCTAGT 13260 

CGTGAAAGAG CAGTAGCCTT TGTCGTTCCA AGCGGCATTG CTTCATAAAT GACAGGCTGC 13320 

GAACGAACTC CACTGAATCG TTGGCAAAGC TCTTCAGCAA AACGCTGCTC AAAATCGTCT 13380 

GTTTGTTCTT TTGTTCCTAA ACACATACCT TGGAACATCC GGAACTTTCC ACTAGTCGCT 13440 

TCTTCAAGAG AAATTTCAGT CAGGTCTGAA AATACTAGTT TAGCATCATT TTCAATAACT 13500 

TGATTGGGCT TGTCACCGAG AACAAAATAA TGTGACTCGT CAAAAAGTGT CAACTGAACA 13560 

TCACTCTTTT CAGCAAGGTC ATAGAGGTAT TCGATGTCAG CTGGACTCAG TTCTTTCCAG 13620 

TCAACTAGAC TCCAATCACT GGTCTGGTGA GTTGAACAAC CGTTGTTAAC AATAATATAT 13680 

TCGTTCTGGA GGTCAAGCTC CAGTTTTTTG TAGTAGGGGA GGACACCGAA AAGGGGGCGA 13740 

CCCGTACAGA GAACCAGTTT GACACCTTTT TCAATGGCTT TGTGAATAGC AGTAATGTGT 13800 

GCTTGTGGGA TTTCCTTGGC TTCATTGAGG AGGGTGCCGT CCATATCCAA GGCTAGTAGT 13860 

TTAATCATAG GTCTTCCTCT TTATCTTTGC TATTATTATA GCATATTTTG GAGAAGAAAT 13920 

TGATAGAAAG CTTGAGACTA ATTGATTTTA TAGTTTAAGA TGTTTTGATG ACAATTCATG 13980 

ATTTGAAGAG GATATTTCGC AAAGATATGC TATACTATGT TTGTCAATGT TGCAACTAGA 14040 

CAAATTAAAA AACCAACTTA ATATAATAGT TTTTTTGTAA GTAGGTATGA GTAGCAGATT 14100 

ACTCAACTAA TCTGAAGAAT AATGGAGGAA ATATATCATG ATTTTAATGA CAAAAAATAT 14160 

AAATCTAACA AATGAAGAAT TAGAGCTGAT ACAAGGTGGA GCAGATCCAT ATGGTAAAAA 14220 

TCCTAATGGT AGGTACGATT GGGAAATAGA ACCAGTATTA ACTCTGCTGG TTCATGGATT 14280 

TTGTCCCAGA GGCACCTATG ATTCAGGATA TATTGGAGGA GGTAATCATC TTTGCAAAGG 14340 

AAGTGCTGCG AGATTTTAAG TAAAATTTAT TAGGAATATG AAGAAACAAG GGGAGAAAAC 14400 

AGAGGATTTA ATATGAAAAA ACGAGCTATT CAAATTTTAC TAGCATTGTC CTTAATTTTT 14460 

TACAAATCAA CTTGGTTTTG GAGGCTTTTC AATTATCTCG CAAAGCCCTA TCTACCAGCA 14520 

AGTCGTGAAT TTTTTCAGAT TCTGCTTTTG ATGGAGAGCG GAGTTCTTTT CTTAGCGGTC 14580 
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ATCTATCTAC TGGTTTTTGC AGGAAAGAAA ATTTTTCATT TCAAGTGGCA GCTGAGGTAC 14640 

TTCATCTACC TTTTACTGGG CTACATCATT TCATATATGT CTGACTTCCT CTTTTCGTAT 14700 

TTCATATCCC TGTCTTCAAA TCAGATTTCT TTGAATGAAA CGGTAGAAAT GATGGGGAGA 14760 

CAGGAGTTCC CTTATGTCTT GCTCATCGTT TGCTTCATCG CCCCTATTGC TGAGGAATTG 14820 

ATTTATCGAG GtGTGCTTAT GACAACCTGT TGCAAAAACT CACCTTGGTA CG 14872 



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10223 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

CGTGCTATCG GTCTCAAAAC CAATCTGGTC GCTATGGTCA AATCCAGTTG GAAAATCCAT 60 

TCTTCTTGGA GCCATCTGCT GGATTGCCAT CATCCTCACC ACTCTTGGTA TGCAGACCCT 120 

TATCGGCATT TTCTAATACT CTTCGAAAAT CTCTTCAAAC CACGTCAACG TCGCCTTGCC 180 

GTAGGTATAT GTTACTGACT TCGTCAGTTC TATCTGCAAC CTCAAAACGG TGTTTGAGCT 240 

GACTTCGTCA GTTCTATCTG CAACCTCAAA ACGGTGTTTT GAGCTGACTT CGTCAGTCGT 300 

ATCTACAACC TCAAAACAGT GTTTTGAGCT GACTTCGTCA GTTCTATCTG CAACCTCAAA 360 

ACAGTGTTTT GAGCAGCCCG TGGCTAGTTT CCTAGTTTGC TCTTTGATTT TCATTGAGTA 420 

TAACACAAAA GGTAGCCCAT CAGCTACCTT TTTCTTATGC TTCCTCAATC AAGCGAGTAT 480 

GTTCTCTCTT GATACAGCGA TTCATCACGA TATCATCACA TCCACCATCA CGCAAAATCT 540 

CTTTCGCTTC TAAACTTTCA AGTCCTAGCT GTGCCCAAAA AATCTTGGCA TCAGCTTTGA 600 

GAAAATCACG CGCCACATCG GGCAGAAATT CACTGCGACG ATAAACATTG ACAATATCTA 660 

CAGGAAAAGG AATTTCAGCG AGGCTAGCAT AAGCCTTTTC ACCCAAGATT TCGCCACCTG 720 

CCGCCTTGGG ATTGACTGGG ATGATTTTAT AGCCCCGAGC CTGCATTTCC TTTGTTACTC 780 

GATTGCTGGT TGTTTCTTCA CGGTCAGACA AACCCACCAC AGCAAGGGTT TTACTCGTTG 840 

CGAGATACTG ACGAATCACG CCATCACTTG GATTGATAAA TTCTTGACTC ATAGAAATCC 900 

TCCTTTTTCA TCAGTATAGC ACATTTTGAA AAGGTTTGCA GAATTATACT ACAAAAAAGG 960 

AGGACTAGCC CCCTTTTTAT TTAGCCTCGT ACCAGGTTGC CCCTTCATTC TCATCTGCGA 1020 

TAAGAGGAAC ACTGAGTTGA ATGGCTTCTT CCATGGTTTG TTTCACCAAT TTTTTCATCT 1080 

CTACCAATTC AGATTTAGGC ACTTCAAGGA CGATTTCATC GTGCACTTGT AACAGCATCT 1140 
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TAGTCTGATA ACCACCTGCA ACCAAGGCTT TATCCAGCTG AATCATGGCA ATCTTGAGAA 1200 

TATCTGCTGC CGAACCCTGG ATAGGTGAGT TGATAGCAGT TCGCTCCGCA AAACCACGAA 1260 

TATTGAAGTT GCGCGAATTG ATATCTGGCA ACTCACGGCG ACGCTTAAAG AGGGTCTCTA 1320 

CATAGCCCTT ATCACGCGCC TCCCGCACCA CTTCATCCAT GTAGTTTTTA ATACCTGGAA 1380 

AACGTTCAAA GTAGGTATCA ATGTAGGCTT TGGCTTCCTT ACGACTAATT CCCAAATTAT 1440 

TAGACAAGCC AAAGTCTGAA ATCCCATAAA CCACTCCAAA GTTAACTGCC TTGGCATTGC 1500 

GACGGTCGTT TGCAGTCACA TCATCAGGAC GCTCAATGCC AAAGACCCGC ATGGCTGTCG 1560 

AAGTATGGAT ATCTGCCCCC TCTTGGAAGG CCTTAATCAA GTGCTCATCC TTAGAAATAT 1620 

GCGCCAAAAC GCGCAATTCA ATCTGTGAAT AGTCAGAGCT GAGTAGCACA CTATCCTCCC 1680 

ACTCTGGCAC AAAAGCCTTC CGAATCAAGC GCCCCTGTTC CAATCGGGCA GGAATATTTT 1740 

GCAAGTTTGG ATCCACACTA GACAAACGCC CGGTCTGGGT CAAATCCTGC ACATAGCGAG 1800 

TATGAATCTT TCCATCAGCC AAAATCCAGT CCTGCAAGCC AATTACATAA GTAGATTGAA 1860 

TCTTAGCAAT TTGACGGTAA TCCAGGATTT TCTTAACAAT CGGAGCAATA GGAGCGAGAC 1920 

GCTCTAAAAC ATCCACTGCT GTCGAATAAC CTGTCTTGGT TTTCTTAGTG TATTCTAGAG 1980 

GAAGTCCCAA TTTCTCAAAG AGAAGCACGC CCAACTGCTT AGGCGAGTTG ACATTAAACT -2040 

CCTCACCAGC CAGCTCGTAA ATCTCTTGAG TCAGTTTTTC AATGACAAGC TCATTTTCAG 2100 

CCTGCATCTC AAGCAAGGTC TCTTTCTTGA CCATAATCCC AGCAATTTCC ATCTTGGCAA 2160 

GGACAAAAGC CAGAGGTTGC TCCATATCAT AAAGAAGCTC TAATTGCCCA TTTTCGCTGA 2220 

GTTTTTCAAG TAAAATAGGC TCTGTTTCTA CCAAAACAGC AAGTTTACAA GCTAAGTGTT 2280 

CCAAGAATTT CTCACGTTCA GGAATGGCCT TTTTAACACC CTTACCGTAG AAAGTTTCAT 2340 

CATCAACCAA GTAAGTCTGA CCATAAAGAC TAGCGATGGT CGCAATTTCA TTGTCCTCCA 2400 

CAGTCGAAAG GAGGTATTTA GCCAAACGGA TGTCAAAAGC AGGCGCCTGC AAATCCACAC 2460 

CAAAACGTTG CAAAAGAACT TTAACCTTCT TAAAGTCATA AACTCTCAGA GATGTTTTTT 2520 

CTAAGAAATC CTTGAAAATC GGGTCTTGCA ACAGCTCAAG CTTGTCTGTG GCATAGAGCT 2580 

TATCCCCACA AGACCAGACA AATCCAACCA AATTATCCGT ATGGTAATTC TCACCAAAAA 2640 

GCTCAAAGTG GAAGATAGAC TCTTCACTCA GCATATCTTG ACTGATTTGG TCAACAATAG 2700 

TAAAATCCAA ACTCTCAGAC ACATCAGCTG ACGACACATT TAAAGCCTGC TTTAGCTGTT 2760 

TGAAGCCCAT CTCATCGTAG AATTTCCCAA GATTTTCAAC ATCTGGACCA CTATAGACCA 2820 

AGTCCTCTAA ACCAATCGCA ATCGGTGCCT TGGTATCAAT GGTCGCTAGT GTTTTAGACA 2880 
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AAAAGGCCTG TTCCTTGTCA TTGATGAGAT TTTCCTTCAT CTTAGAAGTC TTCATTCCAT 2940 

CAATATTTTC ATAAATCCCC TCAAGCGAAC CATGCTCCAG CAAGAGCTTA ATACCCGTCT 3000 

TTTCACCGAC TTTGGTCACC CCAGGGATAT TATCCGACTT ATCACCCATG AGCGCCTTGA 3060 

GATCGATAAA CTGAGCTGGT GTGAGGCCCA TTTCTTCCAT GAGGTAATCT GGCGTAAAGG 3120 

CCTCAAACTC AGCCACACCT TTCTTGGAAA TTTCAACCAC CGTATGCTCA TCCGTCAGCT 3180 

GAATCAAATC CTTGTCCCCA CTGACAATAG TAATATCAAA ACCATCCTGC TCTGCTAGCT 3240 

TATCCAGCGT CCCAATGATG TCATCCGCCT CATACTGAGC CAGATCATAG TGACGAATCC 3300 

CCATATGATC CAGCAACTCA CGAATGAAAG GAAATTGCTC ACGAAACTCA TCAGGAGTCT 3360 

TGGCCCGACC ACCCTTATAG TCCGCATACA TCTCTGTCCG GAAGGTCGTC TTTCCCGCAT 3420 

CAAAAGCCAC CAAAATATGA CTCGGCTCAA CGCGCTCCAA TAAATGACTC AACATCAACT 3480 

GAAAACCATA AATCGCATTG GTATGCAAAC CAGCCACATT CTTAAAACGG TCCAACTGCT 3540 

GATACAGCGC AAAAAACGCC CGAAAAGCTA CAGAAGACCC ATCAATCAAT AATAATTTTT 3600 

TCTTATCCAT ACACCCATTA TAAAGGAAAG AATCAAAAAA TACCATTGGG AAGAGCTAGA 3660 

GCAAGTATTT TTCAAACTTT TTCCGAATAA ATAGATAGAG CCAGAGAATT TAGTAAACCT 3720 

AGATTTAAAA ATGTGCTATA ATATAGTATA TTGAATCTAT AATAGTACAC CTTGACTGCT 3780 

AAAATATTTC TATAAATTAA TTTGACTTTC CTGATAGAGT TATTCACATC TTATTTCAAC 3840 

TCACTATAGA AGGAGGAATA GGAGGATTCT CAGACATCCG GGCATCAGCC CAACTAATGA 3900 

TTTGATTGCT AAGAAAATAT TCAGCAATCC AGAAATCACT TGTCAATTTA TTCGCGATAT 3960 

GCTGGACTTG CCAGCAAAAA ATGTGACCAT TTTGGAGGGA AGCGATATTC ACGTATTACT 4020 

CTCCATGCCT TACTCGGTGC AGGATTTTTA TACCAGTATA GACGTCTTGG CGGAGTTGGA 4080 

TAACGGTACT CAAGTAATTA TTGAGATTCA AGTCCATCAT CAGAATTTTT TCATCAATCA 4140 

CTTGTGGGCT TACCTGTGCA GTCAGGTTAA TCAAAATCTT GAAAAAATTC GTCAGCGAGA 4200 

AGGTGATACT CACTAGAGCT ACAAACACAT CGCTCCTGTT TACGCCATTG CTATCGTGGA 4260 

TAGTAATTAT TTCTCAGATG ACCTGGCTTT TCATAGCTTT AGTATGCGCG AAGACACAAC 4320 

AGGTGAGGTA TTGGCGATTA CCAACAATGG ACAGGAAAAC CATCTGGTTA AGATGGCATT 4380 

CTTGGAATTA AAAAATACAG AGAAACCAGC AAAGACAAGG TTCGCAAGCC ATGGTTGGAG 4440 

TTTTTCGGCA ACAAGCCCTT TACCCAGCAA CCGCAACGAG CCATTACCCA AGCAAATCAA 4500 

CTGCTGGACT ACAAGAGCTG GTCCGAGGAG GACAGGAAAA TGTTTAGTCA ACTACATATG 4560 

CGAGAAGAAC AAGTCTTGTT AGCACAGGAC TATGCCTTGG AAACTGCTAG GGCTGAAGGC 4620 

CTTGAACAAG GACTAGAGCG TGGGAAAGTT GAAGGAAGGG CAGAAAGGAA ACTTTTTGCC 4680 
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TTCCTAGACA TAGTACGCCA AGGTCTTCTG ACTTCTGAGG TTGCCAGCCA GCAATTAGGT 4740 

ATGTCAGTAT CTGAATTTGA GGCACTGTTG TAAAATGGCT CCATAATATC CATAGTGGGT 4800 

AAATCCCCTA TGGATATTAT GGAGCCTATT TTGTGTAGAA AAAAAGTCCC ATATGACCTA 4860 

TAATGAAAAG CGACAAAACA ACTCATTAGA AAGAATCATA TGGAACAATT ACATTTTATC 4920 

ACAAAATTAC TAGACATTAA AGACCCTAAT GTCCAGATTT TAAACATCAT CAATAAGGAT 4980 

ACACACAAGG AAATCATCGC CAAACTGGAC TACGACGCCC CATCTTGCCC TGAGTGCGGA 5040 

AACCAATTGA AGAAATATGA CTTTCAAAAA CCTTCTAAAA TTCCTTATCT TGAAACGACT 5100 

GGTATGCCTA CAAGAATTCT CCTTAGAAAG CGTCGATTCA AGTGCTATCA CTGTTCAAAA 5160 

ATGATGGTCG CTGAAACTTC TGATGACGTA CAGTCATATT TCTTCTCTTT TTATTATATC 5220 

ACAGTTTTAA ATCTAGCTTT ACTAGATTCA CCGCTACTAT CTATTTATTC GGAAAAAAGA 5280 

CGAAAAAACC TGAGAATCAT CTCAGGCTTG GTCATTAAAT TTTTTTCTCA ATATCGAAAA 5340 

GTGGAGAAAG TGGTCGTTTT TCATGAATAC GTACGATAGC ATCCCCTAGG AGATGAGCGA 5400 

TTGAAATCTG CTCAATCTTA TCAATCAAAC GCTCTTCTGG CAGATAGATG GTATCCAAAA 54 60 

CAACCAATTT CTTAATAGCT GATTTTTGGA TATTGTCCGT AGCAGGACCA GAAAGAACTG 5520 

GGTGCGTACA GCTTGCATAG ACTTCAACAG CACCAGCTTC CGCAAGAGCA TCTGCCGCAT 5580 

GACAAATCGT TCCAGCGGTA TCAATCATAT CATCAATCAA GATACAAGTC TTGCCTTCAA 5640 

CCTTACCGAT GATATTCATA ACTTCACTAG TATTCATCTT ATCAACGCTA CGACGTTTAT 5700 

CAATAATAGC GATAGATGTT TTCAAAAATT CTGCCAACTT ACGAGCACGA GTCACCCCTC 5760 

CATGGTCCGG GCTGACAACC ACATAGTCAG AACCAACCAT ACCACGACGC TCAAAATAAT 5820 

CTGCAATCAG AGGAGCACCC ATCAAATGAT CCACAGGAAT ATCAAAGAAT CCTTGAATTT 5880 

GCGCAGCATG CAAGTCGATG GTCAATAAAC GATCCACTCC AGCTACTTCA AGCATATTTG 5940 

CGACAAGTTT TGAAGTGATT GGCTCACGCG CTCTCGCCTT TCTATCCTGA CGTGCATACC 6000 

CATAGTAAGG CATGACAACA TTGACAGATT CTGCACTCGC ACGCTTCAAA GCATCTACCA 6060 

TAATCAAAAT TTCAAGCAGA TTGTCATTTA CAGGCGAACT AGTTGATTGT AAGATAAAGA 6120 

CGTGTTTCCC ACGGATTGAT TCTTCAATGT TGACCTGAAT CTCTCCATCT GAAAATTGGC 6180 

GAACACTTGA TTTCCCCAAC TCTATCCCAA TCTCCTGCGC CACACGTTCT GCCAATTCTT 6240 

TATTAGAAGA AAGGGCAAAC AGCTTTAAAT CAGAAAAAGA CATGATTTCC TCCGGTATAT 6300 

ATGTATAACT TGTGCTTTTC ACAAGATTTT CCATCTACCA TTGTAGCGCT TTTTGCACTA 6360 

TTTTTCAATC AAAAATAAAA GAAGGGCACC ATATTTGTAC CCTTGCATCA TTCTTTTGAA 6420 
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AAATATTCTA GGTCATCAAC TCATTGTGTT TCTCAACAAA GCAATAAGCA TGATAAAAAC 6480 

CATAGAGAGC AATAGCCGTA ACCACTGGAA TCGCTAAAGG CAACTCTGTT TCCAACTCCA 6540 

CAAAAGGAGA GTTAAACAAG AAGTGAGTTC CCAAGGCTAA ACCTAGAAAA ATAAGGCCCT 6600 

GTTTCTTGCC AACCTTCTGT CCTTTATAGG CTCTGTAAAG CAAGTAAACA CCTACTACAG 6660 

CTAGACCTGA AAAAGTCCAG TGAGAGGCAA TTCCTGAGAT GATACGCTCT AAAATTCGCG 6720 

AAATAGTAAA GTCAAAGCCC TCTGGCAAAT CCGTACGAAT ATAACCAATA TCCTTAATCA 6780 

TTTGGAATCC CAAACCGGAA GCAATTCCAA GTAAAAACAA AGATTTTAAT TTTCGCACAG 6840 

GAATCAAAGC CAAAACAAAA ACAAGTGACA ATAATTTCAA GGGTTCTTCT ACCAAAGGAG 6900 

CCGCAATAGC ACTTTCAAAG GCATTTAAAA ATGGACTATC TGGGAAAAGA ACCCCCAGTA 6960 

AATCATGGAT ATAAGTATTA GCAAAACTAG ACAACCAGCC TGAAAGGAAC ATCCCTCCCA 7020 

ATAAAGACAG AATCAAAACC TTCTTTGGCA ATTCCCATTT TTCCCAATAC GGAAGAGAAA 7080 

ATAAAGAGCC GGAATCATGT AAAAGAGAGC TAGAAAGATA GAAACTCCCA TTAGTCCATA 7140 

TTCCGCACCT GACCTCGAAC CGTCCGTATA GTAGATGGTT TCATACTGTA AACCAATACA 7200 

TAGCAATAAA ATAAAAATAA ATAAAATATT GCTTTTCTTC ATACACTTTC TTTCTAAATG 7260 

AAGTATTTAT AATTCTACGA CTGTCATACT TCCTGTATCA ACATTGTAAA TGGCACCAGA 7320 

GATAATGACA TCGTCTGGTA TTAGGGGAGA CTCGATAAGC AGTTGCATAT CCTCGCGTAC 7380 

ACTCTCTTCT ATATCTTGGA AGGGCAAGAA GTCCTGGTCT GACACATCGA CACCCAATTC 7440 

TTCCTTCAAA TACTCCTGAA AAGGTTCATT TTCAAAGGTC TGAGCACCAC AGTCTGTATG 7500 

ATGCAATACC ACAATTTCTC TTGTCCCCAT TTGTTGCTGG GAAATAACTA GAGAACGAAT 7560 

CATATCCTCA GTCACTCGAC CACCTGCATT CCGCAAAATA TGAGCATCCC CAAGTGCCAA 7620 

ACCTAGAGCT TGCGCAACGT GCAAACGTGA GTCCATACAG GTCACAATGG CTACTCTGGT 7680 

TTTAGGTTTA AGTGGCAGAT TTAACTGCCC ATGTAGGGCA ACATAAGCCT GATTGGCTTG 7740 

CATAAACTGT TCAAAATACG ACACGATTCC CTCCTTGAAA ATTTGATAGT CAAATATTTC 7800 

TCCTATCTTA TCATTTTTAA GAGAATTTGT CACGGATTAT GCAAAGACCT TTTTCAAGAC 7860 

TTCCTGAATC GTTGTCACGC CAATGACCTG AATTTCCTTA GGCAGAGTGA TTCCTGTCAA 7920 

GGAATTCTTA GGTACATAAA TCTTAGTAAA GCCCAGTTTA GCAGCTTCGT TGATGCGTTG 7980 

CTCAATACGA TTCACGCGCC GAATCTCTCC TGTCAAGCCC AGTTCTCCGA CAAAACATTC 8040 

CTGAGGATTA GTTGGCTTGT CTTTGTAGCT CGAAGCAATA GCAACTGCAA CAGCCAAGTC 8100 

AATCGCAGGT TCATCCAATT TAACACCACC AGCAGATTTG AGATAGGCAT CCTGATTTTG 8160 

CAAGAGAAGC CCTGCCCGTT TTTCCAAAAC AGCCATAATC AAGCTAGCAC GGTTAAAATC 8220 
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AAGTCCTGTC GTAGTACGCT TGGCATTTCC AAACATGGTC GGTGTTACCA AAGCCTGAAC 8280 

CTCCGCCAAA ATCGGACGCG TCCCTTCCAT GGTTACAACG ATGGAGGAAC CAGTCGCCCC 8340 

ATCCAAACGC TCTTCTAGGA AAACTTGACT CGGATTGAGT ACCTCAACCA AGCCGCCCGA 8400 

CTGCATCTCA AAAATCCCAA TCTCATTAGT GGAACCAAAA CGATTTTTGA CGGCTCTCAA 8460 

AATACGAAAG GTGTGGTGAC GCTCCCCTTC AAAGTAAAGC ACCGTATCCA CCATATGCTC 8520 

CAACATACGA GGCCCAGCCA AGGTTCCTTC TTTGGTCACA TGACCTACGA TAAAGATGGC 8580 

AATGTTATTG GTCTTGGCCA ACTGCATGAG TTCAGCGGTC ACTTCACGCA CCTGAGAAAC 8640 

AGACCCCTGC ACCCCTGAAA TCTCAGGAGA CATGATGGTC TGGATGGAAT CAATAATGAG 8700 

AAAGTCTGGC TGGATACGCT CCACTTCTGC ACGAACACTC TGCATATTGG TCTCTGCATA 8760 

GAGATAAAAC TCACTATCAA TATCACCTAA GCGCTCTGCA CGTAGTTTAA TCTGCTGGGC 8820 

AGACTCCTCC CCACTGACAT AGAGAACTGT CCCCACTTGG GACAACTGGG TTGAGACTTG 8880 

TAGGAGAAGA GTTGATTTCC CAATCCCAGG ATCCCCACCG ATAAGGACGA GACTTCCTGG 8940 

TACCACTCCG CCTCCAAGCA CACGGTTGAA TTCCTCCATC TCCGTCTTGG TTCGATTGAC 9000 

ATTGATGGAA GTCACCTCAG CTAGTTTCAT GGGCTTGGTT TTCTCACCTG TCAAGGACAC 9060 

ACGCGCATTC TTAACTTCGG CAACCTCAAC CTCTTCCACA AAAGAAGACC AAGACCCACA 9120 

GTTGGGGCAA CGTCCCAGAT ATTTAGGGGA ATTATACCCA CAATTTTGAC ATACAAATGT 9180 

CGCTTTTTTC TTTGCGATGA CAAACCTCTT TCTATATCTC TAACTCACAC TCAATCACTT 9240 

GGCAAAAATC AATCTTCTCA TTTGG C AC AA ACTGGCGCAT GAGCATTCGA TGAGCAACAA 9300 

CTACCACAGT CTGATGTTCT CGATACTTAG ACATACATTC TAGAAACCGA GACTTCATTT 9360 

CCGTAGCTGT CTCATATTGA ATAGGACTAT TAGGAAGCAA CTCCCCCTTG TTTTCTAAAA 9420 

ACAGTCTTCT AGCTGTTTCA AAGTTTTCTA TTCCTGTTTT ATAGACCTGC CATTCATGTA 9480 

ATAAAGGCTC TACTCTTAAA GGAAGACCCG TAGCACAGAC CACATACGAA GCCGTTTCTA 9540 

AAGCTCTTGT GACTGCAGAA GATACGATTA TTTCAGCTGA CGAGAGTAAA GGATTTTTGC 9600 

TCAATTTCTG GACTTGCTGC CGTCCCATCT CAGACAAGGG TGCCAAATCT ATCCCAAATC 9660 

CTATATAAGA ACGCTCCTCT AACTCACGGT AATCTGGCTC CCCATGACGT ACAAAGATAA 9720 

TCTTCATTCT AGTGCCCTGT CGATCCAAAT CCACCAGTTC GAACGCCATC AGCTGCATCT 9780 

CCATCTGCAA TTAAGAAAGT AGCAAAAACA GCCTGGACAA TACGCTCCCC AACTTCAAGA 9840 

ACAACCTCTT GGTCTGTGAT ATTCTTCATC TGCGCAAAAA TATGCCCTTC ATTTCCAGGA 9900 

TTTCCATAAT AATCCCCATC AATGACTCCA ACTGAGTTAA TTAAAACCAA GCCCTTCTTA 9960 
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CGAGGATTTG AAGAACGATC ATAGAGGTAG AGAACCTCAG TCGGCTGCAT ATAAGCCTTA 10020 

ACCCCTGTCG GAACCAAGAC AATCTCTCCT GGCGCAACAA CTGTACGCAC AGCAACCTTT 10080 

AAGTCGTAAC CAGTCGCATG CGCTGTCTCA CGCTTGGGCA ATAAATTTTC ATCTGTAAAA 10140 

CTCGAAACCA ATTCAAAACC ACGAATTTTC ATAATTTTCT CTTTTCTATT ATCATTTATT 10200 

CTAGATTATT CTATACTTAT TTA 10223 



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16535 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

TGGTTCTGTC CTTATCGGCG CCTTGTCTTG CTTGCCATGG CTACACCAAC TATCTCATCC 60 

GACGAAAGTA CACCAACCAC TAACGAACCC AACAACAGAA ATACAACCAC CCTTGCCCAA 120 

CCTCTTACTG ATACAGCAGC TGGCTCTGGT AAGAACGAAA GTGATATTTC TTCACCTGGA 180 

AATGCAAACG CTTCCCTAGA GAAAACAGAA GAAAAACCTG CTGCAAGCCC AGCCGATCCA 240 

GCACCACAAA CTGGACAAGA TCGTTCAAGT GAGCCAACTA CTTCTACTAG TCCAGTAACA 300 

ACTGAAACTA AGGCAGAAGA GCCCATCGAA GATAACTACT TCCGTATCCA TGTCAAAAAA 360 

CTTCCTGAAG AAAACAAGGA TGCTCAAGGA CTATGGACTT GGGACGATGT TGAAAAACCA 420 

TCTGAAAACT GGCCAAACGG AGCTTTGTCC TTCAAGGATG CCAAGAAAGA TGACTACGGC 480 

TATTACCTAG ATGTCAAATT AAAGGGAGAA CAAGCCAAGA AAATTAGCTT CCTCATCAAC 540 

AATACAGCTG GAAAAAATCT AACCGGCGAT AAATCTGTAG AAAAACTAGT TCCAAAAATG 600 

AACGAAGCTT GGTTAGACCA AGATTACAAG GTTTTCTCTT ACGAGCCACA GCCTGCAGGA 660 

ACTGTTCGCG TCAACTACTA CCGCACAGAT GGCAACTATG ACAAGAAATC TCTCTGGTAC 720 

TGGGGAGATG TGAAAAATCC AAGTAGCGCT CAATGGCCTG ACGGAACAGA CTTTACGGCT 780. 

ACAGGCAAAT ATGGCCGCTA TATCGACATT CCTCTTAATG AAGCCGCAAG AGAATTTGGA 840 

TTTTTATTAC TAGATGAGAG CAAACAAGGA GACGACGTGA AAATCCGTAA AGAAAATTAT 900 

AAGTTCACAG ATTTGAAAAA TCATAGCCAA ATTTTCCTAA AAGACGATGA TGAATCGATT 960 

TACACAAATC CATACTATGT CCATGATATC CGTATGACAG GAGCCCAACA CGTAGGCACT 1020 

TCTAGCATTG AAAGTAGCTT TTCAACACTT GTCGGTGCTA AAAAAGAAGA TATCCTCAAA 1080 

CACTCCAACA TCACTAATCA CCTAGGAAAC AAGGTAACTA TTACCGATGT TGCAATCGAT 1140 
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GAAGCTGGTA AGAAAGTGAC CTACAGCGGA GATTTCTCTG ACACAAAACA TCCTTATACT 1200 

GTTAGCTACA ATTCCGACCA ATTCACTACC AAAACAAGCT GGCGCCTGAA AGATGAGACA 1260 

TACAGCTATG ATGGCAAACT GGGAGCTGAC CTAAAAGAAG AAGGAAAACA AGTTGATTTG 1320 

ACCCTTTGGT CACCAAGTGC TGATAAGGTT TCTGTTGTTG TCTACGACAA GAATGACCCT 1380 

GACAAAGTAG TTGGAACTGT CGCTCTTGAA AAAGGGGAAA GAGGAACTTG GAAACAAACT 1440 

CTAGACAGCA CAAACAAACT CGGAATCACA GATTTCACTG GCTACTATTA TCAATACCAA 1500 

ATCGAGCGTC AAGGTAAAAC TGTTCTTGCA CTCGATCCTT ACGCTAAATC TCTTGCTGCT 1560 

TGGAATAGCG ACGATTCCAA GATTGACGAT GCCCATAAAG TGGCTAAAGC CGCCTTTGTA 1620 

GATCCAGCTA AACTCGGACC TCAAGACTTG ACTTATGGTA AGATTCACAA TTTCAAGACT 1680 

CGTGAAGACG CCGTTATCTA CGAAGCTCAT GTGCGTGATT TCACTTCAGA TCCTGCCATT 1740 

GCAAAAGACT TGACCAAACC ATTTGGGACT TTTGAAGCCT TCATTGAAAA ACTAGACTAT 1800 

CTCAAAGACT TGGGTGTAAC CCATATCCAG CTCCTTCCAG TCTTGTCTTA CTACTTTGTC 1860 

AATGAATTGA AAAACCATGA ACGCTTGTCT GACTACGCTT CAAGCAACAG CAACTACAAC 1920 

TGGGGATATG ACCCTCAAAA CTACTTCTCC TTGACTGGTA TGTACTCAAG CGATCCTAAG 1980 

AATCCAGAAA AACGAATCGC AGAATTTAAA AACCTCATCA ACGAAATCCA CAAACGTGGT 2040 

ATGGGAGCTA TCCTAGATGT CGTTTATAAC CACACAGCCA AAGTCGATCT CTTTGAAGAT 2100 

TTGGAACCAA ACTACTACCA CTTTATGGAT GCCGATGGCA CACCTCGAAC TAGCTTTGGT 2160 

GGTGGACGCT TGGGGACAAC CCACCATATG ACCAAACGGC TCCTAATTGA CTCTATCAAA 2220 

TACCTAGTTG ATACCTACAA AGTGGATGGC TTCCGTTTCG ATATGATGGG AGACCATGAC 2280 

GCCGCTTCTA TCGAAGAAGC TTACAAGGCT GCACGCGCCC TCAATCCAAA CCTCATCATG 2340 

CTTGGTGAAG GTTGGAGAAC CTATGCCGGT GATGAAAACA TGCCTACTAA AGCTGCTGAC 2400 

CAAGATTGGA TGAAACATAC CGATACTGTC GCTGTCTTTT CAGATGACAT CCGTAACAAC 2460 

CTCAAATCTG GTTATCCAAA CGAAGGTCAA CCTGCCTTTA TCACAGGTGG CAAGCGTGAT 2520 

GTCAACACCA TCTTTAAAAA TCTCATTGCT CAACCAACTA ACTTTGAAGC TGACAGCCCT 2580 

GGAGATGTCA TCCAATACAT CGCAGCCCAT GATAACTTGA CCCTCTTTGA CATCATTGCC 2640 

CAGTCTATCA AAAAAGACCC AAGCAAGGCT GAGAACTATG CTGAAATCCA CCGTCGTTTA 2700 

CGACTTGGAA ATCTCATGGT CTTGACAGCT CAAGGAACTC CATTTATCCA CTCCGGTCAG 2760 

GAATATGGAC GTACTAAACA ATTCCGTGAC CCAGCCTACA AGACTCCAGT AGCAGAGGAT 2820 

AAGGTTCCAA ACAAATCTCA CTTGTTGCGT GATAAGGACG GCAACCCATT TGACTATCCT 2880 
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TACTTCATCC 


ATGACTCTTA 


CGATTCTAGT 


GATGCAGTCA 


ACAAGTTTGA 


CTGGACTAAG 


2940 


GCTACAGATG 


GTAAAGCTTA 


TCCTGAAAAT 


GTCAAGAGCC 


GTGACTATAT 


GAAAGGTTTG 


3000 


ATTGCCCTTC 


GTCAATCTAC 


AGATGCCTTC 


CGACTTAAGA 


GTCTTCAAGA 


TATCAAAGAC 


3060 


CGTGTCCACC 


TCATCACTGT 


CCCAGGCCAA 


AATGGTGTGG 


AAAAAGAGGA 


TGTAGTGATT 
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TCACTGCTCC 


AAACGGCGAT 


ATCTACGCAG 


TCTTTGTCAA 


TGCGGATGAA 
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AAAGCTCGCG 


AATTTAATTT 
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ACCAAGCAGG 
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ACTAGCCATG 


AGTCAACTGC 


AGAAGAGAAA 


CCAGACTCAA 


CCCCTTCCAA 


GCCTGAACAT 


3420 


CAAAATGAAG 


CTTCTCACCC 


TGCACATCAA 


GACCCAGCTC 


CAGAAGCTAG 


ACCTGATTCT 


3480 


ACTAAACCAG 


ATGCCAAAGT AGCTGATGCG GAAAATAAAC CTAGCCAAGC 


TACAGCTGAT 


3540 


TCACAAGCTG 


AACAACCAGC 


ACAAGAAGCA 


CAAGCATCAT 


CTGTAAAAGA 


AGCGGTTCGA 
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AACGAATCGQ 


TAGAAAACTC 


TAGCAAGGAA 


AATATACCTG 


CAACCCCAGA 


TAAACAAGCT 


3660 


GAACTTCCAA 


ATACAGGAAT 


CAAAAACGAA 


AACAAACTCC 


TATTTGCAGG 


AATCAGCCTC 


3720 


CTTGCGCTCC 


TTGGTCTCGG 


TTTCTTACTA AAAAATAAAA AAGAGAACTA 


AACTAGCCCT 


3780 


CCTATAGAAA 


AATCCCCCAA 


GCATTATAGC 


TCGGGGGATT 


AATTTTTGTA 


CAATATTTGT 


3840 


TGTCCTAATA 


AACTTGATTA 


GGATTTTTTA 


TTAAGCCTCT 


TTCATAGCAA 


AATAAGCTCG 


3900 


TACTTTGGGT 


GCAACTTGTG 


TTCCGAAGAG 


TTCAATAGCT 


CTCAGAACCT 


GGTCATGAGG 


3960 


CATAGAACCA 


AGCGGTAGAT 


GAAGCATGAA 


GCGGTCCAAT 


CCTAAATCCT 


CTATCATGCG 


4020 


AATCAATTTT 


TCGGCCACCT 


GATCTGGATT 


GCCAACAAAC 


ATGGCX3CCAT 


TTGGCCCTAC 


4080 


CTGCTCCAAA 


TATTGCTCAT 


AACGCAATTC 


CTGCCAGTGC 


GGACGGTCTT 


TGGAAATAGC 


4140 


ATCCACCACT 


TGCTTAGTCG GATGGAAATA ATCTTTCACC GCCTGCTCAC 


CATCTTCCGC 


4200 


AATCCACCCC 


CAAGAATGGG 


CTCCCACTTT 


CAAGTCTTTG 


TCAGCATGGC 


CCCTTCGCTT 
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CCAATCTCAC 


GATAAGCCTG 


AATCAACTTT 


TTAAAATAAC 


GTGGATTACC 


ACCAATAATA 
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GCATATACAA 


TCGGTAGACC 


AGCCTGAGCA ATCTTCACTG TTGATTCGAC 


ATGACCACCT 


4380 


GTAGCTATCC 


ACAAGGGCAA 


TTTGTCCTGA 


ACTGGACGAG 


GATAAACTTC 


TTTACCAGCA 


4440 


ATCGTTTGAG 


TCAATCGACC 


TTGCCAGTCT 


AACTTGGTCT 


TTTCATTGAC 


TAACTGAAGC 


4500 


AAGTCTAATT 


TCTCATCAAA 


AAGAGAGTCG 


TAGTCTTTCA 


AGTCATAACC 


AAACAGAGGG 


4560 


AAAGATTCCG 


TGAAAGAGCC 


CCTTCCAGCC 


ATAATCTCCG 


ATCGTCCATT 


TGACAAAGCA 


4620 


TCGATAGTGG 


CATACTGTTG 


GAACAAACGA 


ATCGGGTCCA 


TGCTTGACAG 


AATGCTGACT 


4680 
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GCACTGGTCA AACGGATTTT CTTGGTATTG ACTGCCCCAG CGGCCAGAAC AATCTCTGGG 4740 

GCTGATACTG CAAAATCCGC CCGATGGTGC TCACCAATCC CATATACATC CAAACCAACC 480O 

TTGTCAGCCA GCTCAATCTC TGCCACCAAC TGGCGAATGC GTTCAGCATG ACTGTAAGTT 4860 

TGTCCAGTCC CTTCAAGCTC CGTTATTTCC CCAAATGTTG AAATTCCCAA TTCTACCATT 4920 

GTGATTCTCC TTATCTATCT CTGTACTTCA ATTTGAAAAA TTATTCTAAC ACGAATCTTG 4980 

AGTACAAGCA ACCGATTTGC TCATTAGAAA AAGCCTAGAT AACTAGACTT TTTTAGCTTA 5040 

TTCTACCGTT ACTGACTTGG CAAGGTTACG TGGTTTGTCC ACATCGAGGC CACGGTGGAG 5100 

GGTTGCAAAG TAAGCGACTA ATTGCGTTGG TACGACCATT GAAATTGGTG AGAGGTATGG 5160 

ATGTACGGTC GTAAGGACGA TATCGTCGGT ATCTTTGGCT ACATTCTCTT CTGCGATAGT 5220 

GAGGACTTTG GCACCACGGG CTGCGACCTC TTGGATATTT CCACGAGTAT GATTGGCAAG 5280 

AACTGGATCT GACAAGAGAG CCAAAACAGG CGTTCCTTCT TCAATCAAGG CAATGGTTCC 5340 

GTGCTTGAGT TCTCCTGCAG CAAAGCCTTC ACACTGGATA TAAGAAATCT CTTTGAGTTT 5400 

GAGACTTGCT TCCATGGCTA CGTAGTAATC TTGACCACGT CCGATGTAAA AGGCGTTACG 5460 

AGTTGTTTCA AGAAGTTCAC GAACCTTGAC TTCAATGGTT TCTTTCTCTG AAAGAGTTGA 5520 

TTCGATAGAC TGAGCTACGA TTGACAATTC ATGAACCAGG TCAAAGGCTT GCGCTTTAGC 5580 

ATTACCATTT GCTTCTCCGA CTGCTTTTGC AAGGAAGGCA AGGGCTGCGA TTTGCGCTGT 5640 

ATAGGCTTTA GTTGATGCCA CGGCAATTTC AGGACCTGCG TGAAGGAGCA TGGTATAGTT 5700 

GGCTTCACGT GAGAGGGTTG AACCTGGAAC GTTTGTCACT GTTAAGCTTG GAATTCCCAT 5760 

TTCATTAGCC TTGACCAAAA CTTGACGACT ATCCGCTGTT TCACCAGATT GGCTGATAAA 5820 

GATGAAGAGT GGTTTCTTGC TGAGAAGTGG CATACCGTAG CCCCACTCAG ATGAGATTCC 5880 

AAGTTCAACT GGTGTATCTG TCAATTCTTC CAACATTTTC TTAGAAGCAA ATCCTGCATG 5940 

GTAAGATGTT CCAGCTGCAA GGATGTAGAT GCGGTCTGCG TCTTGAACAG CCTTAATGAT 6000 

ATCTGGGTCT ACGACAACTT GACCAGCCTC ATCTGTGTAG GCTTGGATGA GTTTCCGCAT 6060 

AACAGTTGGT TGCTCGTCAA TTTCCTTGAG CATGTAGTAA GGGTAAGTTC CCTTACCGAT 6120 

ATCTGACAAG TCAAGTTCAG CAGTGTAGCT AGCACGCTCA CGACGATTTC CATCATAGTC 6180 

TTGAACTTCC ACACTATCAG CCTTGACGAT TACCAACTCT TGGTCATGGA TTTCCATGTA 6240 

TTGGTTAGTT TCACGAATCA TAGCCATGGC GTCTGAGCAG ACCATGTTAT AGCCTTCTCC 6300 

AAGACCAATC AAAAGTGGTG ATTTATTTTT AGCTACGTAG ATGACTTCAG GATCTTGTGA 6360 

GTCAACCAAG GCAAAGGCAT AAGAACCACG GATGATGTGA AGGGCTTTTT TGAAGGCTTC 6420 



WO 98/18931 



PCT/US97/19588 



612 

AAGAACTGAG AGCCCTTCTT CTTCCGCAAA TTTTCCAATC AAATGAACGG CTATTTCAGT 6480 

ATCTGTCTGC CCCTTGAAGT GGTGACCTGC AAGGTATTCT TCCTTGATTT CAAGATAGTT 6540 

CTCAATCACC CCATTATGCA CCAAGACAAA ACGTTCCGTC TCAGAGCGGT GTGGGTGAGC 6600 

ATTGTCCTCA GTTGGTTTTC CGTGAGTAGC CCAACGAGTA TGTCCGATAC CAGTTGTTCC 6660 

CTCAACACCA GCTGTCTTGG CAGACAATTC TGCAATACGA CCAACCGCCT TCACCAAATG 6720 

GTTATCAGCA CCATCTAGGA CAAAAATTCC CGCAGAATCA TAGCCACGGT ATTCAAGCTT 6780 

TTCAAGCCCT TGAATCAAAA TATCAGTTGC ATTTGTGTTT CCAACAACAC CAACAATTCC 6840 

ACACATAGTA TATACGACAC AGGCAAGCTG TGCTTTCTCC TTAAAATTGG TATAGTCTAA 6900 

TTCATCTTTT ATAGAATCAG CAAAAACAGT ATATACTTGT TTCTTTCACT TGTCAAGAGT 6960 

AAAAATTGGT ATAGTTCAAA TTAAGCTCCT GTAAGCATAA AAACTCTGAC CGATTGGGAT 7020 

AATCAGTCAG AGTCCTTTTT AAAATCCATT ATTATCGCTT AATTCTTTGA ACCAGTGGCC 7080 

TGATTTCTTC AGACGACGTT CTTGCGTTTC CAAGTCTAAT TCGACCAAAC CATAGCGATT 7140 

TTTATAGCTG TTGAGCCATG ACCAGCAGTC AATAAAGGTC CAAATCAAGT AGCCCTTACA 7200 

GTTGGCACCA TCTTCAATGG CACGGTGAAG TTCACGAAGA TGACCTTTTA CAAAGTCAAT 7260 

ACGGTAATCA TCTTGAATCA TTCCATCTTG ACGGAATTTT TCTTCCCCTT CAACACCCAT 7320 

ACCATTCTCA GTCAACATCC ACTCAATATT GCCATAATTT TCCTTGATAT TTTGGGCGAT 7380 

GTCATAAATC CCTTGCTCAT AAATCTCCCA ACCACGGTGA GAATTGATTT TACGTCCAGG 7440 

CATCACATAA GGCTCGTAAA AATGTTCTGG TAAGAGTGGA CTCTCTGGAT GCTTAGCAAA 7500 

TCGAGGAGCC ATAACACGCA AAGGTTGATA GTAGTTCACA CCAAGGAAGT CCACCGTATT 7560 

ATCACGAATG AGTTCCAACT CTTCCTCTGT AGCATCAGGT AAAAGACCGT GTTCATGCAA 7620 

GATTTCTACC AACTCCTGTG GATAAGTCCC CAAGACAGAT GGATCTAAGA AAGATTGGGC 7680 

CTGAAAAAGG GCCGCAATAC GAGCTGCCTT GACATCAGCA GGATGCTGGC TACGTGGATA 7740 

AGCCGGTGTC AAGTTGAGGA CAATCCCAAT CTTGGAATCA GGCAAAAGTT CATGGCAAGC 7800 

CTTAACAGCC CGGCTGCTGG CCAATTGTGT ATGATAGGCT ACCTTAACAG CTGCCTCTGC 7860 

ATCCACCTTA TGTGGATAAT GGGCATCATA AAAATAACCA AATTCTACAG GAACGATGGG 7920 

CTCGTTAAAG GTAATCCATT GATCCACTAA ATCTCCATAA GTCTCAAAAC AAAAACGAGC 7980 

ATAGTCTTCA TAGGCTGAGA CTGTCGCCTT ATTTTCCCAA CCATCACCAT CCTCTTGAAG 8040 

GGCAAAAGGT AAATCAAAAT GATAGAGATT GACTAACAGA CGAATTCCTT TAGCCTTAAT 8100 

AGCCTCAAAG ACCTTACGAT AAAAATCCAC ACCTTGAGTG TTGACTTTTC CACAGCCTTG 8160 

TGGAAAAATC CGTGACCACT GAATAGAAGT CCGAAAGGCT GTGTGACCAG TCTCTAACAA 8220 
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AAGCTCAATA TCCCGCTCCC AATTTTCATA AAAAGTCGAT GTCTTATCTG AACCAATCCC 8280 

ATTATAGTAA CGATTTGGCT CCACTTGGAA CCAGTAATCC CAGAGATTGT CTCCCTTACC 8340 

GTCACCAGCT ACACGTCCTT CTGTCTGCGG TCCAGAAGTA GAGGATCCCC AGACAAAATC 8400 

CTTTGGAAAT CTTAGCATAC ATTTACCTCT TTATCTACTC ATTTCTCCCA TTATACAGAA 8460 

AAAACAAGGT AAAAACTAGT TACATTTTTT CCTTGTTTTT CTTCTGATTA TAGTTTTTAT B520 

TTCTTGCTTA GGATTTCAAG CGTTTCAAGC ACGTTATCTG CATGAACCTC AATGGTGTCA 8580 

CCAGTTGCCT TGATCTTAAC TTCTACAATG CCATCGGCCG CTTTTTTACC AACAGTGATA 8640 

CGGATTGGAA GACCAATCAA GTCACTATCG CTAAATTTAA CACCGACACG TTCGTTACGG 8700 

TCATCTGTCA AGACTTCATA ACCAGCTCCC ATCAAGCTTG CTTCAAGTTT TTCTGTCAAG 8760 

GCTTGCGCTT CTTCATCCTT GACATTGACA GTAATCAAAT GCACATCAAA TGGTGCCAAT 8820 

TCTTTAGGGA AATTGATTCC CCAAGCGTAA CGGTATTCAC CTTTTGGCGT TTTGTTAACA 8880 

AAGAGGCGAG CGTGTTGCTC CATCACTGCT GAAAGAAGAC GGCTGACACC GATACCGTAA 8940 

CATCCCATGA TGATTGGCAC AGCACGACCA TTTTCATCCA AGACATCTGC TCCCATGCTT 9000 

GCTGAATAGC GAGTTCCGAG TTTGAAAATA TGACCGATCT CAATACCACG CGCAAAGTTA 9060 

AGGACACCTT GTCCATCTGG GGAAATTTCA CCCTCACGAA CTTCACGGAT ATCCACATAT 9120 

TCTGCAGTAA AATCACGGCC TGGGTTCACA CCAGTCAAGT GGTAGTCATC TTCGTTAGCA 9180 

CCGACAACTG CATTGCGAAC ATCTTGTACC TTACGATCTG CAATAATTTT AATATTCTCT 9240 

GGCAAACCAA CTGGTCCAAG TGAACCAAAT CCTGCTTGAA CAACATTCGC CACTTCTTCT 9300 

TCGCTAGCAA CGTCAAAGAA ATCTGCTCCC AAGTGATTTT TCAACTTGAC TTCGTTGAGT 9360 

TGGTCATTTC CAACTAGAAG GGCTGCAACA AGCTCACCAT CTGCAATGTA GAAGAGGGTT 9420 

TTAATCGTTT GTTCTTCTGG AACATTGAGG AAGGCTGCAA CTTCATCAAT TGATTTAACA 9480 

TCTGGCGTTG CAACACGAGT AACTTCTTCT TCAGCGACAA CACGGTTGCT TGGTTTGTAC 9540 

TCGTTTGTTG CCATTTCTAA GTTAGCTGCA TAGCTAGACT CACTTGAGTA AGCAATGGTA 9600 

TCTTCACCAG AG ACT AT CCA TTTGAGCAAT TCTGCCTTGA TTTCTTCTTG CACTTCTGCA 9660 

GGAATTTCGT CAAATGAGGC AACTGACTTG TCCAAGACAA CCCAGCGGTC AAGGTCTGTA 9720 

CGAGCAGATG TAATGGCCAT AAATTCTTGG CTATCCTTAC CACCCATGGC TCCACCGTCA 9780 

CCAATAATAG CCTTGAAGTC TAAACCACTA CGAGTGAAAA TACGCTCATA GGCTGCTTTG 9840 

TACTCATCAT AAACACTATC CAAACTATCA TAGTTAGCGT GGAAACTATA AGCATCCTTC 9900 

ATGATAAACT CACGTGTACG AAGAAGTCCA TTACGCGGGC GTTTTTCATC ACGATACTTG 9960 
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GGCTGAATTT GATAAAGGTT GAGTGGCAAT TGCTTGTAAG ATTTAACAGA ATCACGGACA 10020 

ATAGCTGTAA AGGTTTCTTC GTGAGTTGGA CCTAAGATAA AGTCTGATTT TTCACGGTTT 10080 

TTTAGTTTGT AAAGGTCTTC ACCATAGGTT TCGTAACGAC CTGATTCACG CCACAATTCT 10140 

GCACTAAGAA GGGCTGGAGC CAACATCTCA ACAGCACCAA TCTTTTCGAA TTCTTGGCGC 10200 

ATGATGTTTT TAGCTTTTTC AATCACACGG TTGGCAAGTG GTAGATAAGA ATAAACACCT 10260 

GCTGAAACTT GGCGAACATA ACCAGCACGC AACATAAGAG CATGGCTGAT AACTTGAGCA 10320 

TCGCTTGGCA TTTCGCGAAG CGTTGGGATA GGCATTTTAC TTTGTTTCAT AATATTCCTC 10380 

GATTATCTAA AAAAGAGTCG CATAATGTCA TTCCAAGTCA CAGCAATCAT CAAGACAACC 10440 

ATGATGACCA CTCCGGCCAA GGTGACATAG GTTTCAATTT CTTGTTTCAA TGGTTTGCGG 10500 

CGGATGGCTT CTAGGATATT GAGCACAATC TTACCACCAT CCAAGGCTGG AATCGGAATA 10560 

AGATTAAAAA TCCCAATATT GATGGAAATC ATTGCCAAGA AGTACAAGAT ATTTTCAATT 10620 

CCATTTTTAG CAGCATCACT ACTTGCCTTA AAGATAGCAA CAGGTCCACC CAACTTGTTC 10680 

AAATCTGGTT GGAAAATCAG ATTTTTCAGA GCTGAGAGAA TTCGGAGAGC TGAGTCAGCA 10740 

GCAGTTGTAA AACCACCTAC AAACATGGAT AGAAAATCTG ACTTAACCCC CGGTTGAACA 10800 

CCTAGAAGGT AACGACCTTG ACTATCTTTG GGTGTAACAG TGACTTGTTT GTCACTCCCC 10860 

TTTTCAGAAA TAGTCACATC CAAAGTCGGT GCCGTCTTAT CTTTGGTTTC TGTTTCCACA 10920 

GCTTGGATCA AGCTTTCCCA GTTGCTAACC TCATGTGAGC CAATCTTGGT AATTTGTGCC 10980 

ATTTCTGGTA CTCCTACCTT GGCCAAGGCA CCTTGGGGCA TGATATGGAA CTGATTGGTA 11040 

TCAACATCTC TGACACCACC CTGCATAAAG ATTAAAACCC AAAAAACAAC GACACCTAAG 11100 

ATAAAATTGT TCATAGGACC TGCAAAATTG GTAATCAGTT TGCCCCAGAT AGTCGCATTT 11160 

TGATATTGAA CATCTAAAGG TGCAATCCGA ACCTCAGTAC CATCTGCTTC CACAACCGTT 11220 

GCATCGTGAT CCACTGCAAA TGTTTTTTCT TCTTCCAGAA CCAATCCTTT GATAAAGAGC 11280 

TTGTCTTCAA AATCAAACTG GGTCACCTGC ATAGGGAGGG CTGTTTGATC CAATTTTTTA 11340 

CCTGAGAGAT TGATGCGTTT AACCTTACCA TCATCAGCAA GTGTCAAACT AACAGGCGTT 11400 

CCTGTCTTGA TTTCAGTTGT ATCATCACCC CAACCGGCCA TGCGGACATA GCCACCCAGA 11460 

GGCAAGATTC GAATGGTATA GGCCGTTCCA TCCTTGCCAA TGTGAGCAAA AATTTTAGGT 11520 

CCCATACCGA TGGCAAATTC ACGTACTAAA ATCCCTGATT TCTTGGCAAA GTAGAAGTGA 11580 

CCGAACTCGT GCACCACTAC AATAATCCCG AAAACCAGAA TAAAGGTTAA AATTCCGAGC 11640 

ATAGCGTTTC CTCCGTCTTT TGATTAAAAG AGTCCAAATA AGTGCATGAT TGGAAATACA 11700 

AGCAACATAC TATCGAAACG ATCCAAAACA CCACCATGTC CAGGGATAAA TTTCCCAGAA 11760 
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TCCTTAACAC CAAAATGACG TTTGATCGAA CTTTCTAGTA AATCACCAAA TTGTCCAGCA 11820 

ATGCTAAAGA AAATAGCAAA GACTGACATC TTGTAAATTC CATATGGAAG AGCAACTGTA 11880 

CTGTCAACTA TCATAAGGAT AATGGTTACT AAAATTGCTC CTAAAATACC ACCCAAGGCA 11940 

CCCTCAAGGG TTTTATTAGG CGATACCCTT GGTGCTAACT TTCGTTTCCC ATAGTTCATC 12000 

CCAACAAGAT AGGCACCACT GTCTGTCGCC CAGACGATAC ACAAGGCTAA GAGAGCCTTG 12060 

TCCAAACCTG CAACACGAGC ATCTAGTAAA GCATTAAATC CAAAGCCCAC GTAGAAGCTC 12120 

ATAGCAAGAG GGAAAACCGC ATCCTCAATC GTATAAGACT TGCTAAAAAC GGTCGTTCCT 12180 

AACATGATTG AAATCAAAAC ACTATAGGCA ACCACATTCC CATCAACTGG CAAAAAAGTC 12240 

AGGTAATTCT CCAAGGGAAT GGTCAATGCA AAGGTTGCAA AGAGGGTCAA GAGGCCCTCC 12300 

ATCGTCATGG TCTCTAGACC TCTCATCTTC AAAAGTTCAT GCATGGCTAG CATGGCTATG 12360 

ATTCCGATTG CTATCTGAAG CAAGAGGCCC CCAATCATTA AAATTGGTAG GAAAATAGCC 12420 

AGGGCAATCC CTGCAAACAA GGTTCTTTTC TGTAAATCCT GGGTCATATT TCCTCCTAAA 12480 

CTCCTCCAAA TCGGCGATGA CGACGATTAT AGGCAAGAAT AGCTTCCTGC AAGGCCGCTT 12540 

CGTCAAAATC AGGCCATAAG GTGTCCGTAA AATAAAGCTC ACTATAGGCT CCCTGCCATG 12600 

GAAGGAAATT GCTCAAACGT AATTCTCCAC TAGTACGGAT AATCAAGTCT GGGTCTCGTA 12660 

AGTCCTTAGG CAAATGCTGA GTAAAGAGAT AGTTACCAAT CAATTCCTCT GTGATGTCAC 12720 

CTGGGTTGAT TTTGGCATCT AAAACATCCT GGGAAATCAA CTTAAGCGCC TGTGTAATCT 12780 

CAGCACGTCC ACCATAGTTA AGAGCAAAAT TAAGAATCAA TCCTGTGTTG TTCTTAGTCA 12840 

ATTCCTCAGC CTTGGTTAAA GCTTCAAAGG TTTGCTTAGG CAGGCGGTCT GTCTCCCCAA 12900 

TCATTTGAAT CTTAACATTA TTCGCATGTA GTTCCGGGAC ATAATTATCA TAAAACTCTA 12960 

CTGGCAAGTT CATGATAAAC TTGACTTCCT GATCTGGACG GGTCCAGTTT TCCGTAGAAA 13020 

AAGCATAGAC CGTAATAACC TTGACGCCCA GTTTGTTGGC TGCCTTGGTC ACGGTTTGCA 13080 

ATGCTTCCAT GCCCGCCTTA TGTCCAAAAA CTCGCGGTTG CATACGTTTT TTAGCCCAAC 13140 

GGCCATTGCC ATCCATGATG ATGCCGATAT GAGCAGGAAC CTGTGTCGGA ACCTCTACTT 13200 

CCACAGCCTT ATCTTTCTTA AAAAATCCAA ACATGATCTT ATTCCTATTC AAAAATCTAT 13260 

CGTTTCATTA TACCATATTT CCCCATTTTC TTCTATCACT AAGCTATTTA TTCTCAGGCA 13320 

CCAAGCCCAT TTTTCAAAAA AATAAGCCGC CTGATTGGGC GACTTTATTT TTATAGGGAG 13380 

ATTATTATGA AAAAGTTTTA GGAGTTTAAG TTAAGGTCTT CTTAACTTAT GAACTTAGTG 13440 

TACACTCCCT AGCTTAAAGT TTCCTTAAGT ATTTTTAAAA ATCAAATTTT TCCATTTCTC 13500 
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CTGCCAATTT TTCTTGGATA AACGTGTTTG ATAGAGTTCC ATTOGGTCTT CATTTTCTAA 13560 

GAAATGAGGA GTTGGACGAA CTTGAAAATT CAAAATATCC TCCAAACCAT AAGGTACATA 13620 

GAGTTCAAAA TCTAATTCTT CATTCAAGCG CAGTCCAACT GCCGTACACC GTTCTGGATA 13680 

CTTACTCATA GCATCACGAG AACTGGTATA GGAAGCAGTG TGAGGACTGT GCTGATGCAT 13740 

ATAGACCTGA TTTTTCAATT CCCACTGGTA CTGAGGAAAA TCCTCTCTCA GCTTTTTCTC 13800 

CAGTAATAAG GTTTCCTCAT AAGAAAAATC TGGATCAAAG AAAATCACAT CTATATCTGT 13860 

TTCATGATCA AAAGGGGATT TGTCTGACAA AAGATTCCAG ATGAAATTTC TGACAGAACC 13920 

TGCTGCCAAC CACGAGTCTT TCAAACCAAG GTCTCGGATG ATCGTCAGAA TGGCCATCAT 13980 

ATCTGGACTT TCTCTAAAAG CCTCTAAGAT TTCTTGCTTA TTTTTCACTG TATTCATAAC 14040 

CTAAGTGCTC ATATGCCTTA GCAGTCGCCA CCCGTCCAGA CCGTGTCCGC ATGATAAAAC 14100 

CTTTTTGAAT CAAGTAAGGC TCATACATGT CTTCAACTGT CTCACGCTCT TCGGCGATAT 14160 

TCACAGAAAG AGTTCCTAGA CCAACAGGTC CTCCACTGTA CATCTCAATC ATGGTGCGAA 14220 

GGATTTTTTG ATCCACATAG TCCAAACCTT CATGGTCAAC ATCCAGCATA GTCAAAGCCT 14280 

TATCGGTAAT AACATCATCG ATAACCCCAT TCCCCATTAT CTGGGCAAAA TCGCGCACGC 14340 

GCTTGAGGAG ACGATTGGCA ATACGAGGGG TTCCACGACT ACGTAGGGCC AACTCAGATG 14400 

CTGCCTCATG GGTGATTTCC ATCTCAAAAA TATCTGCCGT CCGCTCGACA ATTTCTGTCA 14460 

AGTCAGCATG AGCATAATAC TCCATATGAC CTGTAATCCC AAAACGTGCC CGTAGTGGAT 14520 

TTGAGAGCAT ACCAGCCCGA GTCGTCGCAC CAATCAAGGT AAAAGGAGGC AACTCCAAAT 14580 

GAACACTGCG ACTGCCTTCA CCAGCCCCAA TCATAATATC GATGTAGAAG TCCTCCATGG 14640 

CACTATAAAG CACTTCTTCC ACTGACATGG GTAAGCGATG AATCTCGTCA ATAAAGAGGA 14700 

CATCTCCAGG CTCTAAATCA TTCAAAATCG CTACCAAATC ACCCGCTTTT TCGATAACAG 14760 

GACCAGACGT TTGCTTGAGA TTGACTCCCA GTTCATTGGC AATGACAAAA GCCATGGTTG 14820 

TTTTCCCAAG CCCTGGAGGG CCAAATAAGA GCACATGATC CAGCGCTTCA TCCCGCATTT 14880 

TAGCGGCTTC GATAAAGATC TGAAGTTGAT CCTTAACCTT ATCCTGACCA ATATATTCAC 14940 

GTAAATACTG AGGACGGAGC GTGCGTTCTA CTAACTCCTC ATCACCCATC ATCTCATTAT 15000 

CTAAAATTCT ACTCATGGCT CTATTATATC AAAAAAAACA AGCCACAAAC AAAAAAGCCA 15060 

CCTGATTGGG TGACTCCTAA GTTTAGCACT TATGTGGTAT AATATTATAC GGCACTTCTA 15120 

CACCGCCTAC GAAAGGAGGT GAGATAGCCC ATGATGGAAT TAGTACTCAA AACTATTATC 15180 

GGACCAATTG TGGTCGGTGT CGTTCTTCGT ATAGTCGATA AATGGCTAAA CAAGGACAAA 15240 

TAGTGTCAAA AAAGACCTCA AGCTTATTTG GTCGTGAGCT TGGGGTCTTT TCTAGCCTAT 15300 
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GATATAGAAC TAGTACTCAA TTCCTTTTTA TTATCCCATA GTTCACGAAT TTTGTCAAAA 15360 

CTTTACATTT TCTTCAACCG CTGTACGACA AGACGGTTAA GATTAAGAGA ACGTTAGGGA 15420 

TTCTATCAAT TTCATAGAAA TTTTGATTTC GTAAACGAAG AGACAATCTT ACATGTCACT 15480 

TCTCATTTAA TACGCCACTA CTAGACAAGC AAAATCATTA TTACAGTAGT TCCAGTCCTT 15540 

CAATTAACAG TCACTTACAA TCAAATTGAG TTTGAACTAG CTGAAGCGAC CACAGACCTA 15600 

TTTCTTAGTC ATATTCGCTA AAAAAATCCC CGCCAAAATC TCAAAAAGTC CCCGCCAATT 15660 

CCCCGACCAA AATCCGAAAA ATACCGAAAA ATATCGAAAA ATTATTTTTA GAATAGTCCC 15720 

AAAAATCCTG AAATAGAGCT AAAAAACTCC ACCTGATTCG GTGGAGTTAA GGGAGATTAT 15780 

TATGAAAAAG AAAAGTTTAG GATTTTATTA AATAAAGTTA GGAGGTCTTT ATTTAATAAC 15840 

TACATGATAC AAGACGAAAC TTAAAACTAG CTTAACTTTT CTAAAATTTT ACTATTTTGC 15900 

AAAAAATTTC TATCACCAGC ACCTCACCAA TCGAGTAGGG GATAATCTCT AGCCCCTCTC 15960 

ACACCACCGT ACGTGCCGTT TGGCATACGG CGGTTCAACT AACTTTTAAC GCATGTCGTT 16020 

CAAGGTAATA ATCCAAACAC GAAACCAGTC CACGTTTTTC CAGGACTGGT TTTGATATAG 16080 

CACGTTTAAG TACCGACTTC TGAGCTACTA ATTGATAATG GTCGCCCCAG CCAGATACCT 16140 

TATCTGCTAT CCATTTAGGA ACTCCTAACT TAAGCAATCC CCATAATCGT CTCGATTTCT 16200 

TCTTCCATTG CTTCCAGATA ATCACTCGTA GGCGAGTACG CAAGCGCTCA TCTATGCTGG 16260 

CGACTATACT TTTCATATTT CCCAATGAGC AATAGTTTAT CCATCCTCGA ATAGACAAAT 16320 

TCAGTTGCTC AATACGTCTT GTTAGGTCTA TACTCCATTT CCTCTGTGTT AGTTTCTTCA 16380 

ATTTAAACTT AAATCTCCGA ACACTATCTT GATGTGGACG GCTTTTCCAA CCATCTGATA 16440 

ATTTCCAGAA CCCAAAACCT AGATATTTCA ACTCTCTTGG TCATGTTTAC TTTCAAACCT 16500 

AGCCGTTTCT CAATAAACGA CTGACTGAAT ACATC 16535 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8136 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

CCAGAGCGTT GCGTCCGAAA GTCTATCCAG ACACGGCTCT TTAAAAACAA AAGGAGAAAT 60 

GATGCATACT TATTTGCAAA AGAAAATTGA AAATATCAAA ACAACCCTAG GTGAAATGTC 120 
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AGGTGGTTAC 


CGTCGTATGG TTGCGGCTAT GGCTGATTTA GGATTTTCAG GAACTATGAA 


180 


GGCTATCTGG 


GATGACCTCT 


TTGCCCATCG 


TAGTTTTGCC 


CAGTGGATTT 


ATTTGCTGGT 


240 


TTTAGGAAGT 


TTTCCTCTCT 


GGCTGGAGTT 


GGTTTACGAA 


CATCGTATTG 


TTGACTGGAT 


300 


TGGGATGATT 


TGTAGCTTGA 


CAGGGATTAT 


CTGTGTAATC 


TTTGTATCGG 


AAGGTCGAGC 


360 


AAGTAATTAT 


CTTTTTGGCT 


TGATTAACTC 


TGTTATTTAC 


CTTATTTTGG 


CCCTACAGAA 


420 


AGGCTTTTAT 


GGTGAGGTGC 


TGACGACACT 


TTACTTCACA 


GTCATGCAGC 


CAATTGGACT 


480 


TCTAGTTTGG 


ATTTATCAGG 


CACAGTTTAA 


GAAGGAAAAG 


CAGGAGTTTG 


TCGCGCGTAA 


540 


ACTGGACGGC 


AAGGGCTGGA 


CAAAGTATCT 


TTCCATTAGT 


GTGCTTTGGT 


GGTTGGCCTT 


600 


TGGCTTCATT 


TATCAGTCTA 


TTGGTGCCAA 


TCGTCCCTAT 


CGTGATTCAA 


TCACAGATGC 


660 


AACCAATGGG 


GTAGGGCAAA 


TCCTCATGAC 


AGCTGTTTAC 


CGTGAACAGT 


GGATATTCTG 


720 


GGCGGCTACC 


AATGTCTTTT 


CAATCTATCT 


CTGGTGGGGA 


GAAAGCCTGC 


AAATTCAAGG 


780 


GAAATATCTA 


ATTTATCTCA 


TTAACAGTCT 


AGTTGGTTGG 


TATCAATGGA 


GCAAGGCAGC 


840 


TAAGCAGAAT 


ACTGATTTAC 


TTAACTAGGA 


AAAGATGTTT 


GAAAGTGCTG 


TTTTGAGATT 


900 


TCGATTAAAA 


CAGATATAGT 


TGATAATCAA 


GGATTTATAG 


TATGAAAAAG 


AGGATCGGCG 


960 


GGTCCTCTTT 


TGTTGTTGAA 


AAGATAAAAA 


ACTCAGTAAC 


CTAGAAATAA 


GACAACTGAA 


1020 


GCTTTACTCT 


ATATTCAATT 


TTTAGGAATG 


AGAAGGTCTA 


GATAAAATTG 


GACAACTTCC 


1080 


TGGTCTGTGA 


AATCTTGACC 


TTTTTTGAGC 


CACCAGGTCA 


ATGTCTCGAT 


AAAGTTGGAC 


1140 


ATGACCAAGT 


GTTGGAGGTA 


AGAAGTAGGC 


AGATTAGGGT 


GGGCTTCTTT 


TAAATTATCA 


1200 


GCTAGCACGG 


AATAGACATG 


GTGTTCTAGC 


TCTTTATGGA 


GTTGACGGAG GAAGTAGTCA 


1260 


TTTTTGGAAA 


ATAGCAGACT 


GGTGATATGG 


TCTTGGTTTT 


TATGAAAATG 


GAGAAAGAGG 


1320 


TGGGCGAGGT 


AGTCCTCGGT 


TGAAATGGCT 


TGCTCTCTTT 


CAAAAAGATG 


ATGGAAGAGG 


1380 


TAGCGGCAGA 


GCTGGTCCAG 


AAGAAGCTCC 


TTACTCTCAT 


AGTGACAGTA 


AAAGGTGGAT 


1440 


CGTCCCACAT 


CTGCGAGATC 


AATGATATCC 


TGAACAGTAG 


TGGCCTCGTA 


GCCCTTAGCA 


1500 


TTCAAAAGTT 


GTATAAAAGC 


TTGATAGATG 


GCTTTTTTGG 


TTTTGCTGAT 


ACGGCGGTCA 


1560 


ATGTTAGTCA 


TATGGACACT 


TAAGGCAAAT 


TGTTCAGAAC 


TGAATAAAGC 


TGACGTTTTG 


1620 


CTTCTATCCT 


TTCTTTGAGT 


TTTAGTGGAT 


AATGATAATG 


AACAAGGTGT 


TCATAAATCT 


1680 


ATTATAACAA 


AGGAATGAGA 


AATATGAAGG 


CAAAATATGC 


TGTTTGGGTG 


GCTTTTTTCT 


1740 


TAAATTTGAC 


TTATGCCATT 


GTTGAGTTTA 


TTGCAGGTGG 


AGTATTTGGT 


TCTAGCGCTG 


1800 


TTCTTGCTGA CTCTGTGCAT GACTTGGGAG ATGCGATTGC AATTGGAATA TCAGCTTTTC 


1860 


TAGAAACAAT 


CTCCAATCGT 


GAAGAAGACA 


ATCAGTACAC 


CTTGGGCTAT 


AAGCGGTTTA 


1920 
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GCCTGCTAGG AGCCTTGGTA ACAGCTGTGA TTCTCGTAAC GGGCTCTGTT CTAGTCATTT 1980 

TGGAAAATGT CACGAAGATT TTGCATCCGC AACCAGTCAA TGATGAGGGG ATTCTCTGGT 2040 

TAGGAATTAT TGCGATTACT ATCAATCTGT TAGCGAGTCT GGTGGTTGGT AAGGGAAAGA 2100 

CAAAGAATGA GTCTATTCTG AGTCTGCATT TTCTGGAAGA TACGCTAGGG TGGGTAGCTG 2160 

TTATCCTGAT GGCGATTGTT CTTCGATTTA CGGACTGGTA TATCCTAGAT CCTCTTTTGT 2220 

CCCTTGTCAT TTCTTTCTTT ATTCTTTCAA AAGCCCTTCC ACGTTTTTGG TCTACACTCA 2280 

AGATTTTCTT GGATGCTGTG CCAGAAGGTC TTGATATCAA GCAAGTAAAG AGTGGCCTGG 2340 

AGCGATTGGA CAATGTGGCC AGCCTTAATC AGCTTAATCT CTGGACTATG GATGCTTTGG 2400 

AAAAAAATGC CATTGTCCAT GTTTGTCTAA AAGAAATGGA ACATATGGAA ACTTGTAAAG 2460 

AGTCTATTCG AATTTTCCTA AAAGATTGTG GTTTTCAAAA TATTACCATT GAAATTGATG 2520 

CTGACCTAGA AACTCACCAA ACCCATAAGC GAAAGGTGTG TGACTTGGAA CGGAGTTATG 2580 

AGCATCAACA TTAGAAAAAA GTGAAAAATA CTTGGGTACT ATCTTATTTG GAATAGAGTA 2640 

ATTTCTTTAT TATTTAAATA TTTCAAAAAT TGGTAAGAGA AGAGCATTGT ATAAACTCCA 2700 

GATATATGAT TGTTAATGAT AAAAATTTTT CGATTAGATA CAAAATGCTT GACTTGGAGT 2760 

CAACTCAAAG TTATATAATA AGATAAGTGA GTTAGAATAG CGTGAATTCA GTGAATGAAA 2820 

TGAGAGGAGG TTAGCGTGTG AATATTAAAT CTGCCAGTGA TTTGTTGGGA ATTTCAGCGG 2880 

ATACGATTCG GTATTATGAA CGGGTTGGTC TTGTGCCACC GATTACTCGT ACTGCTACTG 2940 

GGATTCGTGA TTTTCAAGAT CAGGATATCG AAGCGCTGGA ATTTATTAAG TGTTTTCGTT 3000 

CGGCGGGTGT CTCTGTAGAT AGTTTAGTTG ACTATATGTC GCTCTACCAA AAGGGAGATG 3060 

AAACGAGAGA GGAGAGGCTT GGTATTTTAG AAGAGGAAAA GCAAAAATTA GAGGAGCGCT 3120 

TGTCTCAGCT ACAGACAGCT TTAAATCGTT TAAATCTCAA AATTAAACTT TATAAGGAAG 3180 

GAAAATTTTA AATGAAATCA GCAGTATATA CAAAGGCAGG TCAGGTTGGA CTTGCTAGCA 3240 

TTGAACGTCC GCAAATAATA GAAGCGGATG ATGTGATTAT TCGTGTGGTT CGTGCGTGCG. 3300 

TTTGTGGTTC AGATTTATGG AGGTACCGTA ATCCAGAAAC GAAAGCTGGA CACAAAAATA 3360 

GTGGACACGA AGCGATTGGG ATTGTTGAAG AAGCTGGGGA AGCCATTACG ACGGTGAAAG 3420 

CAGGTGATTT TGTGATTGTC CCTTTTACAC ATGGATGTGG TGAGTGTGAT GCCTGTCTTG 3480 

CTGGATTTGA CGGTTCTTGC GACAATCATA TTGGCAATAA TTTGGGGGGT GATTTTCAGG 3540 

CAGAATATAT TCGCTTCCAC TATGCAAACT GGGCGCTGGT TAAAATCCCT GGTCAACCTT 3600 

CTGACTATAC AGAAGGGATG CTCAAGTCCC TTTTGACTCT TGCAGATGTC ATGCCGACAG 3660 
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GCTATCATGC GGCGCGTGTT GCAAATGTTC AAAAAGGGGA CAAGGTTGTT 


GTTATCGGTG 


3720 


ATGGGGCTGT 


TGGTCAATGT 


GCTGTCATCG 


CGGCTAAGAT 


GCGTGGAGCA 


TCACAAATTA 


3780 


TCCTTATGAG 


CCGTCATGAA 


GACCGTCAAA 


AGATGGCTAT 


GGAGTCAGGT 


GCGACAgcTG 


3840 


TTGTTGCAGA 


ACGTGGTCAA 


GAAGGAATTA 


CCAAGGTGCG 


TGAAATCCTC 


GGTGGAGGAG 


3900 


CAGATGCAGC 


ACTTGAATGT 


GTTGGTACGG 


AGGCTGCTAT 


AGAACAGGCG 


CTAGGTGTTC 


3960 


TTCATAATGG 


AGGGCGTATG 


GGCTTTGTAG 


GAGTCCCACA 


CTATAATAAT 


CGTGCTCTTG 


4020 


GTTCGACATT 


TATGCAAAAT 


ATCTCTGTAG 


CAGGTGGGGC 


AGCTTCTGCT 


ACAACATACG 


4080 


ATAAGCAATT 


TTTACTAAAA 


GCCGTCCTTG 


ATGGTGATAT 


CAATCCAGGT 


CGCGTCTTTA 


4140 


CTTCAAGTTA 


TAAACTGGAA 


GATATCGACC 


AAGCCTATAA 


AGATATGGAT 


GAACGTAAGA 


4200 


CAATTAAGTC 


TATGATTGTA 


ATCGAATAAA 


AAACGAATAG 


GAGTTTTAGA 


ACTCTATTCG 


4260 


TTTTTTATGT 


TATCCTATTC 


TTGATTTAGG 


GTACTTTCTC 


TTAATGTCAG 


TCTGGTTCCC 


4320 


AGCATGGTCA 


GGCTAGGGAT 


TTTCCGACCG 


TGGAGGACTT 


CCTTGTTAAG 


AATATCCATA 


4380 


CCTGCTCGGC 


CCATTTCTTC 


AGTATAAACT 


GTAATACTAG 


AGAGGGGAGG 


ATAGACCTGT 


4440 


TTGGTCAGAC 


TAGTGTCGTT 


AAAGGAAATG 


AGGCTGACGC 


GATCTGGCAG 


GCTGATTCCA 


4500 


GCTTCTTGGA 


GGGCACGGAG 


GGCACCGATA 


GCTAAACTAT 


CGCTGGCTGC 


GAAAAATGCT 


4560 


GGCGGAAGTT 


GGTCTCCCAA 


GCTCTGAATG 


GCCTCCTTCA 


TTAAGTCATA 


GCCAGACTGG 


4620 


GCAGTAAATC 


TTCCTTGAAA 


GACCAGTTCA 


TCATGATAGA 


TTCCCCTCGC 


TTGACTATAG 


4680 


TTTTTGAAGT 


TTTCTAGACG 


CTPGTCCTGA 


ATGATTTCTT 


CTTGGTCTGT 


TGTTTCTTCA 


4740 


AGGCCTGTTA GAATCCCGAT ACGGTCCATT CCTTGACTGA GGAAATAATC 


GACAACCTGT 


4800 


TTCATAGCAG 


TGTAAAAATC 


CGTGATAATA 


CAGGTATGTC 


CCAGGGAAAG 


TGTATCGCTG 


4860 


TCTAGAAATA 


CAAGAGGCTT 


TTGGTATTCT 


TCAAAGGCAG 


AAATCTGAGC 


TCGACTAAAC 


4920 


TTTCCGATGC 


AGAGAATCCC 


AATCACTTCC 


TCGCTTAGGG 


TAAAAGGGTG 


GTCATTAAAA 


4980 


TAGCGCAAGA 


TATCATAGTC 


CAACTCTTGG 


GCTCTTTTTT 


CTATTCCTAG 


GCGAATCTGG 


5040 


TAGTAGTAGA 


GGTCGTCCAG 


CTCCCCTTGT 


TCGCTGACCC 


ATTGGATAAT 


GGCAATCTTT 


5100 


TGCTTGGGTT 


TGTGGGACTC 


GCCTGTCTTG 


AGGTGCTTGG 


TGTAGCCCAG 


CTCTTCAGCA 


5160 


ACGGTTAAAA 


TACGGTGTCT 


GGTTTCTTCT 


GTAACAGATA 


GGCTCTGGTC 


GCGGTTGAGG 


5220 


ACGCGGGATA 


CGGTCGCGAT 


AGAGACAGAG 


GCTAGCTGTG 


CAATGTCTTT 


TAAGGTAGCC 


5280 


ATAAATCCTC 


CTTGATTAGG 


TTAGTATATC 


ATGTTTTTCT 


TCTTTTTACT 


GATATTTTAC 


5340 


TAAAATTTTA 


GTAAAAAGGA 


TTGACCTTGG 


AAAATTCCTT 


GGATATAATA 


GAAAGAAAAC 


5400 


GATTACACGT 


TAAGATGGCT 


TAACGGACAG 


TCAAAGGAGA 


ATTCATATGG 


CACAACATCT 


5460 
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TACTACTGAA 


GCCCTTCGCA 


AAGACTTTCT 


TGCTGTTTTT 


GGTCAAGAAG 


CAGATCAAAC 


5520 


CTTCTTTTCA 


CCAGGCCGCA 


TTAATTTGAT 


TGGTGAACAC 


ACAGACTACA 


ACGGTGGGCA 


5580 


CGTTTTTCCT 


GCTGCTATTT 


CCTTGGGAAC 


TTACGGTGCA 


GCTCGTAAGC 


GTGACGACCA 


5640 


AGTCTTGCGT 


TTCTACTCAG 


CTAACTTTGA 


GGACAAGGGC 


ATTATCGAAG 


TGCCTCTCGC 


5700 


TGACCTCAAG 


TTTGAAAAAG 


AGCACAACTG 


GACCAATTAT CCAAAAGGTG 


TCCTTCATTT 


5760 


CTTGCAAGAA 


GCTGGGCACG 


TGATTGACAA 


AGGTTTTGAT 


TTTTATGTTT 


ATGGAAATAT 


5820 


TCCAAATGGT 


GCTGGCTTGT 


CTTCTTCTGC 


ATCCTTGGAA 


CTCTTGACAG 


GAGTCGTGGC 


5880 


TGAGCATCTC 


TTTGATTTAA 


AATTAGAGCG 


TCTCGATTTG 


GTTAAAATCG 


GCAAACAAAC 


5940 


AGAAAACAAC 


TTTATCGGAG 


TAAACTCTGG 


CATTATGGAC 


CAGTTTGCTA 


TTGGTATGGG 


€000 


GGCAGACCAA 


CGTGCTATTT 


ACCTAGATAC 


TAATACTTTA 


GAATACGACT 


TGGTGCCACT 


6060 


TGATTTGAAG 


GACAATGTCG 


TTGTTATCAT 


GAACACCAAC 


AAACGCCGTG 


AATTGGGGGA 


6120 


CTCTAAATAC 


AATGAACGTC 


GTGCTGAGTG 


TGAAAAAGCA 


GTGGAAGAAT 


TGCAAGTTTC 


6180 


CTTGGATATT 


CAGACTCTGG 


GTGAATTGGA 


CGAGTGGGCC 


GTTGACCAAT 


ATAGCTATCT 


6240 


GATTAAAGAT 


GAAAATCGTT 


TGAAACGTGC 


TCGCCATGCT 


GTGCTTGAAA 


ACCAACGTAC 


6300 


CCTCAAAGCT 


CAAGTAGCAC 


TCCAAGCAGG 


AGATTTGGAA 


ACATTTGGAC 


GCTTGATGAA 


6360 


TGCGTCACAC 


GTTTCTCTGG 


AGCATGATTA 


TGAAGTAACT 


GGTTTGGAAT 


TGGATACCCT 


6420 


TGTTCACACA 


GCTTGGGCAC 


AAGAAGGAGT 


TCTCGGTGCT 


CGTATGACAG 


GGGCTGGTTT 


6480 


TGGTGGCTGT 


GCcATTGCCT 


TGGTTCAAAA 


AGATACTGTT 


GAGGCCTTTA 


AGGAAGCTGT 


6540 


AGGCAAACAC 


TACGAGGAAG 


TAGTTGGATA 


CGCTCCAAGC 


TTCTATATCG 


CTGAAGTTGC 


6600 


AGGTGGCACT 


CGCGTCCTTG 


ACTAGTCAAA 


AGGAGGCTCT 


ATAGTGACCT 


TAGTAAATAA 


6660 


ATTTGTAACA 


CATGTCATTT 


CTGAAAGCTC 


ATTTGAGGAA 


ATGGATCGAA 


TCTATCTGAC 


6720 


CAATCGTGTT 


TTGGCACGAG 


TGGGAGAAGG 


TGTTTTGGAA 


GTTGAGACCA 


ATCTGGATAA 


6780 


ATTGATTGAC 


CTCAAGGACC 


AGCTGGTTGA 


AGAAGCCGTT 


CGATTAGAGA 


CGATTGAGGA 


6840 


TAGTCAGACT 


GCGCGTGAAA 


TCCTTGGTGC 


TGAACTGATG 


GATTTGGTGA 


CTCCTTGTCC 


6900 


AAGTCAGGTC 


AATCGTGATT 


TTTGGGCAAC 


CTACGCCCAC 


TCTCCAGAAC 


AAGCGATAGA 


. 6960 


GGATTTTTAC 


CAACTCAGTC 


AGAAAAATGA 


CTACATCAAA 


CTCAAGGCCA 


TTGCTAGAAA 


7020 


TATCGCTTAT 


CGTGTTCCAT 


CTGACTACGG 


AGAACTTGAA 


ATTACCATCA 


ATCTCTCTAA 


7080 


GCCTGAAAAA 


GATCCCAAAG 


AGATTGTGGC 


AGCCAAGTTG 


GTGCAAGCTA 


GTAATTATCC 


7140 


TCAGTGTCAG 


CTTTGTCTAG 


AGAATGAGGG 


CTACCATGGT 


CGAGTTAACC 


ACCCAGCTCG 


7200 
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TAGCAATCAC CGTATTATCC GTTTTGAAAT GGTTGGTCAG GAATGGGGTT TCCAGTATTC 7260 

GCCCTATGCT TACTTTAATG AGCATTGTAT CTTTTTAGAT GGCCAGCATC GTCCCATGGC 7320 

CATTAGTCGT CAGAGTTTTG AACGTCTGTT GGCTATCGTA GACCAGTTTC CAGGATATTT 7380 

TGCTGGATCT AATGCCGACC TGCCGATTGT GGGGGGCTCT ATTCTAACTC ATGATCATTA 7440 

TCAGGGAGGC CGTCACGTAT TTCCTATGGA ATTGGCTCCC TTGCAAAAGG CCTTCCGATT 7500 

TGCTGGTTTT GAGCAGGTCA AGGCTGGAAT TGTCAAGTGG CCCATGTCTG TCCTACGTTT 7560 

GACTTCGGAT TCCAAAGAGG ATTTGATCAA TTTGGCTGAT AAGATTTTGC AGGAATGGCG 7620 

CCAGTATTCA GATCCTGCAG TGCAGATTTT GGCAGAGACA GACAGGACAC CGCATCACAC 7680 

TATCACACCC ATTGCCCGCA AACGCGATGG ACAGTTTGAG TTGGACTTGG TCTTGCGAGA 7740 

CAATCAGACT TCAGCAGAGT ATCCTGATGG TATCTATCAT CCCCACAAGG ATGTCCAACA. 7800 

TATCAAGAAG GAAAATATCG GCTTGATTGA GGTCATGGGC TTGGCAATCT TGCCACCACG 7860 

TCTGAAAGAA GAAGTGGAGC AAGTCGCTAG CTATCTTGTA GGAGAAGCTG TTACAGTTGC 7920 

CGATTATCAT CAGGAGTGGG CAGACCAACT CAAATCCCAA CATCCAGACT AACGGATAAA 7980 

GAAAAAGCCC TTGCAATCGT CAAGGACTCT GTGGGTGCTA TCTTTGCGCG TGTACTTGAG 8040 

GATGCAGGAG TCTACAAGCA GACAGAACAA GGGCAGACAG CCTTTATGCG CTTTGTGGAA 8100 

CAGGTCGGAA TTTTACTAGA CTAGGAGCTT TCTCGG 8136 
(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10011 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
{D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

CCCATAGTGA AGAGTGGCCA TAAGAAGGTC TTCTAGGCTT AATTTAGGTT TTCGTCCACC 60 

TTTTGCGTGT TTAAGTTGAT AAGCTGTTTT TAACACAGCT GAACATCTCT TCAAAAGTCG 120 

TGCGCTGAAC ACCAACAAGA CATTTAAATC GTGTATCAGT TAGTTGTTTA CTTGCTTCAT 180 

CATTCATAGA ACTACTATAC CATGTTTTGT TTCGCAGGAA GTCTAATATT GTCAAATACT 240 

GGAACGCTCA TTGCTGGGAT ACGGAATAAG ATTGGCCCAG CTTCGATAAC TGGGATACCT 300 

GGTTCAAAAC CAAGGTCTGT TGCAGCGATT GGTGTAAAGA TATCGTAACC TTTCATAAGG 360 

TCTTCGTTTA CATCTTTCAC CATAACTGCA TCACAGTGAA CATCGTAACC ACGGTTTGAA 420 

AGTTCTTCTT CTAGAGCACT TTTAATTTGG TGACTTGAGT TAACACCTGC ACCGCAGGCA 480 
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GCAAGAATTT TAATCATTTG GATTTCCTCC GATTTTATTT TTTAATAGAC AAGATTAAGC 540 

GGTTGCTTCA GCAATGTAAG CATAAAGGGC TTCTGGTTCA GAAATTTTTG ATAGGTCTTC 600 

AAGATGACCA TTTCCTGTGA AGAAGTCCAT TAACTGAGCA AGAATGTTCG TTTGACTTGA 660 

ACTTGAATTA TTGATGATAA AGAAGAGCAA GGATACTTCT ACTTCCTTAC CTGGCGCAAT 720 

CATATTATGG AAAGTCACCG GTTTCTCTAA TCGAACAACC ACCACTTTCT CAGCTAGATT 780 

ATGAACAATA TCTGTGTGAG GAATCATTAC ATTTGCAAGT CCTTTCCTAG AAATTCCATA 840 

TATAAACCAG TTGGAAATGA CTTTTCACGC GTGATCAAGG CTTCACGATA AGTTGGAGTG 900 

ACAATTTCTC GTTCTTCCAA CAAGCTTGCT ACCTGATCAA AAAGTTATTC TTGATTATCC 960 

GCTTCTAAGC AAAACACAAG GTTTTTGTCA AAGAAATAAT CTAATACCAT AAGGTTTTCC 1020 

CTTCTTTCCA TTAACTTTAT GCTATAAGTA TAACACTATA TGAAATCGTT GTTAATTACT 1080 

TTCTATTCTT TTTTGTCTCT TTTTTTATAT TTTTGTTTTG TTTATAGTTT GTTATATAAA 1140 

AATAAACACA CAAACAAATA CTCCAAGCAT TTTTCTGTTC TAATACTCAA TGAAAATCAA 1200 

AGAGCAAACT AGGAAGCTAG COGCAGTTGT TCAAAACACA GTTTTGAGGT TGTAGATGAA 1260 

ACTGACGAAG TCACTCAAAA CATGGTTTTG AGGTTGTAGA TGAAACTGAC GAAGCAACAg 1320 

CCATACATAC GGTAAGGCGA CGCTGACGTG GTTTGAAGAG ATTTTCGAAG AGTATAAAAA 1380 

CTAAAAAAGC AGACCATCTA AGCCTGCTTT ACTATTGATT GTTATATAAA TTTCCTGTGA 1440 

ACAAGGAAAG GCATTTCTGA TAACTTATTC TTCATCCATA CTCAAGACGC TGAGGAAGGC 1500 

TTCTTGCGGA ACTTCAACTG ATCCGATGGA TTTCATGCGT TTCTTACCAG CTTTTTGTTT 1560 

TTCAAGGAGT TTACGCTTAC GAGAAACGTC ACCACCATAA CATTTAGCAA GTACGTTCTT 1620 

ACGAAGGGCC TTGATATCAG TACGAGCGAC AATCTTGTGT CCAATAGCCG CTTGGATTGG 1680 

AACTTCAAAT TGTTGGCGAG GGATGATTTT CTTGAGTTTA TCAACGATGA GTTTCCCACG 1740 

TTCGTAGGCA AAGTCCTTGT GAACGATAAA GCTGAGGGCA TCCACCTTAT CTCCATTGAG 1800 

AAGAATATCC ATTTTCACCA GCTTAGATGG GCGATATTCT GACAATTCGT AGTCAAAGCT 1860 

TGCATAACCA CGTGTCGAAG ACTTAAGTTT ATCAAAGAAG TCAAAGACAA TTTCAGCAAG 1920 

AGGAATTTGA TAGATAACAT TGACACGGTT ATCATCAATA TAGTCCATAG TCACAAAGTC 1980 

CCCACGCTTA CGCTGAGCTA GCTCCATTAC TGCTCCGACG AACTCCTGTG GTACCATGAT 2040 

TTGCGCCTTG ACATAAGGCT CTTCAATGGT CGCAATCTTA GTTGGGTCTG GAAACTCAGA 2100 

TGGGTTAGAC ACATCCATAG ACTCACCGTC GGTCAAATTA ACTTTGTAAA TAACAGACGG 2160 

AGCTGTCATG ATGAGGTCAA TATTGAACTC ACGCTCTAAA CGTTCCTGGA TAACATCCAT 2220 
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ATGGAGAAGT CCAAGAAATC CACAACGGAA ACCAAATCCA AGTGCCTGAG ATGTTTCTGG 2280 

TTCAAACTGA AGACTAGCAT CATTCAGTTG CAATTTTTCA AGcGCTTCAC GCAGGTCATT 2340 

GTACTTGTTT GATTCGATTG GGTAGAGACC CGCAAAGACC ATAGGATTCA TCTGCTTATA 2400 

ACCATGTAAT GGTTCTGCCG CAGGATTGGT TGCCAAGGTA ACGGTATCAC CCACACGAGT 2460 

ATCCTGAACC GTCTTGATAG ACGCCGCAAT GTAACCAACA TCACCAGTCG CAAGGAAATC 2520 

ACGACCAACC GCTTTTGGTG TAAAAATACC GACTTCGGCC ACATCAAAGG TCTTACTATT 2580 

GCTCATGAGC TGAATCTTAT CACCAGGTTT GACCACTCCG TCCATGACAC GCACTTGGAG 2640 

GATAACCCCA CGGTAAGCAT CGTAAACAGA GTCGAAAATC AAGGCCTTAA GTGGCGCCGT 2700 

CACATCACCC GTTGGTGCTG GTACTTTTTC TACAATTTGC TCGAGGATTT CTTCAATCCC 2760 

AATACCAGCC TTGGCAGAAG CCAAAACTGC TTCACTGGCA TCCAAACCAA TCACATCTTC 2820 

AATCTCTGTA CGCACGCGCT CCGGATCTGC AGCCGGCAGG TCAATTTTAT TAATGATAGG 2880 

CATGATTTCC AAATCATTAT CCAAAGCCAG ATAAACGTTG GCAAGAGTTT GAGCCTCAAT 2940 

TCCTTGAGCC GCATCGACCA CCAAAATAGC ACCCTCACAG GCAGCTAGCG AACGTGAAAC 3000 

TTCATAGGTA AAGTCAACGT GCCCTGGTGT GTCAATCAAG TGGAAAATAT AAGTTTCCCC 3060 

ATCTTTTGCA GTGTAATTCA ACTCGATGGC ATTCAACTTA ATAGTAATTC CACGTTCCCG 3120 

CTCTAGCTCC ATGCTATCCA AAAGCTGGGC CTGCATTTCA CGACTTGAAA CCGTCTCTGT 3180 

TTTTTCCAAA ATGCGGTCTG CTAGAGTTGA TTTTCCGTGG TCAATATGGG CGATAATAGA 3240 

GAAGTTACGG ATCTTCTCCT GTCGTTTTTT CAATTCTTCT AAGTTCATGA TTCTCTTCCT 3300 

TTCAGGGTAT CTATTTATTA TAAATTGTTT TTGATATTTT GACAAGACCA TACCCTGCTA 3360 

GGAGTACTAA TCTTCAGCGA CAAAGCCGTC ATTTTCGATA AAGTGGTGTT CTGTCATTCC 3420 

TTGGTCTGTA AAGACAATCC CGTGAAGGAC ACCACCATAA ACAGCTCCTC CATCCATTCC 3480 

AATCTTGCCA TCTTCTGTAG TCCAAAGCTC AGATGTACCG CGTTCTTGCT GTAACAAACC 3540 

ATAGACCGGT GTATGACCGA AGACAATGGT TTTTCCAGTA TGATTTTCAG CTCCGTGGAA 3600 

TGGTTTTCTA AGCCATACTT TTTTATAATC TGTTGTTTCA TGCCAGTCGT CCAAGGTCAA 3660 

ATCAATACCT GCGTGAACAA AGATATACTT GTCTGTCTCT ACTACAAATG GCATTTGACG 3720 

AATGAATTCG ACCAAGTCTG CCGCTTCAGC GgCAACCCGC TTGGCATCTT CTACTCCATC 3780 

AACTGGTGCA TCCAAGGGAC GACCTAGGAT AGAGTTAATG GTTGTATCTC CACCATTGCG 3840 

ACTATAATGG TCATAACTTT CTTCTGGGTC ATCTAGCCAA GTCAAAAACA TATACTCGTG 3900 

GTTTCCGGAC AAACAGATAG CCCCTTGATT GTCCACCAAG TCCTTGACCA TTTCAAGAAC 3960 

ACGGTGACTA TCCTCACCTC TGTCAATCAA ATCACCTAGA AAGAGCAACT GGGGCTGACC 4020 
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ATCCCAGGTT 


TTGAGAAGGT 


CTTCCAGCAT 


CCCAGCTTTT 


CCGTGAACAT 


CTCCAATTAC 


4060 


ATAATAATCT 


GTCATCTTAT 


TTCTCCCTGT 


TTCTCAACAA 


TTCTCTTGCT 


TGCGTCAGGG 


4140 


CTGCTTCTGT 


CACATCATCA 


CCTGCCAACA 


TCTTGGCAAC 


TTCCTCCACT 


CGCTCTTCGA 


4200 


CCGTCAAGAG 


ACGAACAGTC 


GAAACCGTTG 


AATGGTCATT 


ACTAATCTTC 


TCAATAAAGA 


4260 


ATTGATAATC 


TGCAATCGCA 


ATTACTTGTG 


GCAAATGGGA 


GATAGCCAAA 


ACCTGACCAT 


4320 


GCTGACCAAT 


TTTATGAATT 


TTCTGAGCAA 


TAGCTTGAGC 


AACACGACCT 


GAAACTCCCG 


4380 


TATCCACCTC 


ATCAAAGACA 


ATGCTAGTCT 


TGCCTTCTTT 


ACGTGAAAAG 


GCAGACTTAA 


4440 


TGGCTAACAT 


GAGACGAGAT 


AATTCCCCTC 


CAGAAGCAAC 


CTTAACCAAG 


GGTTTAAAGT 


4500 


CTTCTCCAGG 'GTTGGTTGAA 


ATATAAAACT 


CAACCATTTT 


ATTTCCCTCA 


CGACTGAATT 


4560 


TTCCCTTACT 


AAAACGAACC 


TGAAACTGGG 


CTTTTTCCAT 


ATAAAGATCT 


TGCAGTTCTT 


4620 


GTTTAATCTC 


AGCTTCGAGT 


TGCTGAGCCA 


AATTATGACG 


AGCAGAAGCA 


AGTTGACCTG 


4660 


CCAAATTGAC 


AAGATTGACT 


TCCAACTTCT 


TAAGCTCTGC 


TTCCATGTCC 


TCAGACGAAA 


4740 


GATTATTGCC 


TGTCAAGAGA 


TTGTATTCTT 


CCGTAATCTT 


GGCAAAATAA 


AGCAAAACAT 


4800 


CATCAACAGT 


CCCACCATAC 


TTACGAGTAA 


TAGTATGAAG 


GAGGTCCAAA 


CGATTCTCAA 


4860 


CCTGCATCAG 


GCGATTGCCA 


TCAAAATCAA 


GGTCCTCAAT 


GATAGCTTCC 


AAACGTTTGC 


4920 


TAATGTCTTC 


TAAAACATAG 


TAGGTCTCAG 


ACAGATAGCT 


TGAAATTTCA 


CGGTATTCAG 


4980 


GATCATACTC 


TTCGACACTT 


TCCATGTCAT 


TCATAGCTGA 


ACGAACATTG 


GCCAGACTTG 


5040 


AAAAATCTTC 


ATTGTCCAAC 


ATACTGTAGG 


CATTGGTCAG 


TGTATCCGCA 


ATATTTTTGT 


5100 


GGTTGAGGAG 


TTTATCTCGC 


TCTTGATTGA 


GAGCCAAGTC 


TTCTCCAGCC 


TGCAAGTTTG 


5160 


CTGCCTCAAT 


CTCTGCCATT 


TGAAATTCCA 


ACATTTCGAT 


ACGTGCCTTG 


TGTTCCTGTT 


5220 


GGTTTTTCTT 


GACTTCCAGA 


ACCTGCTTGC 


GCATTTTCCG 


ATAGGCATCA 


AAACTCGTTT 


5280 


GATAGGTTTC 


TTTCAAGTCC 


CAAAAAGCGG 


CATCACCAAA 


TTCATCCAAC 


ATCTGGATAT 


5340 


GCAGTTGGGG ACGCATTAAC 


TCCTCATGGT 


CATGCTGACC 


ATGAATATCT 


ACAAGATGTT 


5400 


GCCCAATAGC 


TCGCAAAACA 


GACAGATTAA 


CCATCTGACC 


ATTTACACGG 


CTGATACTAC 


5460 


GACCATTTTG 


CAAGATTTCC 


CGACGGATGA 


TAATTTCATC 


ACCTAATTCT 


AAACCTTGCT 


5520 


CATCAAAAAT 


TTCCTGTAAA 


AGACGACTAT 


TCTCAACTGA 


GAAAAGCCCC 


TCAATCTCTG 


5580 


CCTTTGGTGC 


ACCATGACGA 


ATAACATCTG 


TCGTCGCACG 


AGCTCCCAAC 


ATCATATTCA 


5640 


TGGCATCAAT 


GATAATCGAC 


TTCCCTGCAC 


CCGTTTCACC 


AGTCAGGACA 


GTCATOCCCT 


S700 


TTTCAAAATT 


GAGGGAAATA 


GCCTCAATAA 


TGGCAAAGTT 


TTTTATCGAA 


ATTTCAAGTA 


5760 
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ACATATAGAC CTACCAATTT 


TTTACTTGTT 


CAAAGATTTC 


CTCTGCTAGA 


CTTCCACTTC 


5820 


TGGCAATGAC 


TAAAATCGAG 


CTATCATCAG 


TCAAACAGCT 


AAAAATCTTG 


TCTGCAAAAG 


5880 


TCTCGATTAA 


CTGAGCTTTT 


ACAAAAGCCG 


TATTTCCTGG 


AATAACTTGG 


AGATTGATCA 


5940 


TCTTATCCAT 


CAATTCAGCC 


GATTCGATAT 


TGTCTTCAGC 


CAGTTGCAGA 


CTTTTTACGA 


6000 


TTGATTTTGG 


CAATTCGTAG 


ACATAGGTGT 


TGTCTCTCAA 


AGGAATTTTG 


ACAATACCTA 


6060 


ACTCTTTGAT ATCTCGGGAT 


ACCGTCGCCT 


GAGTGGCAGT 


GATACCTGCT 


TCTTTCAAAT 


6120 


GTTCTACAAT 


TTCTTCTTGC 


GTGCCGATTT 


GATAATCTGT 


CACCAATCTT 


CTAATTTTTT 


6180 


CAAGTCTCTC TTTTTTATTC ATTTTTAAAT 


TGACTATGCG 


CCCTCTCTAC 


TGCTTCTTTA 


6240 


ATCTCAGCAA GAATCTGATT 


GCTTGCTGAC 


TTTTCTTTTT 


TCAAATACGC 


TAAAAATTCA 


6300 


ATATTTCCAT 


GTCCACCTTG 


GATGGGAGAA 


AAGTCCAAGC 


CAAGGACTGA 


AAAACCTACC 


6360 


TCTACTGCCA 


TAGCTGTTAC 


AGATTCAAGG 


ACATTCTGAT 


GAACCTTAGC 


ATCTCGAATA 


6420 


ATTCCATTTT 


TCCCAATCTG 


CTCACGTCCT 


GCCTCAAACT 


GAGGTTTGAC 


AAGTGCTACC 


6480 


ACCTGACCTT 


GATCAGCCAA 


GACACGGTGC 


AAGGCTGGCA 


AAATCAGACT 


AAGGGAAATG 


6540 


AAACTCACAT 


CAATACTGGC 


AAAGCTCGGC 


TCCTGCTCGA 


AATCAGTCTT 


TTCAGCATAG 


6600 


CGGAAATTGA 


ACTGCTCCAT 


GCTGACAACT 


CGTGGGTCTT 


GGCGTAATTT 


CCAAGCCAAC 


6660 


TGATTGGTAC 


CAACATCGAC 


TGCAAAGACC 


AACTTGGCAC 


TATTCTGTAG 


CATGACATCG 


6720 


GTAAAACCTC 


CAGTAGAGGC 


CCCGATATCA 


ATCGTAGTCG 


CGCCATCCAC 


CGACAAATCA 


6780 


AAGACCTGCA 


AGGCCTTTTC 


CAGTTTCAAA 


CCACCACGGC 


TGACATACTT 


GAGTTTCTCC 


6840 


CCCTTGAGTT 


TTAATTCGGT 


GTCATCTGGA 


ATTTTCTCTC 


CTGGCTTGTC 


AAACCGTTCT 


6900 


CCATTAAGGA 


CTGCTACGAC 


TAGGCCAGCC 


ATCACACCTC 


GCTTGGCCTG 


CTCTCTCGTT 


6960 


TCAAACAACC 


CCTGTTTATA 


AGCTAGTACA 


TCCACTCTTT 


CCTTAGCCAT 


TGATTCTCAA 


7020 


ACTTTCTACT 


ACACTTACAA 


TCGATTCTGT 


TTCAAAGGGA 


AGCTGCTGGG 


CAATTTCT TC 


7080 


TAATTTTTCA 


TTAGCTTGAT 


CCAGGGTTTG 


GTTACAAAAG 


GCAATGGACT 


CTTCCAAGCC 


7140 


CAACAGGGCA 


GGATAGGTTG 


ATTTTTCTGC 


CTGCAGATCC 


TTTTGAGGTG 


TCTTGCCGAT 


7200 


TTCCTCAAAA 


CTAGCTGTCA 


CATCCAGTAC 


ATCATCTCTG 


ACTTGAAAAG 


CAAGTCCAAT 


7260 


CAATTCACCC 


ACAGTTTTCA 


GCTTCACCTG 


CATTTCAGGT 


GACAATTCAG 


CTATAATAGC 


7320 


TGCCGCTTGG 


AAGGGATAGG 


CTAGTAACTT 


CCCAGTCTTA 


TTGGCATGAA 


TAGTCTGAAG 


7380 


TTCTTCCAAA 


GACAAGTGCT 


GGTGTTCGCC 


CTCCATATCC 


AAAACTTGCC 


CTGCTACCAT 


7440 


ACCCAGACTA 


CCTGAAGCAA 


GGGATAAGTT 


GGCAATCAAG 


TCCACCTTAA 


TCTGACTTGG 


7500 


CAAATCTGCC 


TGCGCAATCA 


AGGCATATGA 


GTCTAAGAAT 


AAGGCATCTC 


CAGCCAAAAT 


7560 
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GGCCATAGCT TCACCGAATT TCTTGTGATT GGTTAACCGC CCTCTTCGAT AATCGTCATC 7620 

ATCCATAGCA GGAAGGTCAT CGTGAATCAA GCTCCCTGTA TGAATCATCT CTAAGGCAGT 7680 

AGCTACCTGC GCGTGAGCAG GTTTGATGGT AACCTGCAAG GCTTCCAGAA CTTCTAACAA 7740 

GAGAAAAGGC CGAATACGCT TGCCACCAGC ATGAATAGAA TAGAGAACAG ACTCCCGTAA 7800 

ACTAGAGGCA AACTGCTGGT CTCCATAAAA ATCTTCCAAA GCCGACTCGA CAAGAGCTAA 7860 

TTTTTCTTGC TTTTTCATTC AAAATCACTT TCTGTTCCGT CTTCTTGCAT GACCTTGACC 7920 

AAGGTCTTTT CAGCCTTGTC CAGCGTAGCT TGGAGCTCTT TTGACAAGAC CATGCCCTTT 7980 

TGAAAGGCAG TAATCGCATC TTCCAGAGCA ATTTCACCAT TTTCCAAACT TTGGACAATG 8040 

GTTTCCAGTT CTGCTAGATT TTCCTCAAAT TTCTTTTGTT TTGACATCTT TAACCTCTAA 8100 

TTCTACTTGA CCATCTCGCA TCAAAAGCGT TACTTGGTCT TTTTTCTTCA AACTCTCAAC 8160 

CGAATCTACA ACGGACTCTT CTTTTTTGAC AATAGCATAA CCACGCGCCA CGATTCGGCT 8220 

AGTATCCAAC ATGAGCAAAG CTTCCGAAAG TCGCTTGGCC TCAGCAACCT TGGCGTCATA 8280 

AACTAACGCC ATTTGGCTAC CTAAGAGCTT GTCCAACTGT CCTAAACGGT CTTGATAGCG 8340 

TTGGATTTTG GTAACAGGTG ATAATTGTAC TAATTGATGA GTTCTTGCTT GAACTAATTG 8400 

TTTGTTATCA GAAATCCGAG TTCGCAAACT TTGTTTCAAA CGCAGTTGCA GTTGGTCCAA 8460 

GCGTTGCAAA TAACCGTCAT ACAAGCGCTC AGGTTGTCTA AAGATAACAG ACTGACTGCA 8520 

TTTTTTCAAA GCCTCTTGTT TCTTAGATAG AACATTTCGG ACTGCCGTTA CCATCCGTTT 8580 

TTCCTGATTT TGCAAATGAG CTAATACATC CAACTTGGTC ACAGGTGTTG CCAGTTCAGC 8640 

CGCCGCTGTT GGCGTTGCAG CGCGTCGATC TGCCACAAAA TCTGCCAAGG TCACATCCGT 8700 

CTCATGCCCC ACACTAGAGA TAACTGGCAA ACGAGATTCA AAAATAGCTC GTACCACAAT 8760 

TTCTTCGTTA AAGGCCCAGA GATCCTCAAT AGAACCACCT CCACGACCAA TAATGAGCAA 8820 

ATCCAAATCG TCCCGTTGAT TAGCACGCGC AATATTTCTA GCAATTTCCT CCGCAGCCCC 8880 

TTCACCTTGA ACCTTGGTCG GATAAAGAAG GATGTCAACA CCTGGGAATC GCCTGCTGAC 8940 

GGTCGTGATA ATATCTCGAA TAACGGCTCC ACTACGGCTG GTTACTACAC CAATTCTCTT 9000 

AGAAAATTGG GGCAGAGCTT GCTTGAAGCG TTCTTGAAAC AGGCCTTCTT CTGTCAATTT 9060 

TTTCTTAAGT TGTTCAAACT GAATCGCAAG CGCCCCAACC CCATCAGGCT CAGCTTTTTC 9120 

AATGATGATG GAGTAGCTAC CACTTGGTTC ATAGACCTGT ACACGCCCAA TCACATTGAT 9180 

CTTCATTCCT TCTTCCAGGT CAAACCCTAA TTTCTGATAA ATCCCAGACC AGATGGTCGC 9240 

TTGAATAACT GCATGGTCAT CCTTTAGGGA GAAATATTGG TGAGTAGGTC GTTTACGAAA 9300 
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GTTGGAAACT 


TGACCAGTTA AATAGACCCG 


628 
TTCCAAGTAT 


GGGTCTTTAT 


CGAATTTCAT 


9360 


TTTCAGATAC 


TTGGTCAAAG 


TTGTTACCGA 


TAAATACTTT 


TCCATCTCCA 


CCTACTATTC 


9420 


ATTTACTTGC 


TCTTTCATGG 


GTATTATTAT 


ACCAAAAATA 


TGCCTAAAAA 


TCTCCATTTA 


9480 


TGTACCATTA 


TGAGGGAAAA 


ATAGAAAAAG 


GAGGCAAGGC 


CTCCACATGT 


GATTATTTGC 


9540 


TGTTTCGAGC 


TTCTTCCAAA 


ATCTTTGCAA 


TCTTGGTCGT 


CAACAGGTCG 


ATAGCCACGG 


9600 


TATTGCTAAC 


CCCTTCAGGA 


ATGACGATAT 


CAGCATAACG 


CTTAGTTGAC 


TCGATAAACT 


9660 


GGTGGTACAT 


TGGTTTGACC 


ACACCTAAGT 


ACTGGTTAAT 


AACGCTATCA 


AGGCTACGGC 


9720 


CACGCTCCTC 


CATATCACGC 


TTGATACGAC 


GAATAATGCG 


CACATCGTCA 


TCCGTATCCA 


9780 


CAAAAATCTT 


GATATCCATC 


AAATCGCGCA 


GACGCTTGTC 


CTCCAAGACC 


AAAATACCCT 


9840 


CAACGATAAA 


GACATCTTGA GGTTCCTGAC GATAGGTCTT GCTACTCCGT 


GTATGCTCTG 


9900 


TATAGTCGTA 


GGTCGGGATG 


TCCACCGGAC 


GCCCTGCCAA 


CAATTCCTTA 


ATCTGCTCGA 


9960 


TCATCAAGTC 


TGTATCAAAG 


GCAAAAGGAT 


GGTCATAGTT 


GGTTTTGACG 


G 


10011 


(2) INFORMATION FOR SEQ ID NO: 77: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5365 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY : linear 








(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 






CGTGTGGTCT 


TAAAAATAGA 


AGACAAAGAA 


CAAACTGTTG 


GAGGCTTTGT 


CCTTGCAGGC 


60 


TCAGCCCAAG 


AAAAAACCAA 


AACAGCTCAA GTTGTGGCTA 


CTGGACAAGG 


TGTTCGTACC 


120 


TTGAACGGTG 


ACTTGGTTGC 


TCCAAGTGTT 


AAAACTGGAG 


ATCGTGTCTT 


AGTTGAAGCC 


180 


CACGCAGGTC 


TTGATGTCAA 


AGATGGCGAT 


GAAAAGTACA 


TCATCGTAGG 


CGAcTAACAT 


240 


TTTGGCAATC 


ATTGAGGAAT 


AGAAGGAGAA 


AGTAAGTATG 


TCAAAAGAAA 


TTAAATTTTC 


300 


ATCAGATGCC 


CGTTCAGCCA 


TGGTTCGTGG 


TGTCGATATC 


CTTGCAGACA 


CTGTTAAAGT 


. 360 


AACCTTGGGA 


CCAAAAGGTC 


GCAATGTCGT 


TCTTGAAAAG 


TCATTCGGTT 


CACCCTTGAT 


420 


TACCAATGAC 


GGTGTGACCA 


TTGCCAAAGA 


AATCGAATTG 


GAAGACCATT 


TTGAAAATAT 


480 


GGGTGCTAAG 


TTAGTATCAG 


AAGTAGCTTC 


TAAAACCAAT 


GATATCGCAG 


GTGACGGAAC 


540 


TACGACTGCA 


ACAGTCTTGA 


CCCAAGCTAT 


CGTCCGTGAA 


GGAATCAAAA 


ACGTCACAGC 


600 


AGGTGCAAAT 


CCAATCGGTA 


TTCGTCGTGG 


GATTGAAACA 


GCAGTTGCCG 


CAGCAGTTGA 


660 


AGCTTTGAAA AACAACGCCA TCCCTGTTGC 


CAATAAAGAA 


GCTATCGCTC 


AAGTTGCAGC 


720 
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CGTATCTTCT CGTTCTGAAA AAGTTGGTGA GTACATCTCT GAAGCAATGG AAAAAGTTGG 780 

CAAAGACGGT GTCATCACCA TCGAAGAGTC ACGTGGTATG GAAACAGAGC TTGAAGTCGT 840 

AGAAGGAATG CAGTTTGACC GTGGTTACCT TTCACAGTAC ATGGTGACAG ATAGCGAAAA 900 

AATGGTGGCT GACCTTGAAA ATCCGTACAT TTTGATTACA GACAAGAAAA TTTCCAATAT 960 

CCAAGAAATC TTGCCACTTT TGGAAAGCAT TCTCCAAAGC AATCGTCCAC TCTTGATTAT 1020 

TGCGGATGAT GTGGATGGCG AGGCTCTTCC AACTCTTGTT TTGAACAAGA TTCGTGGAAC 1080 

CTTCAACGTA GTAGCAGTCA AGGCACCTGG TTTTGGTGAC CGTCGCAAAG CCATGCTTGA 1140 

AGATATCGCC ATCTTAACAG GCGGAACAGT TATCACAGAA GACCTTGGTC TTGAGTTGAA 1200 

AGATGCGACA ATTGAAGCTC TTGGTCAAGC AGCGAGAGTG ACCGTGGACA AAGATAGCAC 1260 

GGTTATTGTA GAAGGTGCAG GAAATCCTGA AGCGATTTCT CACCGTGTTG CGGTTATCAA 1320 

GTCTCAAATC GAAACTACAA CTTCTGAATT TGACCGTGAA AAATTGCAAG AACGCTTGGC 1380 

CAAATTGTCA GGTGGTGTAG CGGTTATTAA GGTTGGAGCC GCAACTGAAA CTGAGTTGAA 1440 

AGAAATGAAA CTCCGCATTG AAGATGCCCT CAACGCTACT CGTGCAGCTG TTGAAGAAGG 1500 

TATTGTTGCA GGTGGTGGAA CAGCTCTTGC CAATGTGATT CCAGCTGTTG CTACCTTGGA 1560 

ATTGACAGGA GATGAAGCAA CAGGACGTAA TATTGTTCTC CGTGCTTTGG AAGAACCCGT 1620 

TCGTCAAATT GCTCACAATG CAGGATTTGA AGGATCTATC GTTATCGATC GTTTGAAAAA 1680 

TGCTGAGCTT GGTATAGGAT TTAACGCAGC AACTGGCGAG TGGGTTAACA TGATTGATCA 1740 

AGGTATCATT GATCCAGTTA AAGTGAGTCG TTCAGCCCTA CAAAATGCAG CATCTGTAGC 1800 

CAGCTTGATT TTGACAACAG AAGCAGTCGT AGCCAATAAA CCAGAACCAG TAGCCCCAGC 1860 

TCCAGCAATG GATCCAAGCA TGATGGGCGG GATGATGTAA GCTTTCTATA GAAAACAACT 1920 

TATAAAAAAC ACAAAAGGAG GGAATGACTA ACCCTTCTTT TTATAGGCTC TTTGTCAACT 1980 

GTAGTGGGTT GAAGTCAGCT AAGCTCGAGA AAGGACAAAT TTCGTCCTTT CTTTTTTGAT 2040 

GTTCAAAGCG ATAAAAATCC GTTTTTTGAA GTTTTCAAAG TTTCGAAAAC CAAAGGCATT 2100 

GCGCTTGATA AGTTTGATGA GATTATTGGT CGCTTCCGGT TTGGCGTTAG AATAGTGTAG 2160 

TTGAAGGGCG TTGATAATCT TTTCTTTATC TTTGAGGAAG GTTTTAAAGA CAGTCTGAAA 2220 

AATAGGATGA ACTTGCTTAA GATTGTCCTC AATAAGTCCG AAAAATTTCT CCGGTTCCTT 2280 

ATTCTGAAAG TGAAACAGCA AGAGTTGATA GAGCTGATAG TGATGTTTCA AGTCTTGTGA 2340 

ATAGCTCAAA AGCTTGTCTA AAATCTCTTT ATTGGTTAAA TGCATACGAA AAGTAGGACG 2400 

ATAAAATCGC TTATCACTCA GTTTACGGCT ATCCTGTTGT ATGAGCTTCC AGTAGCGCTT 2460 
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GATAGCCTTG TATTCATGGG ATTTTCGATC CAATTGGTTC ATAATTTGAA CACGCACACG 2520 

ACTCATAGCA CGGCTAAGAT GTTGTACAAT GTGAAAGCGA TCCAACACGA TTTTAGCATT 2580 

CGGGAGTGAA ACAGTCTGGG AGACTGTTTC AGCCTGAGCC TAGAAATTTG AAAGCGAAGC 2640 

TGTTTAGCCA AGTCATAGTA AGGACTAAAC ATATCCATCG TAATGATTTT CACTTGACAA 2700 

CGAACGGCTC TATCGTAGCG AAGAAAGTGA TTTCGGATGA CAGCTTGTGT TCTGCCTTCA 2760 

AGAACAGTGA TAATATTAAG ATTATCAAAA TCTTGCGCAA TGAAACTCAT CTTTCCCTTA 2820 

GTGAAGGCAT ACTCATCCCA AGACATAATC TTTGGAAGCC GAGAAAAATC ATGCTCAAAG 2880 

TGAAAGTCAT TGAGCTTGCG AATGACAGTT GAAGTTGAAA TGGCCAGCTG ATGGGCAATA 2940 

TCAGTCATAG AAATTTTTTC AATTAACTTT TGAGCAATCT TTTGGTTGAT GATACGAGGG 3000 

ATTTGGTGAT TTTTCTTTAC CAGGGGAGTC TCAGCAACCA TCATTTTTGA ACAGTGATAG 3060 

CACTTGAAAC GACGCTTTCT AAGGAGAATT CTAGAAGGCA TACCAGTCGT TTCAAGATAA 3120 

GGAATTTTAG AAGGTTTTTG AAAGTCATAT TTCTTCAATT GGTTTCCGCA CTCAGGGCAA 3180 

GATGGGGCGT CGTAGTCCAG TTTGGCGATG ATTTCCTTGT GTGTATCCTT ATTGATGATG 3240 

TCTAAAATCT GGATATTAGG GTCTTTAATA TCGAGCAGTT TTGTGATAAA ATGTAATTGT 3300 

TCCATATGAA TCTTTCTAAT GAGTTGTTTT GTCGCTTTTC ATTATAGGTC ATATGGGACT 3360 

TTTTTTCTAC AACAAAATAG GCTCCATAAT ATCTATAAGG GATTTACCCA CTACAAATAT 3420 

TATAGAGCCG AAAATTCACA TCTAATATAT GCAGACTACT TTGAAATGAA ATTAAAAAAA 3480 

TTATTAAAGG ATGACACAAA AGTTTTTGAA AAATCTACAT TCAAATTTGT AGAAGGATAT 3540 

AAAATATACC TGACAGAATC TAAAGAATCT GGAATTAAAC AAATGGACAA TGTCATAAAA 3600 

TATTTTGAGT TTATTGAATC TAAAAGTATT GCTTTATATT TTCAAAAACG ATTAAATGAG 3660 

CTGATAGATT AAATAGCATT TTCTCTGTTG AGATATTGTT TTTAAAATAT TGTACTAAAT 3720 

GATTGATGCT ATGTGGAAAT ACAAAAAAAT GTTTTTGATA CGAAGTTGAC CTGTATTTTT 3780 

TATACTAATC ATTTTCGTAT TTTTTGTATT AAACGATATA AGTTTGTTGT AAACTTACAA 3840 

GGAATAAAGA CATTAAAAAA TAACAGTATA TCTATTTGTT TTATATATTT TACGAATTCT 3900 

GCATAAATCT CTTTCTAGTA ATGTGTTGTA ACTCTGCTAT AATAGATTTA TTCCTTTTTG 3960 

TGTTTACACA ATTTATTTTA TAGTACCAAA AAAGGTCAGG ATTTTGTTCC TGACCTTTGA 4020 

CAACTTTACC GATTCTTTAG TTCTACATAG CGCTTGTACC AAATGTTTAC ATAGGCTTCT 4080 

GAGAAAGGAC CACGTCCATT GTTAATCCAA TCAACAAGAA TTTTGACATG TTCTTTTAAA 4140 

ATATAGTCCA AGTCATCAGA ATAATTCATT TTGCGTTTGT GACGCTCGTA CTCTTCAACG 4200 

TCCAAGAGAC GTTTTTCCCC ATCTGTAAAA ATTTTAACAT CCAAATCGTA ATCAATATAC 4260 
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TTCAGTGCTT CTTCATCCAG ATAGTAGGGG CTAGCCATAT TGCAATAGTA AGAAGTTCCA 4320 

TTATCACGAA TCATGGCAAT GATATTAAAC CAATATTTCT TGTGAAAGTA AACAATAGCC 4380 

GGTTCTCGAG TGACCCAACG ACGACCATCA CTTTCGGTAA CAAGTGTATG ATCGTTGACA 4440 

CCAATAATGG CGTTTTCTGT TGTTTTTAGT ACCATGGTGT CCCGCCAAGT TCGGTGGAGA 4500 

CTCCCATCAT GCTTATAACT TTGAATTGTA ATAAAGTCGC CTTCTTTTGG AAGCTTCATA 4560 

ACTAACCAAC TTTCTACAAT TTATAAGTTT ATCATTTACT ATTGTACCAT AAAATTACCC 4620 

AAAATCTGTG AATTTCACTT GGAAATATTA AAGATATTCT CTAAGAGCGC TTGCTATATC 4680 

CGAAAAATCG TAGCCCTTTC GTGCTAAAAC TTGAGTTAAA CGCTGCTTCA GTTCGTATCC 4740 

TTCATACTTT CGGGCATACT TAGTATATTG CTTATCAAGT TCCTTGAAGA TGAGTTCCTG 4 800 

AGTCGTTTCT TCATCAACTT GACTATCCAA TTCGTCAAAG GCAATTTTAG CATCAAAATA 4860 

AGAGAAGCCC TTGTTAGTCA AGTTCTGGAT AATCTTATCT TGCAGGGCAC GAGCTGGAAG 4920 

TTTTCCCTCA TATTTTTTCA ATAGTTTATT GGCTACACGT TGAGCAACTT CCGAAAAATC 4 980 

AAAATCATTC AAGATTTCTT CTATAGTAGA TTTTGAAATT CCTTTTTGTG CTAATTTCTG 5040 

AGTCAGTACA TAAGGTCCCT TGTCTCCTGA AAGTTGATTG GCATTGATGA TAGCATAAGC 5100 

GTACTGGCTA TCATTAATCC ACTTCTCTTC TTTAAGATTA GCAATGACTT GAGAAACGAT 5160 

GTTTTCATTA ATAT CAT ATT TTTTCAGATA TTCTCTGACC TCTTTTTCAG TACGTGCTTT 5220 

AAAGGATAAG TGGTAGAGGG CCAGATTCTT ACCATAAGAA AATTGAGCAA AGTCTTGAAT 5280 

CTCTTTCAAT TCCTCTTCGC TTATCACCTT ATCTCTCGAT AACATAAAAC GAACAATTGT 5340 

GTCTTCGGTG ATATAGCATT TGTCG 5365 



(2) INFORMATION FOR SEQ ID NO: 78: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

TTTCCAGAAA GAAGTTGAGT AAAGTCTTTA TCAAAGAGAA TGACTTCCGT ATTGGAACTG 60 

ACATTAGGTT TTATTTCTAC TTTACTAGCG TCCGCCCTAG CATTTTCTAA ATCTTTAATC 120 

TCTTCTGTTG CCCTATTTAT AGCCAGCTGA ATAACTGCTT GAGGATTTTC ACTCAGTCCA 180 

TGAAGCTTAT CGTCCACCGA AGT ATAAAGA CTCGAATGCA TGACTTGTAA AATAATCAGA 240 
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GTCATTGTAG AAAAAATCAG GGTGAAGACA 


CCGAAGTTGC 


GGATAAAATA 


ACTAAAGTCA 


300 


TCCGCATACC 


ATGTTTTTTT 


AAGTTTACTG 


AACATCTTTT 


AAAAGATACC 


CAACACTACG 


360 


CAAAGTTTGC 


AAATTCTCTG 


CAAAAGTGGT 


TCCCTTTAAT 


TTCTTACGGA 


CTTTTGAAAC 


420 


ATAGACTTCG 


ACAACCGAAA 


TCGTTGTATC 


ACTATCAAAT 


CCCCATAGAC 


GGTCAAAAAT 


480 


CTGCGTCTTA 


GGCAAAATCA 


CATTTTGATT 


TTGAAGGAAA 


TAAACTAGTA 


AATCGAACTC 


540 


TTTCCCCAGC 


AATTCGACAG 


GAGTATCTTC 


AACTTTAACG 


GTATTGGTTG 


ATAAATTAAC 


600 


CACGATATTC 


CCATAAGTCA 


AGGTGTTTTC 


ATTAAACTTC 


CCTGAACGTT 


TGAGAAGGGC 


660 


CTGAATCCGC 


ATTTTAAGTT 


CTTCTAGGTA 


GAAAGGTTTG 


GTCAGATAAT 


CATCCGCTCC 


720 


CAGTTCAAAT 


CCATGTCCCT 


TGTCATCCAA 


ACTTTCCTTG 


GCAGTCATAA 


TCAGAACTGG 


780 


TGTCGTAATT 


CCCTTTTCAC 


GCAATTCTTT 


TAAGACTTGG 


AAACCATTTT 


TTTCTGGCAA 


840 


CATCAAATCC 


AGCAAAATCA 


AGTCATAGAC 


ACCACTCTCA 


GCTTCGTAGA 


GACCTTCTTC 


900 


TCCATCAAAT 


ACCTGCATAA 


CATCCGCAAA 


ATCGTCTAAA 


AAGTCAAATA 


CTGAATTTGA 


960 


CAGACCTAGG 


TCATCCTCAA 


CCAATAAGAT 


TTTTATCATG 


AGAAACTCCT 


CCTTATTAAA 


1020 


ACTATTATAC 


CAAATTTGCC 


TTAAAAAAAA 


CTCAACTCTC 


TGCATTTTAC 


ATGAGATAGC 


1080 


TGAGTTTTCT 


TTTTATTTTA 


GGCTTATTTA 


TGCATTTCCG 


TATTGAAGAA 


CAACTGCTTC 


1140 


GACTGCAGCT 


TTTTCACGGC 


TAATCAAGTC 


AACACGCGCT 


GCAATTTCCT 


TGATTCCCAT 


1200 


ACCGATGTTA 


CGGCTAAGAG 


CAAGGTCAGA 


AAGTTGCGGT 


TCAAAGAACT 


CCTTGTATTC 


1260 


CGCCAAGCGT 


TGCTGAGTCT 


TAAATACATG 


AGCAGGAAGG 


ATAACAAAGC 


TATCAAAGCT 


1320 


CATATCTCCT 


CCAAGGGCTG 


CCTTAATCCA 


AGCCCAGTTT 


TCACGCGCCC 


AAGACCAAGC 


1380 


TGTTTTCTGA 


GTTGCTTGAT 


GAGCTAGGAA 


TTGGTAATAC 


CAAGCAGACA 


AGTCCTGTGG 


1440 


TTTGACCACA 


AATTTGTCCT 


TCCAAGAAGT 


AATCAGGTTT 


TGGATATTAT 


CCGCATCTGT 


1500 


ACTGTATGCA 


AGAGCTGCTG 


CCAACTGGCG 


1 1 livlnuAUi 




1 uAo 1 AM A 


1560 


AGTATCAAGA 


TAAAGTGCTA 


ACAAGTCTTT 


AGTCTCATGA 


TGTTTCATCT 


CATTAATCAG 


1620 


AACTTGTGAG 


CGAATAGCTG 


CTGGGAGTCC 


TGCAAGATTC 


TCCTTGTGTG 


TTGCGAAGAT 


1680 


TTGGCTAGCG 


ACTTGACTAG 


CTTCTGCATC 


ATTTGAGCGA 


ATCATCATCG 


AAACAGCCAG 


1740 


CTGACGAACC 


AATTCATCCT 


CATCTGATTC 


TCCGTCTTTA 


GCTTCAAAAC 


CAAGACGGTC 


1800 


ATAGTTATGA 


CGAGCCAATT 


TAGCAACCAG 


TCCTTTGAAG 


GCTGTTTCAG 


CATCCGTTCC 


1860 


TTCATCAATA 


AAGCGCTCAA 


GGGCTGAAAT 


CACTTGAGAA 


ACAGCTGAAA 


CCACCAGATA 


1920 


AGACTCTTCC 


TTAGCAAGTT 


TATCAAGAAC 


TGGAAGCAAG 


TCTGCATAAG 


AAATGTGCCC 


1980 


TGCCTCAGCC 


AACAAACGAC 


GTTCTTGAAC 


AATTTGCAGT 


TTGCTTGTGT 


TATCAAGTGT 


2040 
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CTCTAGCTCA GCAAGAACAG CTGCTAACAA GTCTCCTTGA TAGTCGGTAA TATAGTGGGC 2100 

AGTATTTTCA GTGTTGAGAC GAAGAGCTCC TTCATTTTCA GCAAGAAGAG CTGCGTAGCC 2160 

AGGGATTTCG ATACTTTCAG TTTCGAGTGT ATCAGGCAAG CCTTTCCAGT TGCTATTGAG 2220 

GGGCACCACC CAGAGACGGT TCTTGTCTTC GTTCTCACCG ATGAAGAATT GTTTTTGTGA 2280 

AATCTTCAAG ACATCATTTT CAACTTTAAC AGTAAGAACT GGGTAACCAG GCTGTTCCAA 2340 

CCAAGAATCC ATGAAGGCTG CGACATCACG TCCTGACGCT TGACCAAGGG CATCCCAAAG 2400 

GTCACTACCA ATGGTGTTGC TGTATTGGTG TTTTTCAAAG TAGGCGTGCA AACCTTTAGC 24 60 

AAAATCAGCA TCTCCTAGCC AACGGCGAAG CATGTGCATG AGACGGCTTC CTTTGGCATA 2520 

GACGATAGCG CCGTCAAAGA GTGTATTGAT TTCATCTGGA TGTTTAACTT CGACGTGGAC 2580 

AGACTGAACG CCATCAGTAG CGTCACGTTC AAGAGCAAGA GGTACTCCAC CTGTTTGGAA 2640 

ATCTTCAAAG ATATTCCAGC TTGGTTCGAT GGTATCCACA CAGACGTATT CCATCATATT 2700 

AGCGAAACTT TCATTGAGCC AAAGGTCATC CCACCATTTC ATAGTCACGA GGTTCCCAAA 2760 

CCATTGGTGA GCCAATTCAT GGGCCACAAC AAGGGCAACT TGTTGACGGC TAGCAAATGT 2820 

AGAGTTCTCA TCGACAACCA AGTAAACTTC ACGGTAGGTC ACAAGACCCC AGTTTTCCAT 2880 

AGCACCAGCT GAGAAGTCAG GAAGGGCGAT GTGGAGAGAT TGAGGAATTG GGTACTTAAC 2940 

TCCATAGTAA TCTTCGTAAA ACTCGATAGA GCGAACAGCG ATATCCAGTG AGAAATCAAG 3000 

ATTTGAAAGT GGATGTGCTT TGGTTGAGTA GACACCTACC AGGGTACCAT TTTTAGTTTT 3060 

AGCGGTCACC CCTTGCAAAT CACCAGCAAC AAAGGCCAAC AAGTAAGAAG ACATGCGAGG 3120 

TGTTGTCTCA AACTTCCAGA TACCTGTTTC CTTACGGTTT TCAACATCGA TTTCTGGCAT 3180 

GTTTGACAAG GCCAATTCAC CTTCTGCTTG GTCAAAGCGA AGAGAGAGGT CAAAAGTTGC 3240 

TTTGGCTTCA GGCTCATCCA CACATGGGAA AGcTTCGCGC GCAAAATGGC TCTCGAACTG 3300 

AGTAGACAAG ACCTCCTTCT TGACTCCATC AACTGTATAA TAAGAAGGGT AAATCCCTGT 3360 

CATGTTGTCT GTAATTTTAC CAGAAAAGGC AAGAACCAAT TCAACTTGAC CAGCCTCAGC 3420 

CAATTCGATA TGAAGGGCTT CATTGTCATG GTCAACTGTA AATGGACGAG CTTGACCTGC 3480 

AACTTCTACA GAGGTGATTT CCAAATCTTT TTGGTGGAGG GAGATGCGGT CACTCTGTGC 3540 

TTGACCAGTG ATGGTCACTT TCCCAGAAAA AGTCTTGGTC TCACGACTCA AATCTAAAAA 3600 

TAAATCATAA TGTTCAGGAA CAAATTGCTT AATGGG 3636 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5066 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

ATAGCGTGTA ATAATCGATT TTAGAGGTAC CATAAGCCAC CTCCTACAAA TAGAAACCGA 60 

TATAAATCAA TGCCTTCCAC CCTTAGACTT CCCTAGTTCC TGTCTCAAGC GAAACATTTC 120 

TTTGAAACAG GAATAAGTTA ACCAATTCAT ACCAATAGCT AGCAGAATAA AAAGAAACCA 180 

AATGCCCCAT AACTTGATAT CTGTCACATT TCTCAAGACG GTATTGAAAA ACAGAACTGA 240 

AACAACTGTC CAAGCAAGGC TAAAAAGAGA ATAGAAGGGG ATGTAAAACC AGTAAAAATA 300 

ATAAAAAATT GGAAAAAACT TACTATTTCT GTTGGCCTTT TCAATCCAGT TATCAAAATA 360 

AAAGTACGGT GCTAAAAGTA AGAATTTAAA CAAATGTTCC ATCACCGACA TCCCCCCTTC 420 

TTTTGATAGC GTTTTCTATT ATTTTATTAT ATCAAAAAAA TCCGGAACTG TCATTCCAGA 480 

TTCTACTTTT TTATTTGCGT TTTCTTGCGA TGAGATGAAT CGGTGTTCCC TCAAAAACAA 540 

AGGCCTTGCG GATTTGATTT TCCAAGAAAC GCAGGTAAGA AAAGTGCATG AGTTCTTCTT 600 

CATTGACAAA GATGACAAAG GTTGGTGGTT TGGTTGCCAC TTGGGTCGCA TAGAAAATCT 660 

TGAGACGTTT TCCTTTGTCT GTCGGTGTTG GGTTGATGGC AATGGCATCC ATGATGACAT 720 

CGTTCAAGAC AGCTGATGGA ATACGTGTAT TTTGACTTTC GCTGATTTGC TTAATCATCT 780 

CAGGAAGTTT GTGGAGACGT TGCTTGGTTA AAGCTGATAC AAAGATAATC GGTGCGTAAG 840 

GCAGGTATTG GAACTGCTCA CGGATATCTT CTTCCCAGTT TTTCATAGTG TGGTTATCTT 900 

TTTCAAGCGT ATCCCACTTG TTGACCACGA TAATCATCCC TTTACCAGCT TCATGGGCAA 960 

ATCCTGCGAT ACGCTTGTCG TACTCACGAA TGCCTTCTTC CGCATTGATG ACCATCAAGA 1020 

CCACATCTGA ACGGTCAATA GCACGCATGG CACGCATAAC AGAGTATTTC TCAGTATTTT 1080 

CATAAACCTT ACCAGACTTA CGCATACCAG CCGTATCAAT CATGGTAAAC TCTTGACCAT 1140 

CTGTATCTGT AAAGTGGGTA TCAATGGCAT CACGAGTTGT TCCAGCAACA GGACTAGCAA 1200 

TAACACGGTC TTCTCCCAAG ATAGCATTGA TCAAGCTTGA TTTTCCAACG TTAGGACGAC 1260 

CAATCAAGCT AAACTTAATG ACATCTGGAT TTTCTTCCTC ATATTCATTT GGAAGATTTT 1320 

CTACGATCGC ATCTAGCACA TCCCCTGTAC CGATTCCATG GACAGATGAG ATAGGCAATG 1380 

GTTCACCCAA ACCGAGAGCA TAGAAATCAT ATATATCATT TCTCATCTCA GGGTTGTCCA 1440 

CCTTGTTGAC TGCGAGGATA ACTGGTTTGT GGGTCTTATA AAGCTTACGA GCTACGTATT 1500 

CGTCTGCATC AGTAATTCCT TCCTTACCAG ACACGACAAA AACGATAACA TCTGCTTCTT 1560 
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CCATGGCAAT TTCTGCCTGG TGCTTGATTT GTTCCATGAA AGGAGCATCG ACATCATCAA 1620 

TTCCTCCTGT ATCAATCATG CTAAAAGAAC GATTGAGCCA CTCACCCGTT GCATAAATAC 1680 

GGTCACGTGT CACTCCTTCG ACATCTTCTA CAATGGAGAT TCGCTCACCA GCGATCCGAT 1740 

TAAATAGGGT TGATTTCCCA ACATTGGGAC GTCCTACAAT GGCAATAGTT GGTAGGGCCA 1800 

TAATTTCTCA CTTTCTACAA TAATTTCTTC TGTTCAAGAT TTTTTCTAGT TGAGCTTGGT 1860 

TCAGCTTGAC CAAACTGTTC TGCTAGGCGC TGACTCCAGC TTGTGGTCGC ACGCGCCCCA 1920 

GCATAGTCAG CCTGAACACG GTCATAAGCT TGGATTGCCT CAGTTGACTG TTCTTGGTAT 1980 

TCTTCCTCAA AGACAACATT CTCTAGTGGC AGTCTCGGTT TCATATCATG ATGTTGATTT 2040 

GGCACACCCA GTGCCATCCC AAAGACAGAA TAGGTGTAGT CAGGTAGGTT AAAGAGCTCT 2100 

GCCACTTCTT CAGACTTGTA TCGAACCAAA CCGATAATCA CACCACCATA GCCCAAGCTT 2160 

TCAGCTGCCA ACAAGGCGTT TTGTCCAGCA AGAGCTGCAT CGACCGAACT AATCAAGAGA 2220 

CCTTCCACAC CTTGGGGTTG GAAGGTGTCG GTATGAAGTC GGGCTCCCTT TTCTGCTCGG 2280 

TTCAAATCTC CGACAAAGAG AAGGAAAACA GCAGACTGGC GAATGGCTTC TTGAGGTACC 2340 

AATTCATACA AGGCATCTTT CTTCTCTTGA CTTCGTACCA CAATCACAGA GTAGGATTGG 2400 

AAATTCTTCC AAGATGATGC CATCTGGGCT GCTGTCAAAA TCTCATTTAA GTCTACTTGG 2460 

GGAATTTCTT GCTCTTTAAA CCTGCGCACT GAAGTATGAG CCTTCATCAA TTTAATGGTT 2520 

TCTGTCATCG ACGGTTTACT CCTTCTAAAC GAGTCTCCTC AGCCAAATAA CGGATGCGTT 2580 

CCATGACCCG TCTGGCTTCC CAGGTTTCGT CATTTCCATG TTTCACTTTC GCAAAATGCT 2640 

TCTCCAAATC TTCAAAGTTG AAGTTGGATG TGAAAAAGGT CGGTAAATTT TCCTGCATCC 2700 

GATATTGGAG AATGACCTGC AGGATTTCGT CACGCACCCA AACGGTTGAT TGCTCGGCGC 2760 

CAATATCATC TAAAATCAGG ACCTCAGACA GCTTAATCTC ATCCACCAAG GTCTTAACAT 2820 

TGCCATCACT GATAGCATTT TTGACATCAA TGACAAAGCT AGGATAGTGG AGGAGAGTTG 2880 

ATGAAACACC ACGTTTTTCT GATAAATCAT GAGCT AAGGC CGCCACCATG AAACTTTTAC 2940 

CCACACCAAA GTCTCCATAT AAGTAAAGAC CTTTTCGAAT AGCTGGATAT TGCTCCACGA 3000 

AGGCTAGTAG CTTTTCAAAA ACTGGTAAGC GCCCCAAATC ATCCAAGTCA ACTTGAGCCA 3060 

AACTAGCTTT CTTGAGACTG GCTGGTAGAT TGATTAACTT GAGACGGTTC TTAATAGCCG 3120 

CTTCTTTTTC AGCCGCGATT AGCTCAGGAG TTTCTTCATA TGAAACATCT GCATAACCAT 3180 

GATTCTTAAC CAAAATCGGC TTGTAGCCTT TGGCAATATA ATCCGTATCC CCACGGAGAA 3240 

ACTTGTCACG CTCGGTGATG TACTGATTAA ACTTGGAGAT ACTGCGATTT AATTCCTTTG 3300 
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GAGTTAAGGA TTCTTGCTGG ATAAAGGCCG CAACATCAGG GTCCTTCATG ATTTTCTGGA 3360 

CCAAATCTTG ATAATAAAAA CGGCTGGGTT GACGTTTGAG TACGTCTCCG ACACTTTCCA 3420 

TCTAATCTCC TCCTTTTTCT AATCGAGCTA ATAGTTCTTG CTTCTTACGT TCTAGTTCCA 3480 

GACGAGTTTC CTCGCTGGTT TCATTCTTAT ATTCAGGATT ACTCCATTTA GGAACATTGG 3540 

TTTTTTCTGG GGCAGTCTGA TTCTGTTTTT GTGTTTTTGC TTTCTGCCCT CGATCACGAA 3600 

TTCGTAAAAC GGCCTCTTCT GCCGAATGAA TCTTTTGATA GGCATAGTCA TTGGCTACCT 3660 

TCATGGCATA TTTCTCATTG ATATTTGCCG AATCCACCTT ATTAAAGGTC AATAAGAGAA 3720 

TAATATTGAT GACTTCGTCC AGTAAGCCCA AGCCAGCCAT CTGTTGCAAG AGTTCTCTTT 3780 

CTGTTTGGGT AATGGTTCCC TTGCGTGTTT GCTTGATTTC TGCTAAGAAC TGCAGGGCAG 3840 

TTTTACTTTT AGCTTCTTTG ATAATGGTCG CTTCCTTAAG ACTAAAGTCA GAGGAAACTG 3900 

GTTTTTGAGC AATTTTTTCA CGCATGCGTT TGGTTGAAAT AACCTGGGAA ACAGCTGTTG 3960 

ACTTGGCCAA TTGATAGGTT TCAAACCAAG TCCATTTCTT CTCCTCGGCA ATAGCAAAGA 4020 

GGTTTAAGAC ATCGGACTGC TCATCCGCAA AACGAAGTCC ATCTCGAGCC ATCAGCTGGC 4080 

GAAAATGTTC CAAGTCAAAA TCATTGGCCA CTTTCTTCTT GAGACCAAGG TCTTCTTGAC 4140 

TGCCTAGTTC TGCCAATTCT GGAAAGACTT GATTGAGTGA GACAGGTATT TCTTCACCAT 4200 

CAGCACTTTC AACTTTCAAA TCCTCCACAG CTACATCGCC AATCTTTTTC TCTAAGAGTC 4260 

TGCGATAAAC AGGATGCCCC AAGAAGTCTT GACTAGATAG AGGAGCATGG AGGGCTAGCT 4320 

GATAAACATC ACCCTTTTGA TAGAGGGTCA AGAGATTAAA AGCAGATAAG ATTTTCAATG 4380 

ATTTTATCAG TCTATCCATC CCAAAGTTGA GATGGTTGAG AATGCTTGAA AAAAGATATT 4440 

CCTTTCTACC ATTATCCCAA AAACTGATTG TATAAAGATA AAGGCTCAGT GCCTCCTGAC 4500 

CGATAATCGG GAGGTAGCAC TGTACCAGAG ATGAGGTATC TTGCGACACC CGATTATTCT 4560 

TTAGATAAGA AAAACGGTCA ATTGGCTTCA TTTATCTTTC CTTTTTCTTT TTAGAGGACT 4620 

GGGTGATTTG TTGGAGCAAG CTCTCTAACT CACTGACATC CTTAAAACTA CGATAGACAC 4680 

TAGCAAAACG TACATAGGTA ATCTCGTCCA ATTCAGCCAA CTCCTCCATG ACGAGTGAAC 4740 

CAATGTCCTC ACTTTGAATT TCATTTTCAT TTCGACCACG GAGTTTCTGT TCGATACGAT 4800 

TGACTACCAT GTTGATTTCA TCACTTGACA CAGGACGTTT CTGGGCTGAG CGGATAATCC 4860 

CATTAAAGAT TTTATCTCTG GAGAATTGTT CCCGTGTGCC ATCTTTTTTA ACAACCACTA 4920 

AGGTTCTTTC TTCTACTCGT TCGTAGGTTG TAAAACGGTG TTGGCATTCG TCGCACTCAC 4980 

GTCTTCTACG AATGGTGTTC CCTTCTTCTG CTTGGCGACT ATCGATAACA CTTGACTTGG 5040 

TAGCCCCACA TTTTGGACAG GGTACC 5066 
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(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9607 base pairs 
(BJ TYPE: nucleic acid 
<C) STRANDEDNESS : double 
<D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

CACTTGAAGT ATTTGAAACA GCTATGGAAA ACATCATGCC TGTACTTGAA GTACGTGCAC 60 

GTCGTGTTGG TGGTTCTAAC TACCAAGTCC CAGTTGAAGT TCGTCCAGAA CGTCGTACAA 120 

CACTTGGACT TCGTTGGTTG GTAACAATCG CTCGTCTTCG TGGTGAACAC ACAATGCAAG 180 

ACCGTCTTGC AAAAGAAATC TTGGATGCTG CTAACAACAC TGGTGCAGCA GTTAAGAAAC 240 

GTGAAGATAC TCACCGTATG GCTGAAGCTA ACCCTGCATT CGCACACTTC CGTTGGTAAG 300 

ATAGGATGCG AAAGCGTTAA GAAAGTCCCA GAGAAAATAG GGAATCGAAG CAGGTTGCGG 360 

TTGCAACCAA TGAGATTCAT CTTTTTCTCC AGACTTTTAG CTTGAGCTCA ACTAAATCAT 420 

GATGCTAGGA ACGGTAAGGA TGCAAGGTAA AAATAGGAAA CTGACGCAGT ATTCGACGAA 480 

TACAAGGAGT TTTATCTTTT TCACGCAGCA TCCCGTTCCA GCTCACATCG GCTAACTAAC 540 

TTTAGCCCGG GTTCAAATTA GCTAAATCGA TTAGTATTAG CTATAACTCA GCTTACCATC 600 

TCGTAAGTTG AAACCAACAA TAGCATGAAA ACATTGAGAA CGGGTAGGTC CTGCCTATCC 660 

GTTTTTATTA AAATCGTGTT ATAATAGAAT AGAAATCAAA AATAAATAGG AGAAACAAAC 720 

CTCATGGCAC GCGAATTTTC ACTTGAAAAA ACTCGTAATA TCGGTATCAT GGCTCACGTC 780 

GATGCCGGTA AAACAACAAC TACTGAGCGT ATTCTTTACT ACACTGGTAA AATCCACAAA 840 

ATCGGTGAAA CTCACGAAGG TGCGTCACAA ATGGACTGGA TGGAGCAAGA GCAAGAACGT 900 

GGTATCACGA TCACATCTGC TGCGACGACA GCTCAATGGA ACAACCACCG CGTAAACATC 960 

ATCGACACAC CAGGACACGT GGACTTCACA ATCGAAGTAC AACGTTCTCT TCGTGTATTG 1020 

GATGGTGCGG TTACCGTTCT TGACTCACAA TCAGGTGTTG AGCCTCAAAC TGAAACAGTT 1080 

TGGCGTCAAG CAACTGAGTA CGGAGTTCCA CGTATCGTAT TTGCCAACAA AATGGACAAA 1140 

ATCGGTGCTG ACTTCCTTTA CTCTGTAAGC ACACTTCACG ATCGTCTTCA AGCAAATGCA 1200 

CACCCAATCC AATTGCCAAT CGGTTCTGAA GATGACTTCC GTGGTATCAT TGACTTGATC 1260 

AAGATGAAAG CTGAAATCTA TACTAACGAC CTTGGTACGG ATATCCTTGA AGAAGACATC 1320 

CCAGCTGAAT ACCTTGACCA AGCTCAAGAA TACCGTGAAA AATTGATTGA AGCAGTTGCT 1380 
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GAAACTGACG AAGAATTGAT GATGAAATAC CTCGAAGGTG AAGAAATCAC TAACGAAGAA 1440 

TTGAAAGCTG GTATCCGTAA AGCGACTATC AACGTTGAAT TCTTCCCAGT ATTGTGTGGT 1500 

TCAGCCTTCA AAAACAAAGG TGTTCAATTG ATGCTTGATG CGGTTATCGA CTACCTTCCA 1560 

AGCCCACTTG ACATCCCAGC AATCAAAGGT ATTAACCCAG ATACAGACGC TGAAGAAATT 1620 

CGTCCAGCAT CTGACGAAGA GCCATTTGCA GCTCTTGCCT TCAAGATCAT GACTGACCCA 1680 

TTCGTAGGTC GTTTGACATT CTTCCGTGTT TACTCAGGTG TTCTTCAATC AGGTTCATAC 1740 

GTATTGAATA CTTCTAAAGG TAAACGTGAA CGTATCGGAC GTATCCTTCA AATGCACGCT 1800 

AACAGCCGTC AAGAAATCGA CACTGTTTAC TCAGGTGATA TCGCTGCTGC CGTTGGTTTG 1860 

AAAGATACTA CAACTGGTGA CTCATTGACA GATGAAAAAG CTAAAATCAT CCTTGAGTCA 1920 

ATCAACGTTC CAGAACCAGT TATCCAATTG ATGGTTGAGC CAAAATCTAA AGCTGACCAA 1980 

GACAAGATGG GTATCGCCCT TCAAAAATTG GCTGAAGAAG ATCCAACATT CCGCGTTGAA 2040 

ACAAACGTTG AAACTGGTGA AACAGTTATC TCAGGTATGG GTGAACTTCA CCTTGACGTC 2100 

CTTGTTGATC GTATGCGTCG TGAGTTCAAA GTTGAAGCGA ACGTAGGTGC TCCTCAAGTA 2160 

TCTTACCGTG AAACATTCCG CGCTTCTACT CAAGCACGTG GATTCTTCAA ACGTCAGTCT 2220 

GGTGGTAAAG GTCAATTCGG TGATGTATGG ATTGAATTTA CTCCAAACGA AGAAGGTAAA 2280 

GGATTCGAAT TCGAAAACGC AATCGTCGGT GGTGTGGTTC CTCGTGAATT TATCCCAGCG 2340 

GTTGAAAAAG GTTTGGTAGA ATCTATGGCT AACGGTGTTC TTGCAGGTTA CCCAATGGTT 2400 

GACGTTAAAG CTAAGCTTTA TGATGGTTCA TATCACGATG TCGACTCATC TGAAACTGCC 24 60 

TTCAAGATTG CGGCTTCACT TTCCCTTAAA GAAGCTGCTA AATCAGCACA ACCAGCTATC 2520 

CTTGAACCAA TGATGCTTGT AACAATCACT GTTCCAGAAG AAAACCTTGG TGATGTTATG 2580 

GGTCACGTAA CTGCTCGTCG TGGACGTGTA GATGGTATGG AAGCACACGG TAACAGCCAA 2640 

ATCGTTOGTG CTTACGTTCC ACTTGCTGAA ATGTTCGGTT ACGCAACAGT TCTTCGTTCT 2700 

GCATCTCAAG GACGTGGTAC ATTCATGATG GTATTTGACC ACTACGAAGA TGTACCTAAG 2760 

TCAGTACAAG AAGAAATTAT TAAGAAAAAT AAAGGTGAAG ACTAATCCGT CCTCACTCTA 2820 

GAAGGAAGTC ACTTAGTGGC TTCCTTTTGT CTTTAGAAAA TACCTCTAAA TATGGTAAAA 2880 

TAGTAGAAGA ATAATGTGAG GAAAATGAAT GTCAAATAGT TTTGAAATTT TGATGAATCA 2940 

ATTGGGGATG CCTGCTGAAA TGAGACAGGC TCCTGCTTTA GCACAGGCCA ATATTGAGCG 3000 

AGTTGTGGTT CATAAAATTA GTAAGGTATG GGAGTTTCAT TTCGTATTTT CTAATATTTT 3060. 

ACCGATTGAA ATCTTTTTAG AATTAAAGAA AGGTTTGAGC GAAGAATTTT CTAAGACAGG 3120 

CAATAAAGCT GTTTTTGAAA TTAAGGCTCG GTCTCAAGAA TTTTCAAATC AGCTCTTGCA 3180 
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GTCCTACTAT AGGGAGGCTT TCTCTGAAGG TCCATGTGCT AGTCAAGGTT TTAAGTCCCT 3240 

TTATCAAAAT TTGCAAGTTC GTGCTGAGGG TAATCAGCTA TTTATTGAAG GATCTGAAGC 3300 

GATTGATAAG GAACATTTTA AGAAGAATCA TCTTCCTAAT TTAGCCAAAC AACTTGAAAA 3360 

GTTTGGTTTT CCAACTTTTA ACTGTCAAGT CGAGAAGAAT GATGTCCTGA CCCAAGAGC A 3420 

GGAAGAGGCC TTTCATGCTG AAAATGAGCA GATTGTTCAA GCTGCCAATG AGGAAGCGCT 3480 

CCGTGCTATG GAACAACTGG AGCAGATGGC ACCTCCTCCA GCGGAAGAGA AACCAGCCTT 3540 

TGATTTTCAA GCGAAAAAAG CTGCAGCTAA ACCCAAGCTG GATAAGGCGG AGATTACTCC 3600 

TATGATCGAA GTGACGACAG AGGAAAATCG TCTGGTATTT GAAGGGGTTG TTTTTGATGT 3660 

GGAGCAAAAA GTGACTAGAA CAGGTCGTGT TTTAATCAAC TTTAAAATGA CGGACTATAC 3720 

TTCAAGTTTT TCTATGCAAA AGTGGGTTAA AAACGAGGAA GAGGCCCAGA AGTTTGACCT 3780 

CATCAAGAAG AATTCTTGGC TCCGAGTTCG AGGGAATGTG GAGATGAATA ACTTCACACG 3840 

CGATTTGACT ATGAACGTAC AGGATCTGCA GGAAGTTGTT CACTATGAGC GGAAGGATTT 3900 

GATGCCAGAA GGTGAGCGTC GGGTTGAGTT TCATGCTCAT ACTAACATGT CGACTATGGA 3960 

TGCTTTGCCA GAGGTCGAAG AGATTGTTGC AACAGCTGCT AAGTGGGGAC ACAAGGCGGT 4020 

TGCTATCACG GACCATGGGA ATGTCCAGTC CTTTCCACAT GGCTATAAGG CGGCTAAGAA 4080 

AGCGGGAATC CAGCTGATCT ATGGGATGGA AGCCAATATC GTGGAGGACC GTGTCCCTAT 4140 

CGTCTATAAC GAAGTGGAGA TGGACTTGTC AGAAGCAACC TACGTGGTCT TTGACGTGGA 4200 

AACGACGGGA CTTTCAGCTA TCTATAATGA CTTGATTCAG GTTGCGGCTT CTAAGATGTA 4260 

CAAGGGGAAT GTTATTGCTG AATTTGATGA ATTTATCAAT CCTGGGCATC CCTTGTCAGC 4320 

CTTTACTACA GAGTTAACTG GAATTACAGA TGATCATGTC AAAAATGCCA AACCACTAGA 4380 

ACAAGTTTTG CAAGAATTCC AAGAATTTTG CAAGGATACG GTCCTAGTTG CCCACAATGC 4440 

TACCTTTGAC GTTGGCTTTA TGAATGCTAA TTATGAGCGG CATGATCTTC CAAAGATTAG 4500 

TCAGCCAGTT ATTGATACGC TGGAGTTTGC TAGAAACCTC TATCCTGAGT ATAAACGCCA 4560 

TGGTTTGGGG CCTTTGACCA AGCGTTTTGG TGTGGCCTTG GAACATCACC ACATGGCCAA 4620 

CTACGATGCG GAAGCGACTG GTCGTCTGCT TTTCATCTTT ATCAAAGAGG TAGCAGAAAA 4680 

ACATGGTGTG ACCGATTTAG CTAGACTCAA CATTGATCTA ATCAGTCCAG ATTCTTACAA 4740 

AAAAGCTCGG ATCAAGCATG CGACCATCTA TGTCAAGAAT CAGGTAGGTC TAAAAAATAT 4800 

CTTTAAGCTG GTTTCCTTGT CTAATACCAA GTATTTTGAA GGAGTGCCAC GGATTCCGAG 4860 

AACGGTTCTA GATGCCCATC GAGAGGGCTT GATTTTAGGT TCAGCCTGTT CAGAGGGTGA 4920 
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AGTTTTTGAC GTGGTCGTTT CTCAAGGTGT GGATGCGGCG GTTGAGGTGG CCAAGTATTA 4980 

TGATTTTATC GAGGTCATGC CACCGGCTAT CTATGCACCC TTGATTGCCA AAGAGCAGGT 5040 

CAAGGATATG GAGGAACTCC AGACCATTAT CAAGAGTTTG ATAGAGGTTG GAGACCGCCT 5100 

TGGCAAGCCT GTTCTGGCTA CGGGAAATGT TCACTATATC GAACCGGAAG AAGAGATTTA 5160 

TCGTGAAATT ATCGTCCGTA GTTTGGGACA GGGTGCGATG ATTAATCGAA CTATCGGTCA 5220 

TGGTGAACAT GCCCAACCAG CACCACTTCC AAAGGCTCAT TTTCGAACGA CTAATGAGAT 5280 

GTTGGATGAA TTTGCCTTTT TGGGAGAGGA ACTGGCTCGT AAACTGGTTA TTGAAAACAC 5340 

CAATGCCTTG GCAGAAATAT TTGAATCCGT TGAAGTCGTT AAGGGTGACT TGTATACGCC 5400 

TTTCATCGAC AAGGCTGAAG AAACAGTTGC TGAGTTGACC TATAAGAAAG CTTTTGAGAT 5460 

TTATGGAAAT CCGCTGCCAG ATATTGTTGA TTTGCGGATT GAAAAAGAAT TAACATCCAT 5520 

ACTGGGGAAT GGATTTGCTG TGATTTATCT GGCATCGCAG ATGCTGGTGC AACGTTCTAA 5580 

TGAACGGGGT TATTTGGTTG GTTCTCGTGG GTCTGTCGGA TCTAGTTTCG TTGCGACCAT 5640 

GATTGGGATT ACGGAGGTCA ATCCTCTCTC TCCTCACTAT GTCTGTGGTC AGTGTCAGTA 5700 

CAGTGAGTTT ATCACAGATG GTTCGTACGG TTCAGGATTT GATATGCCCC ATAAGGACTG 5760 

TCCAAACTGT GGTCACAAAC TCAGTAAAAA CGGACAGGAT ATTCCGTTTG AGACCTTCCT 5820 

TGGTTTTGAT GGGGATAAGG TTCCTGATAT TGACTTGAAC TTCTCGGGAG AAGATCAGCC 5880 

TAGCGCCCAC TTGGATGTGC GTGATATCTT TGGTGAAGAA TATGCCTTCC GTGCGGGAAC 5940 

GGTTGGTACG GTAGCTGCCA AGACTGCCTA TGGATTTGTC AAAGGTTACG AGCGAGATTA 6000 

TGGCAAGTTT TATCGTGATG CAGAAGTAGA ACGCCTCGCT CAAGGAGCGG CGGGTGTCAA 6060 

GCGGACAACA GGCCAACACC CGGGGGGAAT CGTTGTTATT CCGAACTACA TGGATGTCTA 6120 

CGATTTTACG CCTGTCCAGT ATCCAGCAGA TGATGTCACG GCTGAATGGC AGACCACTCA 6180 

CTTTAACTTC CACGATATCG ATGAGAACGT CCTCAAACTC GATGTACTGG GACATGATGA 6240 

TCCGACTATG ATTCGAAAAC TTCAGGATTT GTCTGGTATT GACCCTAATA AAATTCCTAT 6300 

GGATGACGAA GGCGTGATGG CACTCTTTTC TGGGACTGAT GTGCTAGGGG TAACACCTGA 6360 

ACAAATTGGA ACGCCTACGG GTATGTTGGG GATTCCAGAG TTTGGAACAA ATTTCGTACG 6420 

TGGAATGGTA GACGAAACCC ATCCGACAAC CTTTGCGGAA TTGCTTCAGC TGTCTGGTCT 6480 

GTCCCACGGT ACTGATGTTT GGTTGGGGAA TGCTCAGGAT CTGATTAAGC AAGGAATAGC 6540 

GGACCTATCG ACTGTTATCG GTTGTCGGGA CGACATCATG GTTTACCTCA TGCATGCGGG 6600 

TCTGGAACCT AAGATGGCCT TTACCATTAT GGAACGGGTA CGTAAGGGTT TGTGGCTAAA 6660 

GATTTCAGAA GAGGAGAGAA ATGGCTATAT CGAAGCAATG AAGGCTAATA AGGTGCCAGA 6720 
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GTGGTATATC 


GAATCCTGTG 


GGAAAATTAA 


GTACATGTTC 


CCTAAGGCCC 


ATGCGGCAGC 


6780 


CTACGTTATG 


ATGGCCTTGC 


GTGTAGCTTA 


CTTCAAGGTT 


CACCATCCTA 


TTTATTACTA 


6840 


CTGTGCTTAC 


TTCTCCATTC 


GTGCTAAGGC 


TTTTGATATC 


AAGACCATGG 


GTGCGGGCTT 


6900 


GGAGGTCATC 


AAGCGCAGAA 


TGGAAGAAAT 


CTCTGAAAAA 


CGGAAGAACA 


ATGAAGCCTC 


6960 


TAATGTGGAA 


ATCGATCTCT 


ATACAACTCT 


TGAGATTGTC 


AATGAGATGT 


GGGAACGAGG 


7020 


TTTCAAGTTT 


GGTAAATTAG 


ATCTCTACTG 


TAGTCAGGCG 


ACAGAGTTCC 


TCATCGACGG 


7080 


GGATACCCTT 


ATCCCACCAT 


TTGTAGCAAT 


GGATGGTCTG 


GGAGAGAACG 


TTGCCAAGCA 


7140 


ACTGGTGCGG 


GCGCGTGAAG 


AGGGAGAATT 


CCTCTCTAAA 


ACAGAACTAC 


GCAAGCGTGG 


7200 


TGGACTCTCA 


TCAACCTTGG 


TTGAAAAGAT 


GGATGAGATG 


GGTATTCTTG 


GAAATATGCC 


7260 


AGAGGATAAC 


CAGTTGAGTT 


TGTTTGATGA 


GTTGTTTTAA 


AAAATTGCTT 


AATAATCTAT 


7320 


TAAAAGAGGC 


TAACGTATAT 


CCAATAGATT 


TACATTAGCT 


TTCTTTTTTG 


TTAAAATAGT 


7380 


CTATGGAAAG 


AGGGTGAGAG 


TATGTCAAAG 


ATGAGTATAA 


GCATCCGTCT 


GGATAGTGAG 


7440 


GTTAAGGAGC 


AGGCCCAACA 


GGTGTTTAGT 


AATCTGGGAA 


TGGATATGAC 


AACAGCTATT 


7500 


AATATTTTCC 


TTCGTCAGGC 


AATTCAATAT 


CAGGGATTAC 


CTTTTGATGT 


TAGACTAGAC 


7560 


GAAAATCGGA 


AGTTGCTCCA 


AGCGTTAACG 


GATTTAGACC 


AAAATCGTAA 


TATGAGCCAG 


7620 


TCTTTTGAAT 


CAGTCTCAGA 


TTTGATGGAG 


GACTTACGTG 


CTTAAGATTC 


GTTATCATAA 


7680 


ACAGTTTAAA 


AAAGATTTTA 


AGTTGGCTAT 


GAAGCGTGGT 


TTGAAGGCAG 


AATTATTAGA 


7740 


AGAAGTTTTG 


AATTTTCTGG 


TTCAAGAAAA 


AGAACATCCT 


GCCAGAAATC 


GTGATCATTC 


7800 


AT1-GACGGCA 


TCCAAGCATT 


TTCAAGGAGT 


TCGTGAATGC 


CATACCCAGC 


CAGATTGGCT 


7860 


TTTGGTTTAT 


AAAGTAGACA 


AGTCGGAATT 


GATTTTAAAT 


TTGCTGAGGA 


CAGGCAGTCA 


7920 


CAGTGATTTA 


TTTTAATCTA 


TTTTAAGGGG 


GTTCTCATGA 


AACTAAGAAT 


ATTTGCGGAA 


7980 


GATAAGCCGG 


CTAAGAAGGT 


ATTTGAATAT 


CAATTAGAAC 


TTGCTGATCG 


TACAATTCTT 


8040 


CTATCGACAG 


CACTCTTGTC 


AGGTGCTATT 


GCTTTAGCAG 


GAATCTTTTC 


TGCTTTGAAA 


8100 


GAAAAATAAA 


AATAGAAAAG 


AGAAAACAGA 


ATGGTTTTAC 


CAAATTTTAA 


AGAAAATCTA 


8160 


GAAAAATATG 


CGAAATTGTT 


GGTTGCGAAC 


GGAATTAACG 


TGCAACCTGG 


TCACACTTTG 


8220 


GCTCTCTCTA 


TTGATGTGGA 


GCAACGTGAA 


TTGGCACATC 


TAATCGTGAA 


AGAAGCTTAT 


8280 


GCCTTGGGTG 


CGCATGAGGT 


CATCGTTCAG 


TGGACAGATG 


ATGTGATTAA 


CCGTGAGAAA 


8340 


TTCCTCCATG 


CCCCGATGGA 


GCGTTTGGAC 


AATGTGCCAG 


AATACAAGAT 


TGCTGAGATG 


8400 


AACTATCTCT 


TGGAGAATAA 


GGCTAGCCGT 


CTTGGAGTTC 


GTTCATCTGA 


TCCAGGTGCC 


8460 
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TTGAACGGAG TGGACGCTGA CAAGCTTTCA GCTTCTGCTA AAGCTATGGG ACTTGCCATG 8520 

AAGCCTATGC GTATCGCAAC TCAATCTAAC AAGGTTAGCT GGACTGTAGC AGCTGCAGCA 8580 

GGACTTGAGT GGGCTAAGAA AGTCTTCCCA AATGCTGCGA GCGACGAAGA AGCAGTTGAT 8640 

TTCCTTTGGG ACCAAATTTT CAAAACTTGC CGTGTCTACG AAGCAGATCC TGTTAAGGCT 8700 

TGGGAGGAAC ATGCAGCCAT TCTCAAGAGC AAGGCCGATA TGCTTAATAA GGAGCAATTT 8760 

TCAGCCCTTC ACTACACAGC GCCAGGAACA GATTTAACAC TTGGTTTGCC AAAGAACCAC 8820 

GTTTGGGAAT CAGCTGGTGC TGTCAATGCA CAGGGCGAAG AATTCTTGCC AAATATGCCA 8880 

ACAGAAGAGG TCTTCACAGC GCCTGACTTC CGTCGTGCAG ATGGTTATGT CACTTCTACA 8940 

AAACCGCTTA GCTACAACGG AAATATCATT GAAGGCATTA AGGTGACCTT TAAGGATGGA 9000 

CAAATCGTAG ATATCACTGC TGAGAAGGGT GATCAGGTTA TGAAAGACCT TGTCTTTGAA 9060 

AATGCGGGTG CGCGTGCCTT GGGTGAATGT GCCTTGGTAC CAGATCCAAG TCCAATTTCT 9120 

CAGTCAGGCA TTACCTTCTT TAACACCCTT TTCGATGAAA ATGCGTCAAA CCACTTGGCT 9180 

ATCGGTGCAG CCTATGCGAC TAGCGTTGTT GATGGAGCGG AGATGAGCGA AGAGGAGCTT 9240 

GAAGCTGCAG GGCTTAACCG TTCAGATGTT CACGTAGACT TTATGATTGG TTCTAACCAA 9300 

ATGGATATCG ATGGTATTCG TGAGGATGGA ACGCGGGTAC CTCTTTTCCG TAATGGGAAT 9360 

TGGGCAAATT AAGGAGATAA TATGTTAGGA AGTATGTTCG TTGGTCTCCT AGTGGGATTT 9420 

TTAGCAGGTG CTATGACCAA TCGTGGAGAG CGAATGGGAT GTTTTGGAAA AATGTTTCTC 9480 

GGTTGGATCG GAGCCTTTCT AGGTCACTTG CTCTTTGGAA CTTGGGGGCC AGTTTTATCA 9540 

GGAACAGCTA TTATCCCAGC GATTTTAGGA GCCATGATTG TTTTAGCTAT TTTTTGGAGA 9600 

CGAGGAA 9607 
{2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14231 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

CTACAAGATA ATTCCAGCTA TAACATCCGC TATAATAGTA AGAGCGAGCT CTATGATAAG 60 

GCTCATTAGT TTCACCTCCT CTCACGAACC CATAGGAACG TAATCGGTAA CCGATGACAA 120 

AAATAGTATA CCACAATACA TTTAGATCAT CAAGGTCACT TAATTCTTGA AATATCAGAT 180 

CTAAGAGAAA AATCTTTAAA ATCAGAAAAA CGCATAATAT CAGGTGTGCA AAAACTTGAT 240 
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ACTATGCGTT 


TTATTGTGGG 


AAGGTTTACT 


CCATTTTCTC 


CTGAAATTGA 


GTTTTTGTCC 


300 


AGCCTCTGTT 


TTTAGGGTTG 


CTAAGAAAAT 


AATGTCATGT 


GGTGAATATT 


TGTAAATCAG 


360 


TCAGCAGACA 


GAACGATACT 


CTTCGAAAAT 


CTCTTCACAT 


CATGTCAGCT 


TCGTCTTTCC 


420 


GTATATATGT 


GACTGACTTC 


ATCAGTTCTA 


TCTACAACCT 


CAAAACAGTG 


TTTCGAGCTG 


480 


ACTTGATCAA 


TTTTCAAATC 


TGTACTTTGA 


GCAAGCTGAG 


ACTAGCTTCC 


TATTTGATTT 


540 


TCATTGAATA 


TCAGAAACCC 


ATTCTCCATC 


AAATAATTCG 


ACTGCGTCTA 


ATAATTTTTG 


600 


ATCTGGCACG 


GTGTCTGAAA 


TAAAGGTTGT 


GTATTTGGAG 


AGGGGATTAA 


TTTTAAAAAA 


660 


TCCAGTCTTG 


TAAAATTTAG 


AACTATCAAT 


CAGTAAGATG 


GTTTCATGGG 


CTTTGTCAAT 


720 


AATATTCTTT 


TTTGAAATAG 


CTTGGCTGAG 


AGAAGCTTCA 


TAAACATATT 


GGTCATCAAT 


780 


ACCTCTTGCT 


GAACAAAATG 


CTAAATCGAT 


ATTAAAATGA 


TCTAATAAAG 


AATTTTCCTT 


840 


ATCATAGTTG 


ACCACGGAAC 


AGGATTGATG 


TTTGACCTCG 


CCAGATGTGA 


TAAAGATTTT 


900 


GGAGCTATCT 


TTAACAGTTT 


CAGATAGGGT 


TTGTGCAGTA 


TGTAAACCAT 


TTGTAAAAAT 


960 


AATCAAATTA 


TCAAGTTCAG 


AAAGATAGGG 


ACAGAGTTCG 


TAGACAGTAG 


TACTAGAATC 


1020 


TAGATAGATA 


CACATACCAG 


ACCGAATAAA 


GTCTTTAGCG 


AGACTAGCGA 


TTAGTCTTTT 


1080 


TTGCCTAGTA 


CTTTCTCCTT 


CACGTATTTG 


ATGAGAAAGT 


TCAATTGTGT 


TCATAGAGGA 


1140 


CAGGGTCACG 


TATCCGTGCT 


TTCTTTTGAT 


AAGACCTTGA 


TTTTCTAAGA 


AAATTAAATC 


1200 


ACGACGTAAG 


GTACTTGTGC 


TGGAGAAAGT 


GATTTCTGCC 


AGCTCTTTTA 


CGGCAATTCT 


1260 


TTTTTTCTTT 


TTGATAATTT 


CAATCAATTC 


AAGTACACGT 


TCATCTTTTA 


TCATAAGCTC 


1320 


CTCCTAATTT 


ATCATTTCAA 


CTATATTATA 


GCACAAATTG 


GAGGAATTTG 


AATTATTTTT 


1380 


ATGAATATTG 


GGTTAACATT 


TGAACATTAT 


TCAAGTAAGC 


GTTCACATAT 


TGAAAAAATA 


1440 


AAACGTGGGG 


ATTATAATAA 


AGTTAATCmA 


GGACGAAGAG 


AGAAGAAAAA 


TGGAAGCGGT 


1500 


TTTAGCAATA 


GATTTAGGTG 


CGACTTCTGG 


AAGAGCAATC 


GTTGGTTACC 


TTTCTGAAAA 


1560 


TAAACTAGTA 


ATGGAAGAAA 


TAAATCGCTT 


TTCTAATCTA 


CCTATTAGAG 


TAAAAGGGCA 


1620 


TTTATCTTGG 


GATATTGACT 


TTCTACTAGC 


TAAAATTCTT 


GAAAGTATCC 


GCTTGGCTAA 


1680 


TACTAGTTAC 


AAGATTTTAT 


CTATCGGTAT 


TGACACATGG 


GGAGTTGATT 


TTGGACTGAT 


1740 


TGATAATGAA 


GGTAAGCTGT 


TATTACAACC 


TGTTCATTAT 


CGTGATGAAA 


GAACAAAGGG 


1800 


AGTGTTAAAG 


GAAATATCTG 


AAATGACTGA 


ATTAGAAAAA 


CTGTATTCAG 


AGACAGGAAA 


1860 


TCAGATTATG 


GAGATAAATA 


CCTTGTTTCA 


ACTCTTTAAG 


GCACGTCAAG 


AATCTCCTGA 


1920 


CTCTTTCTAT 


AAGACCAATA 


AGATTCTTTT 


AATGCCAGAT 


TTGTTTAATT 


ATCTCTTGAC 


1980 
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AGGTAAGTTT GCTACAGAAA AAAGCATTGC TTCAACAACT CAATTATTTG ATCCTAGGAG 2040 

TCAAAATTGG AATCAGAATA TCTTAAAACT ATTTGAATTG GATTCATCTT TACTTCCTGA 2100 

AATTGTTTCA GAGGGAAATG TTCTTGGAAG GATAAAAGAG GAGTATGGTT TAGGCGATAT 2160 

TCCTGTTGTG AATGTTTGTA GTCATGATAC AGCAAGCGCG ATTGTCTCAG TACCTAAGAC 2220 

AGAAGGTAGT TTATTTATTT CATCAGGTAC TTGGTCTTTG GTTGGAGTGG AACTTACTTC 2280 

ACCGATTCTT ACTACCGAAT CCTTCAGTTA TGGATTTACA AATGAAGTCG GTAAAGATGG 2340 

AGTGATTACA TTTCTGAAGA ATTGTACAGG GTTGTGGATC ATAGAGGAAC TAAGACGTTC 2400 

ATTTGAACGA AGAGGGAAAG CCTATTCTTT TGATGATATT AGGACAATGG TGGAGAAAGA 2460 

AAAAGAAAAT CTTCCTCTGA TTGATACTGA ATCAACTGAA TTTGCAACAG AATCTGATAT 2520 

GCACAAGACT TTGACAGAAT ATCTAGCTTA TCATCATGAA ACTAGAGAGT GGACAGATGG 2580 

ACAACTATTT AAGATTGTTT ATGAAAGCCT AGCTGAAACG TATAGGAAAG CGATAGAGTT 2640 

ACTAGAAGAA CTAACTCATA AGGTTTATAA GAGGATATAT GTGATTGGAG GAGGTGCTAG 2700 

AGCCAGTTAC TTTAACCAAA TGATTGCTGA TAGAACTGGT AAAGAGGTTC TTACAGGTTT 2760 

GACTGAGGCT ACAGCTGTGG GGAATATTGT TGTGCAGCTC ATAGCTATGG GACAATTAAA 2820 

AGGGATGGAA GAGGCTCACC ATGTTATTGA GGAGTTTCTA CAATTAGAGA GTTATTACTC 2880 

CCAAAAGAAT TAAAAAGATT GAGAGTTTGT AAATTTGCCT CCCTCCCCCT TCTTAGCTTT 2940 

TGTGCAGGAA GGGGGGATAA TTGGTGAATT GAAAAATATT TAGTGTTTTG ATATGAGGAG 3000 

GACAAGGATG TCAGATGTAA AACAAGAATT AATTAAATAT GGTAAGAAGC TAGTAGAAAC 3060 

AGATTTGACG AAAGGAACAG GTGGGAATCT CAGCGTTTTC GATCGTGAAA AACAATTGAT 3120 

GGCAATTACC CCGTCGGGTA TTGATTTCTT TGAAATCAAA GAATCCGATA TTGTAGTGAT 3180 

GGATATTAAT GGAAATGTTG TAGAGGGAGA ACGCTTGCCA TCTAGCGAAT GGTATATGCA 3240 

TTTGATTCAA TATCAAACTC GTGATGATAT CGATGCAATT ATCCATGCTC ATACAACTTA 3300 

TGCAACAGTA TTAGCTTGTC TCAGAGAACC ACTTCCAGCG AGTCATTATA TGATTGCAGT 3360 

GGCAGGGAAA GATGTTCGGG TAGCTGAGTA TGCAACATAT GGCACGAAAG AATTGGCTGT 3420 

GAATGCAGCT AAAGCAATGG AAGGTCGTAG AGCAGTTTTA CTAGCGAATC ATGGAATTTT 3480 

AGCAGGTGCA CAAAATTTAT TGAATGCATT TAATATTGTT GAAGAAGTTG AATATTGTGC 3540 

AAAAATTTAT TGTTTAGCTA AGAATTTTGG AGAGCCAGTA GTTCTTCCTG ATGAGGAGAT 3600 

GGAATTGATG GCAGAAAAAT TTAAAACATA CGGTCAGAGA AAATAGGGAG GATATTAATG 3660 

TTAAAACATA TACCGAAAAA TATTTCTCCA GATTTATTGA AGACTTTAAT GGAAATGGGA 3720 

CATGGAGATG AAATAGTATT AGCTGACGCG AATTATCCTT CTGCCTCATG TGCAAATAAG 3780 
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CTAATTCGTT GTGATGGTGT AAATATTCCA GAATTATTAG ATTCCATTCT GTATTTAATG 3840 

CCATTAGATA GTTACGTCGA TAGTTCAATT CAGTTTATGA ACGTTGTTTC GGGTGATGAT 3900 

ATTCCTAAGA TATGGGGTAC CTATAGACAG ATGATTGAAG GTCATGGTAC AGATCTTAAA 3960 

ACGATTACTT ATCTTAGAAG AGAAGACTTT TATGAACGTA GTAAGAAAGC TTATGCTATT 4020 

^ GTTGCTACAG GAGAAACTTC ACTTTATGCT AATATTATCC TTAAGAAAGG AGTAGTTGTT 4080 

GAAAGAGAAA ATGTTCAATA GAGGAATTTT AGTTGCCAGT CATGGTAATT TTGCTAGCGG . 4140 

AGCTCTCATG ACCGCAGAAA TGTTTGTTGG TGAGACAACA AATGATAGAG TTAGGACATT 4200 

AGGTTTGATG CCTGGAGAGA ATATTGTAGA GTTTGAGCAT TATTTTAAAA ATCAAGTGGA 4260 

TGAACTGTTA GACTCAAATC AAGAGGTTAT CGTTTTGACT GACTTGATTG GAGGAAGTCC 4320 

TAATAATGTG GCTTTGTCAC GGTTTTTAAA TTTGGATTCA GTTGATATTG TAACAGGGTT 4380 

TAATATCCCT CTCCTAGTGG AATTAATATC AAGTTATGAT TCAAAAATCA ATTTAGAAGA 4440 

AATTGTTCAC AATGCTCAAA ATAGTTTGTT TAATGTTAAA CAACAACTTA ACGTAGAGGA 4500 

GGAAGAAGAT TTATGTCTAT AGAGTTTGTT CGTATTGATG ACCGTCTGGT ACATGGTCAA 4560 

GTTGTCACTA CGTGGCTAAA AAAGTATGAT ATTGAGCAAG TTATCATTGT TAATGATCGC 4620 

ATCTCAGAAG ATAAAACACG ACAATCTATT TTAAAGATTT CTGCACCGGT AGGTTTAAAA 4680 

ATTGTTTTCT TTAGTGTAAA ACGGTTTGTG GAAGTTTTAA ACTCTGTGCC AATAAAAAAG 4740 

AGAACAATGC TGATATATAC AAATCCAAAA GATGTGTATG ATTCTATTGA AGGAAATTTA 4800 

AAATTGGAGT ACCTCAATGT AGGACAGATG AGTAAAACGG AGGAAAATGA AAAGGTAACG 4860 

GGAGGTGTAG CTCTAGGTGA AGAAGACAAA TATTATTTTA AGAAAATAGT TGATAAGGGA 4920 

ACGAGAGTTG AAATTCAAAT GGTTCCTAAT GATAAAGTTA CAATGTTGGA AAAATTTTTA 4980 

TAAAAATAAT TTAAGGAGGT ACAGTATATG CTATTCACAC AAGCATTACT GGTGACATTA 5040 

GTTGGGATTA TTGCCACTAT TGACTATAAT GGACCGTTAT TTATGATTCA CCGTCCGTTA 5100 

GTTACAAGTG CAATGGTTGG CTTAGTATTA GGAGATTTCA CCCAAGGTGT TCTTATTGGT 5160 

TCAGCTCTTG AATTAACTTG GCTCGGTGTA ACAGGTATTG GAGGTTATAC TCCACCAGAT 5220 

ACTATTTCAG GTGCGATTAT TGGTACTGCA TTTGGTATTT TATCTGGTCA AGGAGAAACT 5280 

GCTGGTATCG CTATAGCAGT TCCAATTGCA GTTGCTACCC AACAGTTGGA TGTTCTTGCA 5340 

AAAACTTTAG ATGTTTATTT TGTGAAAAAA GCTGATAATG ATGCTAAAAA CGGAGATTAT 5400 

TCAAAGATCG GTTTTTATCA TTATTCAAGT TTGGTTTTAA TCACGTTATT TAAAATTGTA 5460 

CCAATTTTCC TAGCTATTAT GCTTGGAGGG GAATATGTGG CAGACTTGTT TGCTAAGGTT 5520 
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CCACCAATCG TTATGCAGGG ACTTAACTCT GCAGGTGCTT TACTACCTTC AATTGGTTTT 5580 

GGTATGCTTT TAAATATGAT GCTCAAGAAA AATATGTGGG TATTCTTGTT GATTGGATTC 5640 

ATTTGTTCTG TGTATGGAGG AATGTCAACC ATTGGGATCT CACTAGTTGG TATTGCGGTA 5700 

GCATACTTCT ACGATATGAT TGGAAGCAAA CCACAAGAAA CAACTTCAAG TAGTGATGTT 5760 

GAGGAGGATC TTGATCTATG ATGAATAATA AAGTAACTAA AGTTGAACTT AAAAAAGTTT 5820 

TCAAACGAAG TTTTATGTAT GGTTCTTCAT GGAACTATGA GAGAATGCAG AACCTAGGTT 5880 

TTCTATATAC AATTCTTCCA GTATTGAAAA AACTATACCC AGACAAAGAT TCAGCTTCTC 5940 

CTGCAATGAA ACGTCACCTT GAGTTTTTCA ATACTCATCA AACAGCGGCA CCATTTATTC 6000 

TTGGAGTTAC TTCCGCTATG GAAGAACAAG AAGGAAATGA AGGTGCAGCT TCAATTACTG 6060 

GTATTAAAGT TGGCTTGATG GGGCCACTGG CTGGTCTAGG AGATAGTTTG TTCTGGCTGA 6120 

CACTAGTTCC TATCTGTTTT AGTATTGGTG CGTCTTATTC TAAAGACGGC GGTGCTTTAG 6180 

GTATCTTTAT CGCCTTAATA TTGTTTAATA TTATTAATAT TCCTGTTAAA TATTTCGGTT 6240 

TGAAATATGG GTATACTAAG GGTTCTAGTC TTATCCAAGA AAATAATACA AAAGGAACAT 6300 

TGAATCGCGT TACGAGTATG GCGACAGCAT TAGGGCTAGT ACTAGTGGGT GGTTTGATTC 6360 

CATCAATGGT TGGTATTAAT TTTGGATTAG AATTTAAGCA GGGGGAACTT GTTATTTCTG 6420 

TTCAAGAAAT GATTACAAAA TTAATTCCAG GATTTATCCC TATGGCTTTG ACTTTATTAA 6480 

TGTGTAAATT AATTAGAAAA GGAAAGAATC CGGTTGTACT AATCTTTAGT GTTATGGCTA 6540 

TTGGAGTTAT TCTAGTTGTT TTAGGAATTT TGAAGTAGTA GAAAGTGTGG AGGTGGTATT 6600 

TGGGATATCA CCTCCATTTT GGAAGAGAGG TAAAGAGTGA AATTATGGTA TAAGAAAGCT 6660 

GCCGCAAATT GGAATGAAGC CTTGCCGATT GGGAACGGTC ATTTAGGTGG TATGATTTAT 6720 

GGTTCAGCTA CAAAAGAATG TATTCAACTA AACGATGAGA CTATTTGGTA TAGAGGAAAG 6780 

TCAGATAGAA ATAATCCAGA CTCACTATTG CATCTTAAAA AAATTCGGGA ATATCTTTTA 6840 

GATGGAGAAA TTCAGAAAGC CGAAGAATTG ATAAAGTTAA CAGTGTTTGC TACCCCAAGA 6900 

GATCAAAGCC ACTATGAATT ACTTGGGGAA CTTTACATTG AGCATATAGA TATTCAGTCT 6960 

TGTGCTCTTT CATTGTATGA AAGAGAGCTA GATTTAGATA CAGCTATTTC TAATGTTGTG 7020 

TTTGAGCCTA ATAGTTGTAA TTTACAAATA AAAAGAGAAT ATTTTACGAG TTTTAATAAG 7080 

AATATTTTAT GTTGCCGTAT AGTGTCATCA GTTCAAAACA CATTAAATTT AAACATTAAT 7140 

TTGGGTAGAA ATAAACGGTT TAATGACGAA GTATCTAAAC TGGATTCAAG TACAATTTTA 7200 

ATGTCGGCCT CTGCTGGAGG TAGAAAAGGT GTTCAGTTTA AAGTAGTATG TCATTCTAAG 7260 

GTTACGGATG GTGAAGTAAG TGTATTGGGA GAGACAATAG TTATTCGGAA TGCTACAGAG 7320 
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GTATTTCTTT ATCTCAAATC AATGACGGAT TATTGGGGAA ATATAGATAT TTCTTCTCTT 7380 

CAGGGAGAAT TTAGTAGTAT TGATTACTTT ACAGAAAAAG ATGAACATGT AAAAAAATAT 7440 

CAGGAGCAAT TTAATAGAGT TGATTTTAAA CTAGACTATA GTAAAGGTTG TCTTAGCATT 7500 

CCAACGAATC TACTTCTTGA AAACACTAAA AAGTATAGTA ACTACTTGAC TAACTTGTTA 7560 

TTTCATTATG GAAGATATCT GTTAATATCG TCTAGTCAAC CGAATGGTTT ACCTGCCAAT 7620 

CTTCAAGGAA TATGGTGTGA TGAATTAAAT CCAATTTGGG GTTCTAAATA TACGATTAAT 7680 

ATTAATACTC AAATGAATTA TTGGATGGTA GGTCCATGTG ATTTACCAGA AGTAGAATAT 7740 

CCATTATTTG ATATGCTCGA AAGAATGAGA GAACCGGGAA GACTAACCGC TAAGAAAATG 7800 

TATGGAGCTA GAGGTTTTAC AGCACATCAT AATACGGATG GTTTTGGCGA TACGGCTCCC 7860 

CAATCTCATG CCATGGGGGC TGCAATTTGG GTATTAACTA TTCCATGGTT ATGTACTCAT 7920 

ATTTGGGAAC ACTATTTATA TTTCCAAGAT GAGCGTATTC TTACGGAACA TTTTGAAATG 7980 

ATAAAAGAAG CATTTCTTTT CTTTGAAGAT TATTTATTTG AGGTGGATGG CTACTTGATG 8040 

ACAGGTCCAA GTGTCTCACC GGAAAATAAA TATCGCTTAA AAAATGGTAT TGAAGGAAAT 8100 

GCTTGTCTAT CATCTACAAT TGATAATCAA ATTCTAAGAT ATTTTTGTGA TTCATGCATT 8160 

GGCATTGCAA AACAATTAGG AGACAATTCG GATTTTATTA GTCGTGTGAA GGAGTTAAAA 8220 

AAGAAACTAC CTAAAACAAA AATAGGTAGT AATGGGCAAA TCCAAGAATG GTTAGAAGAT 8280 

TATGAAGAAG TAGAGCCTGG GCATAGACAC ATTTCACCTC TATTTGGGCT TTATCCTTAT 8340 

AATGAGATTG ATATTCATAA AACTCCGGAA TTAGCAGAAG CAGCTAAAAT CACTATCAAT 8400 

AGGAGATTAT CAAACGCTAA TTTTTTATCT TCACAGGAGA GGGAGCAAGC GATTAATAAT 8460 

TGGTTAGTAA GTGGTTTGCA TGCTAGTACA CAAACAGGTT GGAGTGCTGC ATGGCTGATT 8520 

CATTTTTTTG CGAGACTATA TCAAGGTGAA CCTGCTTATA ACCAGATTAA TGGTTTGTTA 8580 

AATAATGCGA CTCTTGGCAA TTTATTTCTT GACCATCCAC CATTTCAAAT TGATGGTAAT 8640 

TTAGGTTTGG TGAGTGGAAT TTGTGAATTA TTAGTACAGA GCCATCATAA TTGGTTATCA 8700 

CTAATTCCAG CTTTACCTTC TGCTTGGTCA GAAGGAGAAG TGAAAGGTTT CAGAGTAAGA 8760 

GGAGGATATA AGGTATCGTT TGCTTGGAAA AATGGGGATA TAACATTCCT AAAATTGGAA 8820 

GGAGGAAACA AAGATCAAAA AGTAAGAGTA AGAATATATG GCAAAAATAC TGATGTACAA 8880 

AATATTGAAT TGGTATTTAA TTCAGAAAAA ATTATTGAGT TAAATTTTTA GGTATAAGTC 8940 

ATGAATAAAG AAAAAATAAA AAGAAAATTA ATCACAATAT TGTTTGTATG TATTGGGATG 9000 

TTATGTTTTG GATTGTTAGC AGGAGTTAAG GCTGATAATC GTGTTCAAAT GAGAACGACG 9060 
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ATTAATAATG 


AATCGCCATT 


GTTGCTTTCT 


CCGTTGTATG 


GCAATGATAA 


TGGTAACGGA 
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TTGAACTTCA 
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ATTTGGTGTT 
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9660 


GGCTTATGTG 


ATAACTGGGG 


CTCATCAACA 
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AAGAGCTAGG 
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CCTATGCATC 
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TCAATGATTG 


CTATGGAAAT 
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TATACTGGGG 
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9840 
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CGTATACATT 


TATGACAAAT 
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9960 
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AAACGGATTT 


10020 


TATCAAGGAC 


TTTATTCGAA 


TGATGAAACA 
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ATAATAATGG 


GAGATATCAT 


10080 


ATTCTTCCTG 


TAATACATGA 


GAAAATTGAT 


AAGGAAAAGA 


TTTCATCTAT 


ATTCCCTAAT 


10140 


GCAAAAATTT 


TGACTAAAAA 


TAGTGAGGAA 


TTGTCTAGTA 


AAGTCAACTA 


TTTAAACTCG 


10200 


CTTTATCCAA 


AACTTTATGA 


AGGAGATGGG 


TATGCTCAGC 


GTGTAGGTAA 


TTCCTGGTAT 


10260 


ATTTATAATA 


GTAATGCTAA 


TATCAATAAA 


AATCAGCAAG 


TAATGTTGCC 


TATGTATACT 


10320 


AATAATACAA 


AGTCGTTATC 






CTTACGCTGT 
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10380 


AATCCAAATA 


ATTTACATAT 


TTTATTGAAT 


AATTACAGGA 


CAGATAAGAC 


AGCTATGTGG 


10440 


GCATTATCAG 


GAAATTTTGA 


TGCATCAAAA 


AGTTGGAAGA 


AAGAAGAATT 


AGAGTTAGCG 


10500 


AACTGGATAA 


GCAAAAATTA 


TTCCATCAAT 


CCTGTAGATA 


ATGACTTTAG 


GACAACAACA 


10560 


CTTACATTAA 


AAGGGCATAC 


TGGTCATAAA 


CCTCAGATAA 


ATATAAGTGG 


CGATAAAAAT 
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CATTATACTT 


ATACAGAAAA 


TTGGGATGAG 


AATACCCATG 


TTTATACCAT 


TACGGTTAAT 


10680 


CATAATGGAA 


TGGTAGAGAT 


GTCTATAAAT 


ACTGAGGGGA 


CAGGTCCAGT 


CTCTTTCCCA 


10740 


ACACCAGATA 


AATTTAATGA 


TGGTAATTTG 


AATATAGCAT 


ATGCAAAACC 


AACAACACAA 


10800 


AGTTCTGTAG 


ATTACAATGG 


AGACCCTAAT 


AGAGCTGTGG 


ATGGTAACAG 


AAATGGTAAT 


10860 
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TTTAACTCTG GTTCGGTAAC ACACACTAGG GCAGATAATC CCTCTTGGTG GGAAGTCGAT 10920 

TTGAAAAAAA TGGATAAAGT TGGGCTTGTT AAAATTTATA ATCGCACAGA TGCTGAGACT 10980 

CAACGTCTAT CTAATTTTGA TGTGATTCTA TATGACAATA ATAGAAACGA AGTTGCTAAG 11040 

AAACATGTTA ATAATTTGTC GGGTGAATCT GTTAGTCTAG ATTTCAAAGA AAAAGGAGCA 11100 

AGGTATATTA AAGTTAAATT ACTAACGAGT GGAGTGCCTT TGAGTTTAGC AGAAGTAGAG 11160 

GTTTTTAGAG AATCAGATGG TAAGCAATCT GAAGAGGATA TAGATAAAAT AACAGAAGAT 11220 

AAAGTAGTCT CTACAAATAA GGTAGCTACT CAAAGTTCAA CCAATTATGA GGGTGTAGCT 11280 

GCTTTAGCAG TTGATGGTAA TAAAGATGGA GATTACGGAC ATCATTCGGT GACTCATACT 11340 

AAGGCAGATT CTAACGCTTG GTGGCAGGTC GATCTGGGAG AAGAGTTTAC GGTTTCTAAA 11400 

GTTGATATTT ATAATAGAAC AGATGCCGAA CCTCAGCGTT TATCTAATTT TGATGTTATT 11460 

TTTCTATCTT CATCAGGAGA AGAAGTTTTT AGAAGACATT TTGATAAAGT AGTTGATGGT 11520 

TTGTTATCTT TAAAAGTACC TTCTGTAGGG GCTAAGCTAG TCAAAATAGA ATTAAAATCA 11580 

GCAGCTATTC CGTTAAGTTT AGCGGAAGTT GAAGTCTATG GTTCAAAGAG AACTCCGAAG 11640 

AAACTTTCTA ATATTGCATT AACAAAAGAA ACTCGACAGA GTTCAACGGA TTACAATGGT 11700 

TTTTCTCGTC TAGCAGTTGA TGGAAATAAA AACGGAGATT ATGGTCATCA TTCAGTGACT 11760 

CATACCAAAG AAGATTCTCC TTCATGGTGG GAGATAGATT TAGCACAAAC CGAAGAATTA 11820 

GAAAAGTTAA TTATTTATAA TAGAACAGAT GCTGAAATTC AGAGATTATC AAATTTTGAT 11880 

ATTATTATAT ATGATTCAAA TGATTATGAA GTTTTTACAC AACATATTGA CAGTTTAGAA 11940 

AGCAATAATC TATCCATAGA CTTAAAAGGA CTGAAGGGAA AAAAGGTTAG AATTTCTTTG 12000 

AGAAGCGCAG GAATTCCTTT AAGTTTAGCA GAGGTAGAGG TTTATACTTA TAAGTAATTT 12060 

TAAAAATTAT CACCCAGGCT ACCGTAAATA TAATGGAGAT GGTAGTATGA AAGAAACAGA 1212 0 

AAAATAAGAG GAAAATAGTA TGATTCAACA TCCACGTATT GGGATTCGTC CGACTATTGA 12180 

TGGTCGTCGT CAAGGTGTAC GCGAATCACT TGAAGTGCAA ACAATGAACA TGGCTAAAAG 12240 

TGTGGCAGAT TTGATTTCAA GCACATTGAA ATATCCAGAT GGGGAACCTG TGGAATGCGT 12300 

GATTTCTCCA TCTACTATTG GCCGTGTACC AGAGGCTGCA GCTTCCCATG AGTTGTTTAA 123 60 

AAAATCAAAT GTTTGCGCAA CAATTACAGT TACACCATGC TGGTGTTATG GTAGTGAAAC 12420 

TATGGATATG TCTCCAGATA TTCCTCATGC TATTTGGGGA TTTAATGGGA CAGAACGCCC 12480 

AGGAGCTGTC TATCTTGCAG CTGTACTAGC TTCACATGCT CAAAAAGGGA TTCCAGCCTT 12540 

TGGGATTTAT GGAAGAGATG TTCAGGAAGC TAGTGACACA GATATTCCAG AAGATGTCAA 12600 
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AGAAAAACTT TTACGCTATG CGCGTGCAGC TCTTGCAACT GGCTTGATGA GAGACACTGC 12660 

TTACCTATCA ATGGGTAGTG TTTCGATGGG GATTGGTGGT TCTATTGTAA ATCCGGATTT 12720 

CTTCCAAGAA TACTTAGGAA TGCGAAATGA ATCGGTAGAT ATGACGGAGT TCACGCGCCG 12780 

TATGGACCGT GGTATTTACG ACCCTGAAGA GTTCGAACGT GCGCTCAAAT GGGTGAAAGA 12840 

AAACGTAAAA GAAGGATTCG ACCATAACCG TGAAGACCTT GTTTTAAGCC GTGAAGAAAA 12900 

AGATAGACAA TGGGAATTTG TTATTAAGAT GTTCATGATT GGACGTGACT TAATGGTTGG 12960 

TAACCCAAGA CTTGCTGAAC TTGGTTTTGA GGAAGAAGCG GTTGGTCACC ATGCTTTAGT 13020 

AGCTGGTTTC CAAGGTCAAC GTCAGTGGAC AGACCATTTT CCAAATGGGG ACTTTATGGA 13080 

AACTTTCCTC AATACTCAGT TTGACTGGAA TGGTATTCGA AAACCATTTG TATTTGCGAC 13140 

AGAGAATGAT TCACTAAATG GTGTGTCTAT GCTCTTTAAT TATCTATTAA CAAATACTCC 13200 

ACAAATCTTT GCTGATGTGC GTACTTATTG GAGCCCAGAG GCTGTTAAAC GTGTAACGGG 13260 

ACATACTTTA GAGGGTCGTG CTGCAGCTGG CTTCTTACAT CTAATCAACT CTGGTTCTTG 13320 

TACATTGGAT GGTACAGGTC AAGCTACTCG AGATGGCAAA CCTATTATGA AACCATTCTG 13380 

GGAGTTGGAA GAAAGTGAAG TGCAGGCTAT GCTTGAAAAT ACAGACTTCC CACCAGCAAA 13440 

CCGCGAATAC TTCCGTGGAG GAGGATTCTC AACTCGTTTC TTGACGAAGG GGGATATGCC 13500 

AGTAACAATG GTACGTCTCA ATCTTCTAAA AGGGGTTGGT CCAGTGCTAC AAATTGCAGA 13560 

AGGTTACACA CTTGAACTTC CTGAAGATGT TCACCATACT TTAGATAATC GTACAGATCC 13620 

AGGATGGCCA ACTACTTGGT TTGCTCCACG TTTGACAGGA AAAGGTGCTT TCAAGTCTGT 13 680 

CTATGACGTC ATGAATAATT GGGGAGCTAA TCACGGAGCC ATAACATATG GACACATTGG 13740 

AGCAGACTTG ATTACCTTGG CTTCTATGTT GAGAATTCCT GTCAATATGC ATAATGTACC 13800 

TGAGGAAGAT ATCTTTAGAC CTAAAAATTG GTCCTTATTT GGAACAGAAG ATCTAGAATC 13860 

AGCAGACTAT CGTGCATGTC AGTTGTTGGG GCCACTACAT AAATAAAACT TGTTTATATA 13920 

GGAGGTGAAC TTACGTCCCT CCTATCCTTT TAAAAAGATT TGTTAAACAA TTCACAAATA 13980 

ATTGAAAACG AATACAAAAA GTAATATAAT GATGTTAAAT AGATAGCGCG GAGGCGCAGG 14040 

AGGAAAATTA TATGGCT AT A TTTTATGTTC CGGCAGTCAA CCTTATTGGA AAAGGTGTTG 14100 

TAAATGAAGT GGGTCCTTAT ATCAAGGAAC TTGGCTATAA AAAGGCACTT TTGGTGACAG 14160 

ATAAGTACAT CGAAGGCAGT GATATTTTAC CTAAGACTTT AAAACCACTG GATACAGAAG 14220 

GAATCGAATA T 14231 
(2) INFORMATION FOR SEQ ID NO: 82: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 16995 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

AGTTCTCTTA ACTTTTTTAG GATGGCATTC TCCGCTCTCA GGTACTCATT TTCTGCTgAA 60 

GACGTTCTAA TTCTGTCCTC TCTTCAGGTC TCGTTTTTGG CTTACGTCCC ATTTTAGGTA 120 

CTCTCCCTCT TGTTTTCTCA ACAATAGTAT ACCCGTTTTT CCTGTATTGT GCTAGCCAGT 180 

TAAGAAGTAT CGTACGACTT GGGAGACCGT ATTCAAGAGA AACTCTATCT TTAGTCCAGC 240 

CTTCATGTCA GACTTTATTA CTCATTTCTT GTTTTAAATC AGGAGAATAG TAACGATTTT 300 

TTCCTTTTTT GACGAACTCT ATTCCGTAAC GATCAATCAA TTTAATCATG TACCTAATAT 360 

TAGAATTGCT TATCCCAAAT TTATTTGAAA GCTTCTCTAA GCTATATCCT TGTTTTCTAA 420 

GTTCATAGAT CTGAACTTTA TCATCATAAG TTAGTTTCAT AATAAAAACA CCCCAAAAGT 480 

TAGATTTTTT CTGTCTAACT TTTGGGGTGT AGTTCATGTA CACCTGATAT GATGCGTTTT 540 

ATAATTTTTA AGCCTTTTTG CCCAGCCTCG TCAAAAGTAA TGTTTTGACA CAAAATCTGT 600 

GACAAAACTT TAGTTTTAAA GGTTTTTAAC TTTGTATATA CTAGTTTTAA GAAAAGGAGG 660 

ATGATCTAAT GGAAGAAAAA GTATCATTGA AAGTCAGGGT TCAAAAACTA GGGACATCGC 720 

TTTCAAATAT GGTTATGCCC AATATTGGAG CATTTATTGC TTGGGGAGTA TTGACTGCCC 780 

TCTTTATCGC TGATGGCTAT CTGCCAAATG AACAGTTAGC TACTGTTGTT GGTCCTATGT 840 

TAACGTATTT ATTGCCAATC CTGATTGGTT ACACAGGTGG ATATATGATC CATGGCCAAC 900 

GTGGTGCCGT TGTAGGAGCT ATTGCTACTG TTGGTGCAAT CACAGGTTCT AGTGTTCCTA 960 

TGTTTATCGG AGCTATGGTA ATGGGCCCAC TGGGAGGATG GACTATCAAG AAATTTGATG 1020 

AGAAGTTCCA GGAAAAAATT CGTCCCGGAT TTGAAATGTT AGTTAATAAC TTCTCAGCTG 1080 

GTCTCGTTGG TTTTGCATTA TTGCTTTTGG CTTTCTACGC AATCGGTCCA GTCGTATCGA 1140 

CTCTTACTGG AGCTGTTGGG AATGGTGTTG AGGCTATTGT CAATGCTCGC CTCCTTCCTA 1200 

TGGCTAATAT TATCATCGAA CCGGCTAAAG TCCTTTTCCT CAATAATGCC CTCAATCATG 1260 

GCATTTTTAC TCCTCTGGGA GTAGAACAGG TAGCTCAAGC TGGTAAGTCA ATTCTCTTCC 1320 

TATTGGAAGC TAATCCTGGA CCAGGTCTGG GAATTCTATT AGCTTATGCT GTATTCGGTA 1380 

AAGGTTCTGC TAAATCTTCT TCTTGGGGGG CAATGGTTAT TCATTTCTTC GGAGGGATTC 1440 

ATGAAATTTA CTTTCCTTAT GTTATGATGA AGCCTACTCT ATTTTTAGCT GCTATGGCAG 1500 
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GAGGTATCTC TGGAACTTTT ACTTTTCAAC TCTTAGACGC TGGTCTTAAA TCTCCAGCTT 1560 

CACCAGGTTC TATTATTGCG ATTATAGCTA CGGCGCCAAA AGGTGTTTGG CCCCATCTAA 1620 

ATGTTCTTTT AGGTGTTTTA GTGGCAGCAG TTGTTTCTTT CCTTGTAGCA GCCCTTATTC 1680 

TTCATGCAGA CAAGTCAACT GAGGATTCGC TCGAAGCTGC TCAGGCGGCT ACCCAAGCAG 1740 

CTAAGGCTCA GTCTAAAGGT CAGTTAGTAT CAACTTCTGT TGATGCAGTT GTTTCGACAG 1800 

ACTCAGTGGA AAAAATCATT TTCGCCTGCG ATGCTGGTAT GGGAAGCTCT GCTATGGGAG 1860 

CTAGTATTCT TCGAGATAAG GTTAAAAAAG CAGGTCTAGA GATTCCAGTA TCTAATCAGG 1920 

CAATCTCAAA TTTGCTTGAT ACACCAAAAA CATTAATTGT TACTCAGGAA GAACTGACAC 1980 

CAAGAGCTAA AGACAAGAGT CCAAGTGCTA TTCATGTTTC TGTTGATAAT TTCTTAGCGT 2040 

CCTCTCGTTA TGATGAAATT GTAGCTTCAT TAACAGGAGC TTCTCCAATA GCAGAAATTG 2100 

AAGGAGATAT ACCAACTTCA GCACCAGTAG ATAGTCAGGA AAGTGACCTT AACCATATTG 2160 

ATGCTGTAGT AGTTGCTTAT GGTAAAGCAC AGGGAACTGC AACTATGGGC TGTGAAACGA 2220 

TTCGGGCTAT TTTTAGAAAC AAGAATATTC GTATTCCAGT TTCTACTGCC AAAATTTCAG 2280 

AATTAGGTGA ATTTAATTCT AAAAACATAA TGATTGTAAC AACTATTTCT TTACAGGCAG 2340 

AAGTGCAGCA AGCAGCACCG AATTCTCAAT TTCTTATTGT GGATAGTTTA GTAACAACAC 2400 

CAGAATATGA CAAAATGGCT GCTAGAATGT ACAAATAGAA CTAGAGGTTT CTAAATTACG 2460 

AATGCTATTA ACCAAACGAG AAGAACAATT ATTGAAGGCT TTCCTACATG TAGGGAAGCT 2520 

TTCAATGCAA GATATGACTG AAATCTTACA GGTTTCATCT AGAACAATTT ATCGAACTTT 2580 

ATCAGATTTG ACAGATAGCA TGGAGCAATA TGGAATCGAA ATAACGAAGC ATGGGAAATA 2640 

CTATATTTTG ACTGGAGAGT TGGATGATTT GCCGACAGAA CTTGAAGTGT TAGTTGAGTA 2700 

TAGTCCCCAA GAAAGACAAG AGTTGATTAC CTATCGCCTT CTGACTGAGA GTGGTTTTGT 2760 

CACCAATGAA GCATTGCAAG AGTGCACGAA AGTCAGTAAT GTAACTATTA TTCAGGATAT 2820 

TTCAGATATT GATAAGCGTC TTTTAGACTT TGATCTGAAA ATTGAACGAC AAAAAGGTTA 2880 

TCGGATTTCT GGTGATTCAG TTGGTAAGAG AAGATTTTTG GCTATTTTAC TGACAAACTG 2940 

TATCTCAGTA GCAGATTTTT CAACCGGTAA TTTTGGGAGC TTTGATATTT TAGAAGCAGA 3000 

TAGAACTGGG CTGGCCAGTC AGATTGTTAA TAAGCAACTG TCAGGTTTTC CAGATATGGA 3060 

TGCTAGGATG AAGATGTTTT TTGCGATCTT GTTATCTCTT ATAGGTCAGG AGCAAAACAT 3120 

TGAAAATTCA CCTAATACTA GTAAGCAGGC TTTGGAAATT TCTCAAAAAA TTTTTCAAGC 3180 

TTACTCTAAG CAGACTGCAC AATTTTATAG TATTCAGGAA ATTATCTATT TTGCGAGCAT 3240 

CTTGGATGAA TTAATCATTA AACGTCAGGA CAATCCGCTC TTTACGGAGA AATTTGATGG 3300 
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TGAATTTTTC TACAATATTT CAAATCTGAT TGATACGGTT TCCATGTATA CCAAGATTGA 3360 

CTTTTTTAAG GACAAGGTTT TATTCAATTT TCTTTTCCAT CATATTCGGC TCAGTTTAGG 3420 

CGTCCCTATC CTTTTTCAGG GTGAAAATTT GCCAGAATCT ATCCAGATTT TAGTTGAAAG 3480 

GAATAAATTT CTTTATACAG TCATCAGTCT TTTAGTGAAT GATATTTTTC CGAAATATCT 3540 

TCATACAGAG TATGAGTATG GCATGATTGC CCTACATTTT ATCTCTAGCT TAGGCCGTAG 3600 

TCCAGAGATT TATCCAGTCC GTGTTTTGCT TTTAACGGAT GAACGTCGGG TCACTAGAGA 3660 

TTTATTAGTC AGTAAAATTA AGAGTGTTGC TCCTTTTGTA GAGTTGATAG ATATTCAATC 3720 

TCTAGTAGAT TACCACAGTA TTGATCTCAG TCAGTATGAT TATATTTTAT CTACCAAGCC 3780 

GCTGACTAAT CAGGAAATCG ATGTAATTTC TAGTTTTCCA ACCGTCAAAG AATTGCTTGA 3840 

ATTACAGGAA CGACTTCAGT ATGTACAGGC ACATCGTACA ATTGTCGCGC GTGATGCTAT 3900 

CGCTCCAGAG AAAAGTTATG ACTTGCAAGA TTATTTAATA TCTAGTAGTC AGCTTTTGAG 3960 

TCAATTCGAG TTGGTTCAAT TGGAGAATAA TCAATCATTT GAGCACACGG TAGAACAAAT 4020 

CATCCAATAT CAGAAGAATG TGAGTGACAG AGCTTACCTA ACAAGAAAAT TGTTATCTCA 4080 

CTTCCAGAAT AGTCCTATGG CTATTCCTAA TACTGGTCTG GTGCTTTTAC ATAGTCAGTC 4140 

TAGCAAAGTA ACAACAAATA GTTTTACTAT GTTTGAACTC AAACTACCTA TCTCCGCATT 4200 

GTCAATGAAA CGAGAGGAAG AAGAGGTCAA AAGGTGTCTG CTAATGCTAA TGTCTAAAGA 4260 

AGCTAGCGAG GAAGCTAGAG ATTTAATGAC AGCTATTAGT CAGTCGATTA TTGAAAATCA 4320 

TCTTTATACA GAGATTTACA AGACGGGAAA TCAATCCATT ATTTATCAGA TGCTAAATAC 4380 

TATTTTTAAC GAAAAAATTA AGAAATTGGA GAACTAATAT GAAACTTGAA AAACATTTGA 4440 

TTAAGCTTAA TAAACAATTT TCTAACAAGG AGGAAGCTAT TTGTTATTGT GGGCAAGTTC 4500 

TTTATGAGGG TGGATATGTT AATGAAGACT ATATTGAAGC CATGATTGAG CGAGATAAAG 4560 

AGCTATCTGT TTACATGGGT AACTTTATCG CCATACCGCA TGGAACAGAT GCAGCAAAAA 4620 

ATGATGTCCT CAAGTCTGGT ATTACAGTCG TTCAAGTCCC TAGAGGGGTT GATTTTGGGA 4680 

ATGTATCTAA CCCTCAAGTG GCAACGGTTC TTTTTGGTAT TGCTGGTATT GGTAATGAAC 4740 

ACTTAGAAAT TATTCAGAAA ATTTCTATCT TCTGTGCAGA TGTAGATAAT GTTCTTAAAC 4800 

TAGCAGATGC TCAGTCAAAA GAGGAAGTAT TGCGCTTATT TGATGCTGTT GAATAATTGA 4860 

ATTTAGTCAT TTGTCATCTA GTATATATGT CCCTCAAATA GGAAAAGGAG AAATTGAATG 4920 

AAACATTCTG TTCATTTTGG TGCCGGTAAT ATCGGTCGTG GTTTTATAGG TGAAATTCTA 4980 

TTTAAAAATG GTTTCCATAT TGATTTTGTG GATGTCAATA ATCAGATAAT TCATGCTCTG 5040 
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AATGAAAAGG GCAAGTATGA AATTGAAATT GCACAGAAAG GACAGTCTCG TATAGAAGTA 5100 

ACTAATGTGG CTGGCATTAA TAGCAAAGAA CATCCTGAGC AAGTCATTGA AGCGATTCAA 5160 

AAGACGGATA TTATTACTAC TGCAATCGGA CCTAATATAC TCCCTTTTAT CGCCGAACTT 5220 

CTAGCCAAAG GAATCGAAGC TCGCCGAGTT GCAGGAAATA CACAGGCATT GGATGTTATG 5280 

GCCTGTGAAA ATATGATTGG CGGGTCTCAA TTTCTTTATC AAGAAGTCAA GAAATATTTA 5340 

AGTCCGGAAG GTTTGACATT TGCTGATAAC TACATAGGTT TTCCAAATGC TGCAGTAGAC 5400 

AGGATTGTTC CAGCACAAAG TCACGAAGAT TCCCTTTTTG TTGTGGTCGA GCCCTTTAAT 5460 

GAATGGGTCG TGGAAACCAA GCGTCTTAAA AATCCAGATT TACGTCTAAA AGATGTGCAT 5520 

TATGAAGAAG ATTTAGAACC CTTTATTGAG CGAAAACTTT TTTCAGTCAA TTCTGGACAT 5580 

GCAACTTCAG CTTACATTGG TGCGCATTAT GGTGCCAAGA CAATTTTGGA AGCTCTTCAA 5640 

AATCCTAATA TTAAATCTCG GATTGAATCT GTATTAGCTG AAATTCGGAG TCTCTTGATT 5700 

GCCAAATGGA ACTTTGATAA AAAAGAATTG GAGAATTATC ACAAAGTCAT TATAGAACGA 5760 

CTTGAAAACC CTTTCATAGT GGACGAGGTT AGTCGCGTAG CTCGTACTCC AATCCGAAAA 5820 

TTAGGCTATA ATGAACGATT CATCCGGCCG ATACGTGAAT TGAAAGAACT CAGTTTGTCA 5880 

TATAAAAACC TACTTAAAAC AGTTGGCTAT GTCTTTGACT ATCGCGATGT AAATGATGAA 5940 

GAAAGTATTC GATTAGGTGA ATTGTTGGCT AAACAATCAG TCAAAGATGT TGTTATACAA 6000 

GTTACAGGTT TAGACGACCA AGAATTGATT GAGCAAATTG TAGAGTATAT TTAATCTTTT 6060 

TCGAAAATCT CTTCAAATCA GGTTAGCATC GCTTTGTCTT AGGCATATGT TGTTCTATCT 6120 

ACAACCTCAA AGCAGTGCTT TGAGCTGACT CCGTCAGTCT TATCTGCAAT CTCAAAACAC 6180 

TGTTTGAGTT ATCTGCGGTA ATCTTTCTAG CTTGTCTTTG ATTTTTGTTG TTATTTATAA 6240 

GGTAAAAGAA GCTGGACAAA AAGTCTTCAA AATCGGGAAA AGGCAGCCTA TCGGGTGTTC 6300 

AAAAATCTTG ATAGGATGTC CTTTATTATG GAAAGCCTTA TTGGATTTTC TCCTCAGATT 6360 

GAGTTTTTGA TCAGCTTTAT GAGATAGGTC TTGCTAGAGA TGTAGCCCAT CATGTTATTT 6420 

TTATGGACAG TGGGAAAATT GTTGAAAAAA ATAATGCCCA TCAATTCTTT AGTCGTCCAA 6480 

GAGAAGAACG AACCAAGCAA TTTTGGAACG AATTCTTTCG AATGCGATCT ATATAGTAAA 6540 

ATGAAACAAG AACAGGACAA ATCGATCAGG ACAGTCAAAT CGATTTCTAA AAATGTTTTA 6600 

GAAGTAGAGG TGTACTATTC TAGTTTCAAT CTACTATATA ACTGAAAAAT TAGATAAATT 6660 

AGTTTTGGAA AATGACTAAC CAAAAGATAT CCAAAGTAGT CTAAAATTGT CTATACTTTA 6720 

TGAGTGTTTT AGTTAGGAAA AAGGCTTGTT GTCTATAATT GTCTGCATTA GTCTAGATTT 6780 

TATTTATAGA AAATGTTATA ATAGACTGTA TTTAAAAAAT TTTAAGGAGA AATGACAGAA 6840 
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TGTCTGTATC ATTTGAAAAC AAAGAAACAA ACCGTGGTGT CTTGACTTTC ACTATCTCTC 6900 

AAGACCAAAT CAAACCAGAA TTGGACCGTG TCTTCAAGTC AGTGAAGAAA TCTCTTAATG 6960 

TTCCAGGTTT CCGTAAAGGT CACCTTCCAC GCCCTATCTT CGACCAAAAA TTTGGTGAAG 7020 

AAGCTCTTTA TCAAGATGCA ATGAACGCAC TTTTGCCAAA CGCTTATGAA GCAGCTGTAA 7080 

AAGAAGCTGG TCTTGAAGTG GTTGCCCAAC CAAAAATTGA CGTAACTTCA ATGGAAAAAG 7140 

GTCAAGACTG GGTTATCACT GCTGAAGTCG TTACAAAACC TGAAGTAAAA TTGGGTGACT 7200 

ACAAAAACCT TGAAGTATCA GTTGATGTAG AAAAAGAAGT AACTGACGCT GATGTCGAAG 7260 

AGCGTATCGA ACGCGAACGC AACAACCTGG CTGAATTGGT TATCAAGGAA GCTGCTGCTG 7320 

AAAACGGCGA CACTGTTGTG ATCGACTTCG TTGGTTCTAT CGACGGTGTT GAATTTGACG 7380 

GTGGAAAAGG TGAAAACTTC TCACTTGGAC TTGGTTCAGG TCAATTCATC CCTGGTTTCG 7440 

AAGACCAATT GGTAGGTCAC TCAGCTGGCG AAACCGTTGA TGTTATCGTA ACATTCCCAG 7500 

AAGACTACCA AGCAGAAGAC CTTGCAGGTA AAGAAGCTAA ATTCGTGACA ACTATCCACG 7560 

AAGTAAAAGC TAAAGAAGTT CCGGCTCTTG ACGATGAACT TGCAAAAGAC ATTGATGAAG 7620 

AAGTTGAAAC ACTTGCTGAC TTGAAAGAAA AATACAGCAA AGAATTGGCT GCTGCTAAAG 7680 

AAGAAGCTTA CAAAGATGCA GTTGAAGGTG CAGCAATTGA TACAGCTGTA GAAAATGCTG 7740 

AAATCGTAGA ACTTCCAGAA GAAATGATCC ATGAAGAAGT TCACCGTTCA GTAAATGAAT 7800 

TCCTTGGGAA TTTGCAACGT CAAGGGATCA ACCCTGACAT GTACTTCCAA ATCACTGGAA 7860 

CTACTCAAGA AGACCTTCAC AACCAATACC AAGCAGAAGC TGAGTCACGT ACTAAGACTA 7920 

ACCTTGTTAT CGAAGCAGTT GCCAAAGCTG AAGGATTTGA TGCTTCAGAA GAAGAAATCC 7980 

AAAAAGAAGT TGAGCAATTG GCAGCAGACT ACAACATGGA AGTTGCACAA GTTCAAAACT 8040 

TGCTTTCAGC TGACATGTTG AAACATGATA TCACTATCAA AAAAGCTGTT GAATTGATCA 8100 

CAAGCACAGC AACAGTAAAA TAATCTTAAT AAACAGAAAA CCCACCTGAA TTGGTGGGTT 8160 

TTCTGATGCA CTATTTTCCA AAAATCTCTT TGAGGTCTGT GTCTGTAATC CCAATCATGG 8220 

CTGGGATGCG GTCCCAGTTT TCTTCGGTTA GGATGTAGGA TTGTTCAGAG GCACTTGATG 8280 

TGACTGTTTC AGAGACAGCT TGTTGCTTTT CTTCAACATT CTCCAGTAGA TCACTGAAGC 8340 

GTTCAATCAG ATAGGTTTTT CGGGCAGTTC CGATGTGTTG GGTAGCATAG TCGAAGGCTT 8400 

GTAATTCGCC TAGTAAGATG AGTTTGCTTT TGGCACGTGT AATGGCTGTG TAGATGAGAT 8460 

TTCGCTCCAG CATACGTCGG CTAGCACTAG TAATCGGTAG GATGACAACT GGGAACTCAC 8520 

TTCCCTGAGA CTTATGAATA CTCATGGCAT AGGCCAAGCG AATCTTGTAC CATTCGTTAC 8580 
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GGGGGTAAGA GACTTCATTA CCATCAAAAT CAATGACAAT CTCGTCTTGT TTCGATTCGG 8640 

TGTATTTACC AGGAATCAGG TCTGTGATAG CTCCTAAATC CCCATTAAAG ACATTGATTT 8700 

CAGCATCGTT AACCAAATGA ATGACCCTGT CTCTCTTACG ATAGTGACAC TGAGGAGCTT 8760 

CAAAACTGAG TTGATCTTTT TGTGGGGGAT TGAGCAGGTC TTGCATGAGC TGATTGATAG 8820 

CATCAATCCC TGCCGTCCCT CGGTACATAG GAGCCAGAAC TTGGATATCA CGGGCGGGAA 8880 

TACCATTTCT GAGGGCGGCA CCTAAGATTT TTTCAATGGT GGCAGGAATA TGGCCACTAG 8940 

CAATTTCAAA GTAGGAACGG TCAGCTTTTT TTTGGGTGAA ATCAGCTGGC AAGATGCCCT 9000 

GTCGAATCTG ACTAGCTAGG GTGACGATGG TTGATTCTTT GCTTTGTCGA TAAATTTTTT 9060 

CCAAGCGAGT CTGAGGAATC AAAGGAATAT GAAGTAGATC CGCTAGAACC TGTCCAGGAC 9120 

TGACAGAAGG TAGCTGATCA CTGTCACCTA CGATGAGGAT CTTACTGTTA GAAGAGATAT 9180 

TGGAGAAGAG TTGATTGGCC AGCCAAGTAT CTACCATAGA GAATTCATCC ACGATGATAA 9240 

AGTCAGCATC TAGGTAATCT TCCAGATGAC TGGTATCATC GTCACCTGTC ATTCCCAAGT 9300 

GGCGATGTAT GGTCGCGCTA GGCAAACCTG TCAATTCATT CATGCGACGA GCAGCTCGAC 9360 

CAGTTGGAGC AGCAAGAAGA ATGGGCAGAT TGCTTTTCTT CCTGAAGTCA AGTCCTTCTA 9420 

AAAGGGCATA AACAGCAATG ATTCCATTGA TAACAGTTGT CTTACCAGTA CCAGGCCCAC 9480 

CTGTCAGGAT AAAGACCTTA TTCTGGATAG GATCACAGAT AGCCTGTTTT TGAATGTTAT 9540 

CATACTCAAT TCCCAGTTCT TGCTCGACAG TAGTGATATG TTTTTGAATG GTTTCTAAAT 9600 

CATGACTCTT CTGTTTTCCT TTTTCAAGGA TACGAACCAA GTGACTGCGG ATGCCTTCCT 9660 

CAGCGAAAAA GAGGCTGTTG TCAAAGATCT TGGTATCAAT CTGCTGAACC TTGTCTTCTT 9720 

CGATCAGGTA GGAGAGCTCT TGGGCAACTT GGCTGGGGTC TAGTTCCACG GGACGGGAAG 9780 

ACTCAAGGAG AGTAAGGGTT TGTTCCAGCA AATCCCGTGC TTCAACATAG GTGTCCCCTG 9840 

TTTCCATACA GGCCTGAAAA AGACTGTGAA CTAGACCGGC GCGGAAGCGT TCAGGAGCCT 9900 

GACTTTCGAT GCCTAGTTCC TCAGCTAGTT GGTCAGCAAT GGTAAAGCCC AAACCCTTGA 9960 

TATCCTCAAC CAGCTGGTAG GGATAATTTT CAACCACATC AAGGGTTTCT TCCTTGTAAA 10020 

AGTCTTGAAT CTGAAAGGCT AGTTTGTTGG GAATGCCGTA GTTGGCTAGT TTGGCCAAAA 10080 

TCATCTCCGT TCCGTAGTTG AGACGGAGAG TGGAGACGAA AGCCTCGCGA TTTTTGGCAG 10140 

AGAGTCCTGC GATGCCTTCT AACTTTTCTG GGTGTTGCAA AATTTCGTCA ATGGTATTTT 10200 

CGCCATAGGT ATCCACGATT TTCTGAGCTG TCTTGAGACC AATCCCCTTG AAATGGCTAC 10260 

TTGAAAAGTA CTTGACCAAG CCCTTACTAG TTGGTTTTGC GCGATCATAA CGACTGATTT 10320 

GCAGTTGTTC TCCATACTTG GAGTGCTGGA CAATTTGCCC CCAAAAAGTA TAGTCTTCGC 10380 
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CCTCAATTAC ATCAGCCATG GTTCCTGTGA CAATGATTTC AAAATCATCA AAATCCTCTG 10440 

CGTCCGTATC GTCGATTTCT AGGAGGAGGA TGCGATAAAA ATTGCTGGGA TTTTCAAAAA 10500 

TAATCCGTTC AATAGTTCCT GAAAAATAAA CTTCCATAAA ATTCCTTTGC ATGAATAGGT 10560 

GAGAGTTGGG ATTGTTTTTA TTTTATACTC TTCGAAAATA TCTTCAAACC AGGTCAGCTT 10620 

CCATCTGCAA CCTCAAAACA GTATTTTGAG CTGACTTCGT CAGTTCTATC CACAACCTCA 10680 

AAACACTGTT TTAAGCAGCC TACGGCTAGC TTCCTAGTTT GTTCTTTGAT TTTCATTGAG 10740 

TATTTGTAAA TAAACAATCA CTTCTCACGA TAGAAGAAGA GGCTGAGATT GGTGATTCTC 10800 

TGCCTCTTAG GTTTCTTAAA ATGTTCCGAT ACGGGTGATT GGCCATAAGC GGAATTTAGC 10860 

TTCCCCTGTG ATATCTTTTG CTTTGAAGGT ACCTACGTGG CGGCTGTCGC TCGAAACCAA 10920 

GCGGTCATCT CCGAGGAGAA GGTATTCTCC TTCTGGAACA GTAAAGCTAA AGTTGGTGTT 10980 

GTAGTTGACA TCAACTGTGA AGGCTTGAGC TTTTTGAGCG ATACTTCTAA AGAAAGTTCC 11040 

TTTATTTCCT TCAAAGCCCT TGCCTGAGTA AGTGCTTTGG AGTTTGTCAT CCTTGAAGCG 11100 

TTTGATATAG TCTGCTAGAT AAGGCTCGTC CGTTTCTTTG TCATTGATGT AGAGTTTATC 11160 

ATTTTCGTAA CGAATGGTGT CGCCAGGCAT TCCAATCACG OGCTTGACGA TGTCCTTATT 11220 

GCCATCTTCC TCATGGGCCA CCACGATATC AAAACGGTCA ATAGGAAGGT GTTTTACAAC 11280 

GAAGAGAATT TCGCCATCCG CTAGGGTCGG ATCCATGGAA TGTCCTTCTA CGCGAACATT 11340 

GCTCCAAAAA AAGATACGAC TTAAAGCTAG TAATGACAGA ATTAGGAGGA ACAATCCCCA 11400 

CTCTTTTAAG AAATTTTTAA ATGAATTCAT AACTTACCTT TCTAAGCGTT TTTTCGCTTT 11460 

TTCAGTGTTT TTAAAGTGCA ATTTGGCGCA GAAGCTGAGT CCCTGCATAC CATAGGCTTG 11520 

CAAAATCTGG CTAGCCACCT TGTCAGAAGC CGTTCCAGCT CCACTTGGGA GCTGATAACC 11580 

CAGTTCTCGT CCCAAATTTT CAAGATTTTC CAGAAAGAGA TCACGCGCAA TGACAGAAGA 11640 

AACTGCGACA GACAAGTATT TGCCCTCAGC CTTTTCTTCT AAGCTGATAG GATTGCTGAA 11700 

ACGATTGGCC TCTTGTGCCA AGTACTTGTC ATAATTTTTA GCACTGGTAA AGGCATCAAT 11760 

CACAATTTTC TCAGGCTGAA CACCTTTTTG AAGGAGGAGA TAGATAGCCT GATTATGGAG 11820 

GGCAACCTTA ACCGAAACAG CGTTGTAGCG GTCTCCGATG ACCTCGTTGT ACTTGCTGGG 11880 

TGAGAGAAGG AGTGCCTGGT GCTGAATTTT TTCCTTGAGA ATAGGAGTAA TCTGACGGAT 11940 

CTTTTGGTCG GTCAGAGTCT TAGAATCCCC CACACCGAGT TTTCGTAAAA AGTCGTGCTG 12000 

GTCAGGTGTG ACAAAGGCAG CCACAACTGC AAGCCCACCA AAGTAGGAAC CATTTCCCAC 12060 

CTCATCTGTC CCAATTAAAG GAAGATTTTG TCCGCTGGTT TGCTCTACAG CTTGATAGCC 12120 
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AAAGAAACTG GCGTATTTTT CAGCCCCTTC ACCCTGAAGC AAGATTTTTC CAGAAGTATA 12180 

GATAGAAACC GTTGCTTGAG GTAGTTTCAA AAAGTAGCGG ATATAGGGAT TCTTGCTAGG 12240 

AGCCAGACTG GTTTGATAGT GTTCAAGAAA AGOCTGAATA TCCTTTTCGC TTGGTGTGAG 12300 

TGTGATACTT GCCATAGTTT CTATTGTACC ACAAAAGCAG TAAAATTTGT AAAAACTGAC 12360 

AAAATTAGCG AATTTTGGTA TAATATCG.TG AGGTGAATTT TATGGCAAAT CTAAATCGAT 12420 

TCAAATTTAC ATTCGGGAAA AAATCGTTAA CCTTGACAAG CGAACATGAC AACCTTTTTA 12480 

TGGAGGAAAT CGCTAAGGTT GCGACAGAAA AATACCAAGC AATTAAAGAA CAAATGCCTA 12540 

GCGCAGATGA TGAAACAATC GCTCTTTTGT TGGCAGTCAA CTGTTTATCA ACTCAGCTCA 12600 

GCCGTGAGAT TGAATTTGAC GATAAGGAGC AAGAGCTAGA AGAACTCCGT CACAAGCTTG 12660 

TGACTTGTAA GCAAGAACAG AGCAAGATTG AGGATTCCTT ATGATTTCAT TCCTTCTTCT 12720 

ATTGGTCTTG GTTTGGGGAT TTTATATCGG CTATCGGAGA GGCCTGCTCT TACAGGTTTA 12780 

TTACCTGATT TCAGCCATGG CATCGGCTTT TATGGCTGGC CAGTTTTATA AGGGGCTTGG 12840 

AGAGCAATTC CATTTATTGC TCCCTTATGC AAATTCGCAG GAAGGTCAGG GGACTTTCTT 12900 

TTTCCCATCG GATCAACTCT TTCAGCTGGA TAAGGTCTTT TATGCAGGTA TCGGCTACTT 12960 

GCTTGTATTT GGGATTGTCT ATAGCATTGG TCGTTTACTT GGTCTTCTCT TACACTTGAT 13020 

TCCTAGCAAA AAACTGGGTG GTAAGTTGTT CCAAGTTTCA GCAGGTATCT TGTCCATGTT 13080 

GGTGACCTTA TTTGTCTTGC AAATGGCCTT GACAATCTTG GCGACCATCC CCATGGCAGT 13140 

TATACAAAAT CCTCTTGAAA AGAGTATCGT CGCAAAACAC ATCATCCAGA GCATACCGGT 13200 

AACAACCAGT TGGCTCAAAC AAATCTGGGT GACAAATTTA ATCGGATAAA AAGGGCAGGA 13260 

GTTTTCCTAG CCCTTTGTTT ACAGATTTGA CTCGAATCTA TCAGAATGTA AAAAGCTACC 13320 

ACACCTAGAC ATTCAAAGAC AAGGAAATAA AGATGAATAA GAAAATATTA GAAACATTAG 13380 

AGTTCGATAA GGTCAAGGCC TTGTTTGAGC CTCATTTGTT GACCGAGCAG GGCTTGGAGC 13440 

AATTGAGACA ACTGGCTCCG ACTGCCAAAG CAGATAAAAT CAAACAGGCT TTTGCTGAGA 13500 

TGAAGGAAAT GCAGGCTCTT TTCGTCGAGC AACCGCATTT TACTATTCTC TCAACTAAGG 13560 

AAATTGCAGG AGTCTGCAAG AGGTTGGAGA TGGGAGCGGA TCTCAATATC GAGGAGTTCC 13620 

TACTCTTGAA ACGCGTGCTT CTTGCCAGCC GAGAACTTCA AAATTTTTAC ACCAATCTGG 13680 

AAAATGTCAG CTTGGAAGAA TTAGCCCTTT GGTTTGAGAA ATTACATGAT TTTCCGCAAT 13740 

TACAAGGAAA TCTTCAGGCC TTTAATGATG CGGGTTTCAT TGAAAATTTT GCCAGTGAAG 13800 

AATTGGCGCG AATCCGTCGA AAAATACATG ATAGCGAGAG TCAGGTACGC GATGTTTTAC 13860 

AAGACTTGCT CAAGCAAAAA GCGCAGCTGT TGACGGAAGG AATTGTTGCT AGCAGAAATG 13920 
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GCCGTCAGGT TTTACCAGTC AAAAACACCT ACCGCAATAA GATTGCAGGT GTCGTTCATG 13980 

ATATTTCTGC TAGTGGAAAC ACCGTCTATA TCGAACCCCG TGAGGTAGTC AAACTGAGCG 14040 

AAGAAATTGC TAGTCTGCGA GCAGATGAGC GCTATGAAAT GCTTCGCATT CTCCAAGAAA 14100 

TTTCTGAGCG TGTCCGCCCT CATGCGGCTG AGATTGCTAA TGACGCTTGG ATTATCGGTC 14160 

ATCTGGACTT GATTCGTGCC AAGGTTCGAT TTATCCAAGA AAGACAAGCA -GTCGTGCCTC 14220 

AGCTGTCAGA AAATCAAGAG ATTCAACTGC TCCATGTCTG CCATCCTTTG GTCAAAAATG 14280 

CCGTCGCAAA TGATGTCTAT TTTGGTCAAG ATTTAACAGC TATTGTCATT ACAGGTCCCA 14340 

ATACAGGTGG GAAGACCATC ATGCTCAAAA CTCTGGGCTT GACACAGGTC ATGGCCCAGT 14400 

CAGGATTGCC GATTTTAGCA GACAAGGGAA GTCGTGTTGG TATTTTTGAA GAAATCTTTG 14460 

CTGATATTGG AGATGAGCAG TCTATTGAGC AGAGCTTGTC TACCTTCTCT AGTCATATGA 14520 

CCAATATCGT GGATATTCTT GGCAAGGTCA ACCAACATTC ACTCTTACTT TTGGATGAGT 14580 

TGGGGGCTGG TACTGATCCC CAAGAGGGAG CAGCCCTTGC CATGGCTATT CTGGAGGACC 14640 

TTCGCCTGCG TCAAATCAAG ACCATGGCGA CGACCCACTA TCCAGAACTC AAGGCCTACG 14700 

GTATTGAGAC AGCCTTTGTG CAAAATGCCA GTATGGAGTT TGATACTGCA ACTCTTCGCC 14760 

CGACCTATCG CTTTATGCAG GGTGTTCCTG GCCGAAGTAA TGCCTTTGAA ATTGCCAAAC 14820 

GTCTAGGCCT ATCTGAAGTT ATCGTAGGAG ATGCCAGTCA GCAGATCGAT CAGGACAATG 14880 

ACGTCAATCG TATCATTGAG CAATTAGAAG AGCAGACGCT GGAAAGCCGC AAACGTTTGG 14940 

ACAATATCCG TGAGGTGGAG CAAGAAAATC TCAAGATGAA CCGTGCGCTA AAAAAACTCT 15000 

ACAACGAGCT TAATCGTGAA AAGGAAACCG AGCTTAACAA GGCGCGTGAA CAGGCTGCTG 15060 

AGATTGTGGA TATGGCCCTA AGTGAAAGTG ACCAGATTCT CAAAAATCTC CACAGTAAAT 15120 

CCCAACTCAA GCCCCACGAA ATCATTGAAG CCAAGGCCAA GTTGAAAAAA TTGGCTCCTG 15180 

AAAAAGTGGA CTTGTCTAAA AATAAGGTCC TTCAAAAGGC CAAGAAAAAA CGAGCTCCAA 15240 

AGGTGGGAGA TGATATCGTG GTTCTCAGTT ATGGTCAGCG TGGTACCTTG ACCAGTCAAC 15300 

TCAAGGACGG TCGCTGGGAA GCCCAAGTTG GCTTGATTAA GATGACCTTG GAAGAGAAAG 15360 

AGTTTGATCT TGTTCAAGCC CAGCAAGAAA AACCAGTCAA GAAGAAACAG GTCAATGTTG 15420 

TGAAACGAAC TTCTGGGCGA GGACCTCAAG CTAGACTGGA TCTTCGAGGC AAGCGCTATG 15480 

AAGAAGCCAT GAATGAGCTA GATACCTTCA TCGACCAAGC CTTGCTTAAC AATATGGCTC 15540 

AAGTTGATAT CATCCATGGT ATCGGAACAG GAGTCATCCG TGAAGGAGTT ACCAAATACT 15600 

TGCAAAGAAA CAAACATGTC AAGAGTTTCG GCTATGCCCC ACAAAATGCT, GGAGGCAGTG 15660 
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GTGCGACTAT TGTCACTTTT AAAGGATAGC AGTATTCTGG ACTTTATAAA GTAAAAACTG 15720 

TTGAACTAAT TTTTACTAAT AAACACATTG ACAAAAGCCA ACATTTTTTG TAAAATTAGA 15780 

ATCAATTAAA TACCAACACC GAATGAAGTT TAATAGAAGT GGGGAATCGT TTGATTTTCC 15840 

ATGACTGTAA ATGGACGGAA CTCTGGAGAG ACCGTAAAGG CACCGAAGGG CAAGGCAGGC 15900 

AACTGCTCAA ACTCTCAGGT AAAAGGACAG AGCTAGGATA GACCGCTTTT TAGCATTTAT 15960 

CTAAGCATTC CAGAGTACAT GTATCTTGCA TGTGCTCTTT CTTTTGGGGT TGAAACGATA 16020 

GGAGAAGGAA ATGTTAGAAT TGCTTAAATC AATCGATGCT TTTGCTTGGG GACCGCCCCT 16080 

CTTGATTTTA TTGGTCGGAA CAGGGATTTA CCTAACTATT CGGCTAGGAC TCTTGCAGGT 16140 

TTTGCGTCTA CCCAAGGCCT TTCAGCTTAT TTTTATCCAG GATAAGGGAC ATGGTGATGT 16200 

ATCCAGTTTT GCAGCTCTGT GTACAGCCTT GGCATCAACT GTTGGAACAG GAAATATCAT 16260 

AGGAGTTGCG ACGGCTATCA AGGTTGGTGG ACCAGGAGCT CTATTTTGGA TGTGGATGGC 16320 

GGCTTTCTTT GGAATGGCTA CCAAGTATGC GGAAGGACTC TTGGCCATCA AATACCGCAC 16380 

CAAGGACGAC CATGGTGCAG TAGCGGGAGG TCCCATGCAT TATATCCTTC TAGGGATGGG 16440 

AGAAAAGTGG CGACCACTTG CTGTTTTGTT TGCAGTAGCA GGAGTATTGG TTGCTCTCTT 16500 

GGGAATCGGA ACCTTCACCC AAGTCAACTC GATTGCAGAA TCTATCCAAA ATACAACGAC 16560 

GATTTCGCCA GCCATCACAG CTCTCGTCTT GTCTGTCTTT GTAGCGATTG CAGTCTTTGG 16620 

TGGACTCAAG TCTATTTCTA AGGTTTCAAC TACTGTTGTT CCTTTTATGG CCATCATTTA 16680 

TATCTTAGGA ACTCTTACAG TTATTTTCTT TAATATCGGA AAAATCCCTG GCACAATCGC 16740 

TTTAGTCTTT ACCTCAGCTT TTAGTCCCCT TGCTGCGGTA GGTGGATTTG CTGGTGCTAG 16800 

CGTTCGGATG GCTATTCAAA ATGGTGTGGC GCGTGGTGTG TTCTCAAACG AATCTGGTCT 16860 

GGGTTCTGCT CCTATTGCAG CTGCAGCTGC CAAGACAAAT GAACCAGTAG AGCAAGGTTT 16920 

GATTTCCATG ACAGGAACCT TTATTGATAC CCTCATCATT TGTACTCTAA CTGGTTTGAC 16980 

CATCTTGGTA ACTGG 16995 
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28473 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDKESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

CCGGGGCTTT TGTAGTATAA TAGAGATACG TTTTGAAAGT AGGAGGTATC TATGGACTTA 60 
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ACTAAGCGCT TTAATAAACA GTTAGATAAA ATTCAAGTTT CGTTGATTCG TCAGTTTGAC 120 

CAGGCTATTT CGGAGATTCC TGGGGTCTTG CGTTTGACCT TGGGGGAACC TGATTTTACA 180 

ACGCCAGACC ATGTCAAGGA GGCGGGCAAG CGAGCGATTG ATCAGAACCA ATCCTACTAT 240 

ACAGGGATGA GTGGTCTGCT GACTCTACGT CAGGCAGCCA GTGACTTTGT TAAGGAAAAG 300 

TACCAACTGG ACTATGCTCC TGAAAATGAA ATCTTGGTTA CAATTGGGGC GACAGAGGCT 360 

TTATCTGCGA CTTTGACGGC TATTTTGGAA GAGGGAGACA AGGTACTTTT GCCAGCTCCT 420 

GCTTATCCAG GCTATGAACC GATTGTTAAC TTAGTTGGGG CAGAAATTGT TGAGATTGAT 480 

ACGACTGAAA ATGGTTTTGT CTTGACTCCT GAGATGTTGG AGAAGGCCAT TTTGGAGCAG 540 

GGTGATAAGC TCAAGGCGGT TATTCTCAAC TATCCAGCCA ATCCGACAGG AATTACCTAC 600 

AGTCGAGAGC AGTTAGAGGC CTTGGCAGCT GTTTTACGCA AGTACGAAAT TTTTGTTGTC 660 

TGTGATGAGG TTTACTCAGA ATTGACCTAC ACAGGCGAAG CCATGTGTCT CTAGGAACGA 720 

TGTTGAGAGA CCAGGCTATT ATTATCAATG GTTTGTCTAA ATCGCATGCC ATGACAGGTT 780 

GGCGTTTGGG GCTGATTTTC GCTCCTGCGA CCTTCACAGC CCAGTTAATC AAGAGTCACC 840 

AGTACTTGGT CACTGCCGCA AATACCATGG CGCAACATGC TGCGGTAGAA GCCTTGACGG 900 

CTGGTAAAAA CGATGCGGAC CCATGAAGAA GGAATATATC CAACGTCGGG ACTATATCAT 960 

CGAAAAAATG ACTGCTCTTG GTTTTGAGAT TATCAAACCA GACGGTGCCT TCTATATTTT 1020 

TGCTAAAATT CCAGCGGGCT ACAATCAAGA CTCCTTTGCT TTTCTGAAGG ATTTTGCTCA 1080 

GAAGAAGGCC GTTGCCTTTA TCCCTGGTGC AGCCTTTGGA CGTTACGGGG AAGGCTACGT 1140 

CCGCCTATCT TATGCAGCCA GCATGGAGAC TATCAAAGAA GCCATGAAAC GACTTGAGGA 1200 

GTACATGAGA GAAGCATGAT TCAGTCTATC ACGAGTCAAG GCTTGGTGCT TTACAATCGC 1260 

AATTTTCGTG AGGATGACAA GCTCGTCAAA ATTTTTACAG AGCAGGTTGG CAAACGCATG 1320 

TTTTTTGTCA AACACGCTGG TCAGTCTAAG CTGGCGCCTG TTATTCAGCC CTTGGTGCTG 1380 

GCACGATTTC TCTTGCGAAT CAATGATGAC GGACTCAGTT ACATCGAAGA CTATCATGAG 1440 

GTCATGACTT TTCCCAAGAT TAATAGTGAC CTCTTTGTCA TGGCCTATGC GACCTATGTG 1500 

GCAGCTCTTG CAGATGCTAG TTTGCAGGAC AATCAGCAGG ATGCTCCCTT GTTTGCTTTT 1560 

TTGCAAAAGA CTTTGGAGTT GATGGAAGCA GGCTTGGATT ATCAGGTTTT GACCAATATT 1620 

TTTGAAATTC AAATTTTGAC TCGATTTGGA ATCAGCCTCA ATTTTAATGA GTGTGTCTTC 1680 

TGCCATCGGG TTGGTCAGGC TTTTGACTTT TCTTTCAAAT ATGGAGCCTG CCTCTGTCCA 1740 

GAGCATTATC ATGAGGATAA GAGACGTTGT CATCTCAATC CCAATATCCC CTATCTGCTC 1800 
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AATCAATTTC AAGCTATTGA TTTTGAGACT TTGGAGACCA TTTCGCTCAA GCCTGGAATC 1860 

AAGCAAGAGC TACGCCAATT TATGGATCAA TTATATGAAG AGTACGTTGG GATTCACCTA 1920 

AAATCAAAGA AATTTATTGA TTCCCTAGCA GACTGGGGAC AATTACTAAA AGAGGAAAAG 1980 

AAATGAAAAA AATCGCAGTA GATGCCATGG GGGGCGATTA CGCACCTCAG GCCATTGTTG 2040 

AGGGTGTCAA TCAAGCCCTA TCTGACTTTT CAGATATCGA GGTTCAACTT TACGGAGATG 2100 

AAGCTAAAAT CAAGCAATAT CTGACAGCGA CAGAGCGCGT CAGCATTATC CATACGGATG 2160 

AGAAGATTGA TTCGGATGAT GAACCTACGA GAGCTATTCG GAATAAGAAA AATGCCAGTA 2220 

TGGTATTGGC AGCCAAGGCT GTCAAAGATG GTGAAGCAGA CGCTGTCCTT TCGGCTGGGA 2280 

ATACAGGTGC CTTGTTGGCA GCAGGATTCT TCATCGTGGG TCGTATCAAG AATATCGACC 2340 

GTCCTGGACT CATGTCTACC TTGCCTACCG TTGATGGAAA AGGTTTTGAC ATGCTAGACC 2400 

TTGGTGCCAA TGCAGAAAAT ACAGCCCAGC ACCTCCATCA ATATGCGGTT CTAGGTTCCT 2460 

TCTATGCTAA AAATGTCCGT GGCATTGCGC AACCACGCGT TGGTTTGCTC AACAACGGAA 2520 

CAGAGAGTAG CAAGGGCGAC CCGCTTCGTA AGGAAACTTA TGAATTACTG GCGGCTGATG 2580 

AAAGTTTGAA CTTTATCGGA AACGTGGAAG CGCGTGATTT GATGAATGGC GTTGCAGATG 2640 

TTGTTGTGGC AGATGGTTTC ACGGGAAACG CTGTGCTCAA ATCCATCGAA GGGACAGCTA 2700 

TGGGAATCAT GGGCTTGCTC AAGACAGCTA TTACAGGTGG TGGTCTTCGA GCGAAACTAG 2760 

GTGCCCTCCT TCTCAAGGAC AGCCTCAGTG GTTTGAAAAA ACAGCTCAAT TATTCAGATG 2820 

TTGGTGGAGC GGTCTTGTTT GGTGTTAAGG CACCTGTTGT CAAGACTCAT GGCTCAAGCG 2880 

ATGCCAAGGC TGTTTATAGT ACGATTCGTC AGATCCGTAC CATGCTAGAA ACAGACGTGG 2940 

TTGCCCAGAC TGCGCGTGAA TTTTCAGGAG AATAAAAGAG ATGACAGAAA AAGAAATTTT 3000 

TGACCGTATT GTGACCATTA TCCAAGAGCG ACAGGGAGAG GACTTTGTCG TGACAGAATC 3060 

CTTGAGTCTG AAAGACGATT TGGATGCGGA TTCTGTTGAC TTGATGGAGT TTATCTTGAC 3120 

TCTGGAAGAT GAATTTAGTA TCGAAATCAG CGATGAAGAA ATTGACCAAC TCC AAAACG ? 3180 

AGGAGATGTG GTTAAAATCA TTCAAGGAAA ATAGCAATCG GAGTTCCAAG TCAACGGAAG 3240 

TAGATGGTTT TTAGAAATGA GAAATATCGG ACAAGCTGGT AAAATCTTGG CTGACAGTGG 3300 

TTATCAAGGG CTCATGAAGA TATATCCTCA AGCACAAACT CCACGTAAAT CCAGCAAACT 3360 

CAAGCCGCTA ACAGTTGAAG ATAAAGCCTG TAATCATGCG CTATCTAAGG AGATAAGCAA 3420 

GGTTGAGAAT ATCTTTGCCA AAGTAAAAAC GTTTAAAATG TTTTCAACAA CCTATCGAAA 3480 

TCATCGTAAA CGCTTCGGAT TACGAATGAA TTTGATTGCT GGTATTATCA ATCATGAACT 3540 

AGGATTCTAG TTTTGCAGGA AGTCTAATAG TAAAAAAGTG ATTAGAAAAC ATCTTTTTTA 3600 
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AAAATAGAGA 


TGATTTTGAA 


ACAAAAAAGC TAATTCAAGA CGTTTCGATG 


CCAATTCAAG 


3660 


ATTTGGATGA 


AAAAAATTAA 


TAGATACTGT 


TATACTAAAC 


TTGTCAAGTT 


TGTAACAAGA 


3720 


CAAATATTAA 


AAATAAAAAA 


GAGGTATTCG 


TTATGAATAC 


AAAAACGATG 


TCACAATTTG 


3780 


AAATTATGGA 


TACTGAGATG 


CTTGCTTGCG 


TTGAAGGTGG 


CGGATGCAAT 


TGGGGAGATT 


3840 


TTGCCAAAGC 


AGGTGTTGGA 


GGAGGAGCAG 


CACGAGGTCT 


TCAGCTAGGA 


ATTAAAACAA 


3900 


GAACATGGCA 


AGGTGCAGCA 


ACTGGTGCTG 


TGGGAGGAGC 


TATACTTGGA 


GGTGTGGCCT 


3960 


ATGCAGCGAC 


ATGTTGGTGG 


TAATTATGGA 


TTTTAAAAGT 


TTTATTATTG 


GTTTAGTAGT 


4020 


TGGTATATTT 


GGTCCTTATA 


TGGATGATTT 


AATTAGAAAA 


AAATTTTTAA 


AGTCTTCGGA 


4080 


GAAGAAAACA 


GAAAAATCTG 


TTAAAAAATA 


ATCAAAACTA 


TAAATGATGA 


ATCTGAATCA 


4140 


AAATTATTTT 


GCGCATGTAA 


AGAGGAGTCT 


TATAGTAACG 


AGTCAAAAAA 


GGAGTAACTA 


4200 


TGAATCGTAA 


TTTAGAACGG 


TGTTATCTAT 


TCTGACTAGG 


AATAGATCAT 


ACCAGAGGTA 


4260 


GCTTAGAAAT 


AGCAGAGACA 


TTAGAAATTG 


AAGTAATAAA 


TAGGATGTCG 


TAAGTGTTAC 


4320 


TATCAATGAT 


TTATTTGTTT 


CAAGCTTGCC 


TAGGGTGACA 


GTAAAAAATC 


AATTTCCTTT 


4380 


CAATAGCATA 


TTTTTAGTGG 


GCAGGACTCT 


TGTTCTGCCT 


ATTTTTTTAT 


CCAAAAAGTG 


4440 


CAGTTGGGAG 


GGAGATAGGC 


TCATTTGGGA 


AGGAAGTCCA 


GTTTTTGTTT 


AGTGATTGGG 


4500 


GTAAGATAGT 


TGTTATCAGA 


TGAGTTAATA 


CTCTTCGAAA 


ATCAAATTCA 


AACCACGTCA 


4560 


ACGTCGCCTT 


GCCGTATATA 


TGTGACTGAC 


TTCGTCAGTC 


CTATCTACAA 


' CCTCAAAACA 


4620 


GTGTTTTGAG 


CAGCCTACGG 


CTAGTTTCCT 


AGTTTGCTCT 


TTGATTTTCA 


TTGAGTATTA 


4680 


GGGAAAAGGA 


GATGAATATG 


AAATTTGGGA 


AACGTCATTA 


TCGTCCGCAG 


GTGGATCAGA 


4740 


TGGACTGCGG 


TGTAGCTTCA 


TTAGCCATGG 


TTTTTGGCTA 


CTATGGTAGT 


TATTATTTTT 


4800 


TGGCTCACTT 


GCGAGAATTG 


GCTAAGACGA 


CCATGGATGG 


GACGACGGCT 


TTGGGCTTGG 


4860 


TCAAGGTGGC 


AGAGGAGATT 


GGTTTTGAGA 


CGCGAGCCAT 


TAAGGCAGAT 


ATGACGCTTT 


4920 


TTGACTTGCC 


GGATTTAACT 


TTTCCTTTTG 


TTGCCCATGT 


GCTTAAGGAA 


GGGAAATTGC 


4980 


TCCACTACTA 


TGTGGTGACT 


GGGCAGGATA AGGATAGCAT 


TCATATTGCC 


GATCCAGATC 


5040 


CCGGGGTGAA 


GTTGACTAAA 


CTGCCACGTG 


AGCGTTTTGA 


GGAAGAATGG 


ACAGGAGTGA 


5100 


CTCTTTTTAT 


GGCACCTAGT 


CCAGACTATA 


AGCCTCATAA 


GGAACAAAAA 


AATGGTCTGC 


5160 


TCTCTTTTAT 


CCCTATATTA 


GTGAAGCAGC 


GTGGCTTGAT 


TGCCAATATC 


GTTTTGGCAA 


5220 


CACTCTTGGT 


AACCGTGATT 


AACATTGTGG 


GTTCTTATTA 


TCTGCAGTCT 


ATCATTGATA 


5280 


CCTATGTGCC 


AGATCAGATG 


CGTTCGACAC 


TAGGGATTAT 


TTCTATTGGG 


CTAGTCATCG 


5340 
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TCTACATCTT CCAGCAAATC TTGTCTTACG CTCAGGAGTA TCTCTTGCTT GTTTTGGGGC 5400 

AACGCTTGTC GATTGACGTG ATTTTGTCCT ATATCAAGCA TGTTTTTCAC CTCCCTATGT 5460 

CCTTCTTTGC GACACGCAGG ACAGGGGAGA TCGTGTCTCG TTTTACAGAT GCTAACAGTA 5520 

TCATCGATGC GCTGGCTTCG ACCATCCTTT CGATTTTCCT AGATGTGTCA AGGGTTGTCA 5580 

TTATTTCCCT TGTTCTATTT TCACAAAATA CCAATCTCTT TTTCATGACT TTATTGGCGC 5640 

TTCCTATCTA CACAGTGATT ATCTTTGCCT TTATGAAGCC GTTTGAAAAG ATGAATCGGG 5700 

ATACCATGGA AGCCAATGCG GTTCTGTCTT CTTCTATCAT TGAGGACATC AACGGTATTG 5760 

AGACTATCAA GTCCTTGACC AGTGAAAGTC AGCGTTACCA AAAAATTGAC AAGGAATTTG 5820 

TGGATTATCT GAAGAAATCC TTTACCTATA GTCGAGCAGA GAGTCAGCAA AAGGCTCTGA 5880 

AAAAGGTTGC CCATCTCTTG CTTAATGTCG GCATTCTCTG GATGGGGGCT GTTCTGGTCA 5940 

TGGATGGCAA GATGAGTTTG GGGCAGTTGA TTACCTATAA TACCTTGCTG GTTTACTTTA 6000 

CTAATCCTTT GGAAAATATC ATCAATCTGC AAACCAAGCT TCAGACAGCG CAGGTTGCCA 6060 

ATAACCGTCT AAATGAAGTG TATCTAGTAG CTTCTGAGTT TGAGGAGAAG AAAACAGTTG 6120 

AGGATTTGAG CTTGATGAAG GGAGATATGA CCTTCAAGCA GGTTCATTAC AAGTATGGCT 6180 

ATGGTCGAGA TGTCTTATCG GATATCAATT TAACCGTTCC CCAAGGGTCT AAGGTGGCTT 6240 

TTGTGGGGAT TTCAGGGTCA GGTAAGACGA CTTTGGCCAA GATGATGGTT AATTTTTACG 6300 

ACCCAAGTCA AGGGGAGATT AGTCTGGGTA GTGTCAATCT CAATCAGATT GATAAAAAAG 6360 

CCCTGCGCCA GTACATCAAC TATCTGTCTC AACAGCCCTA TGTCTTTAAC GGAACGATTT 6420 

TGGAGAATCT TCTTTTGGGA GCCAAGGAGG GGACGACACA GGAAGATATC TTACGGGCGG 6480 

TCGAATTGGC AGAGATTCGA GAGGATATCG AGCGCATGCC ACTGAATTAC CAGACAGAAT 6540 

TGACTTCGGA TGGGGCAGGG ATTTCAGGTG GTCAACGTCA GAGAATCGCT TTGGCGCGTG 6600 

CTCTCTTGAC AGATGCGCCG GTCTTGATTT TGGATGAGGC GACTAGCAGT TTGGATATTT 6660 

TGACAGAGAA GCGGATTGTC GATAATCTCA TTGCTTTGGA CAAGACCTTG ATTTTCATTG 6720 

CTCACCGCTT GACTATTGCT GAGCGGACAG AGAAGGTAGT TGTCTTGGAT CAGGGCAAGA 6780 

TTGTCGAAGA AGGAAAGCAT GCTGATTTGC TTGCACAGGG TGGCTTTTAC GCCCATTTGG 6840 

TCAATAGCTA GAAAGAGGAG AGGATGAAAC CAGAATTTTT AGAAAGTGCG GAGTTTTATA 6900 

ATCGTCGTTA CCATAATTTT TCCAGTAGTG TGATTGTACC CATGGCCCTT CTGCTTGTGT 6960 

TTTTACTTGG CTTTGCAACT GTTGCAGAGA AGGAGATGAG TTTGTCCACT AGAGCTACTG 7020 

TCGAACCTAG TCGTATCCTT GCAAATATCC AGTCAACTAG CAACAATCGT ATTCTTGTCA 7080 

ATCATTTGGA AGAAAATAAG CTGGTTAAGA AGGGGGATCT TTTGGTTCAA TACCAAGAAG 7140 
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GGGCAGAGGG 


TGTCCAAGCG 


GAGTCCTATG 


CCAGTCAGTT 


GGACATGCTA 


AAGGATCAAA 


7200 


AAAAGCAATT GGAGTATCTG 


CAAAAGAGCC 


TGCAAGAAGG 


GGAGAACCAC 


TTTCCAGAGG 


7260 


AGGATAAGTT 


TGGCTACCAA 


GCCACCTTTC 


GCGACTACAT 


CAGTCAAGCA 


GGCAGTCTTA 


7320 


GGGCTAGTAC 


ATCGCAACAA 


AATGAGACCA 


TCGCGTCCCA 


GAATGCAGCA 


GCTAGCCAAA 


7380 


CCCAAGCCGA 


AATCGGCAAC 


CTCATCAGTC 


AAACAGAGGC 


TAAAATTCGC 


GATTACCAGA 


7440 


CAGCTAAGTC 


AGCTATTGAA 


ACAGGTGCTT 


CCTTGGCCGG 


TCAGAATCTA 


GCCTACTCTC 


7500 


TTTACCAGTC CTACAAGTCT CAGGGCGAGG AAAATCCCCA AACTAAGGTT CAGGCAGTTG 


7560 


CACAGGTTGA 


AGCACAGATT 


TCTCAGTTAG 


AATCTAGTCT 


TGCTACTTAC 


CGTGTCCAGT 


7620 


ATGCAGGTTC 


AGGTACCCAG 


CAAGCCTATG 


CGTCAGGGTT 


AAGCAGTCAA 


TTGGAATCCC 


7680 


TTAAATCCCA 


ACACTTGGCA 


AAGGTTGGTC 


AGGAATTGAC 


CCTTCTAGCC 


CAGAAAATTT 


7740 


TGGAGGCAGA 


GTCAGGTAAG 


AAGGTACAGG 


GAAATCTTTT 


AGACAAGGGG 


AAAGTTACGG 


7800 


CGAGTGAGGA 


TGGGGTGCTT 


CATCTTAATC 


CTGAGACCAG 


TGATTCTAGC 


ATGGTTGCAG 


7860 


AAGGTGCCCT 


ACTAGCCCAA 


CTTTATCCAT 


CTTTGGAAAG 


AGAAGGGAAA 


GCCAAACTCA 


7920 


CAGCTTATCT 


AAGTTCAAAA 


TATGTAGCAA 


GAATCAAGGT 


CGGTGATTCT 


GTTCGCTATA 


7980 


CTACGACTCA 


TGATGCCGGG 


AATCAACTTT 


TCCTAGATTC 


TACTATTACA 


AGTATTGATG 


8040 


CGACAGCTAC 


TAAGACTGAG 


AAAGGGAATT 


TCTTTAAAAT 


CGAGGCGGAG 


ACTAATCTAA 


8100 


CTTCGGAGCA 


GGCTGAAAAA 


CTTAGGTACG 


GGGTGGAAGG 


CCGCTTGCAG 


ATGATTACGG 


8160 


GCAAGAAAAG 


TTACCTACGT 


TATTATTTGG 


ATCAATTTTT 


GAACAAAGAG 


TAATGTTCGT 


8220 


GTTTTTAGAG 


TTAAATAATT 


TTTAAACTGT 


GAGAAAGATT 


CTTCTTGCAG 


TTTTTTCTTT 


8280 


ACAATTTTTG 


AAAAACATCT 


ACTATTTATT 


CGGTTAAATT 


CTTGTGTTTT 


TTGGTTTTTT 


8340 


GTGGTAAAAT 


GTGCTCAAGT 


AATACGAAAG 


GCGAACTTTA 


AAATGTCAAA 


ACAATTGATC 


8400 


TATTCGGGAA 


AAGCTAAAGA 


TATCTATACA 


ACTGAGGATG 


AAAATCTTAT 


TATTTCAACT 


8460 


TACAAGGACC 


AGGCGACTGC 


TTTCAACGGT 


GTCAAGAAGG 


AGCAGATTGC 


AGGTAAGGGA 


8520 


GTCTTGAATA 


ATCAGATCTC 


ATCTTTTATT 


TTTGAGAAAT 


TAAATGTGGC 


TGGTGTGGCG 


8580 


ACTCACTTTG 


TGGAGAAACT 


TTCAGACACG 


GAACAACTCA 


ATAAAAAGGT 


TAAGATTATT 


8640 


CCTTTGGAAG 


TCGTGCTCCG 


CAACTATACT 


GCTGGTTCCT 


TTTCAAAACG 


TTTTGGTGTG 


8700 


GATGAGGGAA 


TCGCCTTGGA 


GACTCCGATT 


GTCGAATTTT 


ACTACAAAAA TGATGATTTG 


8760 


GATGATCCAT 


TTATCAATGA 


TGAGCATGTG 


AAATTCCTAC 


AGATTGCGGG 


TGACCAGCAG 


8820 


ATTGCCTACT 


TGAAGGAAGA 


AACGCGTCGT 


ATCAATGAAC 


TATTGAAAGT 


CTGGTTTGCT 


8880 
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GAGATTGGGC TTAAATTGAT TGACTTTAAG CTAGAGTTCG GTTTTGACAA GGATGGCAAG 8940 

ATTATCTTGG CAGACGAATT TTCACCAGAT AACTGCCGCT TGTGGGACGC TGATGGCAAC 9000 

CACATGGATA AGGATGTTTT CCGTAGAGGA TTGGGAGAAC TAACCGACGT TTATGAGATT 9060 

GTTTGGGAAA AGTTGCAGGA ATTGAAATAA TCTGTTTGCA ACGGAAAACC TTCGTCTCTC 9120 

AACTAAAAGG ACTCAGGCTG AAAAGGTCCC CCAGACCTTT TCACTCTGTA GAGAACTAGG 9180 

TGAACTAACA GATGTTTACG AAATTGTCTG GGAAAAGTTG CAGGGTTTAA AATAACAACC 9240 

TCAAGGCTGT TTGGGAATAT TGCAAGAGCT GAAATAAAGG AATAAGAATT GATGGATAAA 9300 

CGTATTTTTG TTGAAAAAAA GGCTGATTTT CAGGTCAAGT CAGAGAGTTT GGTTAGAGAG 9360 

CTCCAGCACA ACTTGGGACT GTCAAGCTTG AAAAGTATTC GTATTGTGCA AGTATATGAT 9420 

GTATTTGACT TGGCTGAGGA CTTGTTTGCA CCTGCAGAGA AGCACATTTT CTCTGAGCAG 9480 

GTAACCGACC ATGTTTTAGA TGAAGTATCT GTGCAGGCGG ATCTTGCTAA CTATGCTTTC 9540 

TTTGCCATTG AAAGTCTGCC AGGGCAGTTT GACCAGCGTG CAGCTTCGTC ACAGGAAGCC 9600 

TTGCTTTTGT TGGGAAGTTC GAGTGACGTG ACAGTCAACA CAGCCCAACT TTACTTGGTG 9660 

AATAAAGATA TTGATGCGAC TGAGTTGGAA GCTGTCAAAA ACTACCTGCT CAATCCAGTT 9720 

GATTCTCGTT TCAAGGATAT CACGACAGGG ATTGCCAAGC AGGAGTTTTC AGAGTCAGAC 9780 

AAGACCATTC CCAAATTGAC TTTCTTTGAA AGCTATGCAG CAGAAGACTT TGCTCGCTAC 9840 

AAGGCCGAAC AAGGGATGGC CATGGAAGTG GATGATTTGC TCTTTATCCA AGACTACTTT 9900 

AAGTCAATCG GGCGCGTGCC AACTGAGACT GAACTCAAGG TTTTGGACAC TTACTGGTCT 9960 

GACCACTGCC GTCATACGAC TTTTGAGACA GAGTTGAAAC ACATCGACTT TTCAGCTTCT 10020 

AAATTTCAAA AGCAATTGCA GTCAACCTAT GACAAGTATA TTGCCATGCG CGAGGAATTA 10080 

GGTCGGTCTG AAAAACCACA AACCTTGATG GATATGGCGA CTATTTTCGG TCGTTATGAG 10140 

CGTGCTAATG GACGATTGGA TGATATGGAA GTCTCTGACG AAATCAATGC CTGCTCAGTT 10200 

GAAATTGAAG TGGACGTTGA TGGTGTCAAG GAACCTTGGC TCCTCATGTT TAAAAACGAA 10260 

ACCCACAACC ATCCAACAGA AATTGAGCCA TTTGGTGGAG CGGCTACCTG TATTGGTGGA 10320 

GCTATTCGTG ATCCGTTGTC AGGCCGTTCC TATGTTTACC AAGCCATGCG TATTTCAGGT 10380 

GCTGGTGATA TTACAGCACC GATTTCGGAA ACTCGCGCTG GGAAATTGCC ACAACAAGTC 10440 

ATTTCTAAAA CAGCAGCTCA TGGTTATTCT TCATATGGTA ACCAGATTGG GCTTGCAACA 10500 

ACCTACGTTC GTGAATACTT CCACCCAGGC TTTGTAGCTA AACGTATGGA ACTTGGTGCC 10560 

GTTGTTGGTG CGACTCCCAA GGGCAATGTT GTCCGTGAAA AACCTGAAGC AGGTGATGTG 10620 

ATCATCCTTC TCGGAGGCAA AACAGGTCGT GATGGTGTCG GTGGTGCGAC GGGCTCTTCT 10680 
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AAGGTTCAAA CAGTTGAGTC TGTAGAGACT GCTGGTGCTG 
ATCGAAGAAC GCAAGATTCA GCGCCTCTTC CGTAATGGCA 
AAGTCCAATG ACTTTGGGGC AGGCGGTGTC TGTGTGGCTA 
CTTGAAATCG ACCTCAACAA GGTGCCTCTT AAATACCAGG 
GCCATCTCTG AATCACAAGA ACGGATGGCG GTCGTGGTTC 
TTCGTTGCCG AATGTAACAA AGAAAATATT GATGCTGTTG 
AAACCAAATC TTGTCATGCA CTGGAATGGT GAGACAATCG 
CTTGACACCA ATGGTGTGCG CGTGGTTGTC GATGCCAAAG 
CTCCCAGAAG AGCGTCAAAC ATCTGCTGAA ACACTGGAAT 
TCTGACCTCA ACCATGCAAG TCAAAAAGGA TTACAGACTA 
CGCTCAACGG TTAATCACCC ACTTGGTGGT CGTTACCAAC 
GTGCAGAAAT TGCCAGTTCA ACACGGTGTG ACTCATACTG 
TTCAACCCAT ATGTAGCTGA ATGGTCTCCA TACCACGGTG 
GCAACTGCTC GTTTGGTGGC TGCTGGTGCC AACTGGTTCA 
GAGTATTTCG AGCGTATGGA TAAACAAGCA GAGCGTTTCG 
CTAGGTTCTA TTGAAGCACA AATTCAGCTT GGCTTGCCAT 
ATGTCTGGTA CCTTTGAAGA ATTGACCGTT CCGCCAACCT 
ACGGCAGATA GCCGTAAGGT GCTCTCTCCA GAATTTAAAG 
TACATCCCAG GTCAAGCCCT CTCTGCAGAG ATTGATTTTG 
GCTCAGTTTG AAGCCATCCA AGCTGACCAT AAAGTGACAT 
GGTGGTGTAG TTGAAAGTTT GGCTCTTGCT ACCTTTGGAA 
ACCTTGCCTG AACTTGAAAC AGCTTTGACA GCTCAATTAG 
CCTGAAGAAA TTGCTGGAGT AGAGAAGGTT GGACAAACGA 
GTCAACGGTG TGAAGCTAGA TGGACACAAG CTTGACAGTG 
GAAGTTTACC CAACAGAATT TACCCAAGCG AAAGAACTAG 
TCAGATGTTG TGATTAAAGC CAAAGAAAAG GTTGAAAAAC 
TTTCCAGGAA CCAACTCAGA ATATGATTCA GCTAAGGCCT 
GTCAATTTGG TGCCATTCGT GACCTTGAAT GAAGAAGCTA 
ATGGTTGACA ATATCGACAA GACTAATATT CTCTTCTTTG 



AGGTTCAAAA AGGAAATGCC 10740 

ATGTCACTCG TCTGATCAAG 10800 

TCGGTGAATT GGCAGACGGT 10860 

GCTTGAATGG TACAGAAATT 1092 0 

GTCCTGAAGA TGTGGATGCC 10980 

TGGTGGCGAC AGTAACTGAA 11040 

TTGACTTGGA GCGTCGTTTC 11100 

TTGTGGACAA GGATGTCAAA 11160 

CAGATACCCT TACGGTTCTA 11220 

TCTTTGACTG CTCTGTTGGA 11280 

TCACACCAAC TGAGGCATCT 11340 

CGTCGGTCAT TGCTCAAGGT 114O0 

CTGCTTATGC GGTTATCGAA 114 60 

AGGCTCGTTT CTCTTACCAA 11520 

GTCAGCCAGT AGCTGCTCTT 11580 

CTATCGGTGG TAAGGACTCC 11640 

TGGCTGCCTT TGGGGTGACG 11700 

CTGTTGGGGA AAATATCTAC 117 60 

ACTTGATTAA GAAAAATTTT 11820 

CTGCATCAGC TGTCAAATAC 11880 

ACTATATTGG TGCAGAGGTG 11940 

GCGGCTTTGT CTTCACATCT 12000 

AAGCAGACTT TACACTGACT 12060 

CATTTCAAGG GACATTGGAA 12120 

AAGAAGTACC AGCTGTGGCA 12180 

CTGTGGTTTA CATCCCAGTC 12240 

TCGAAAAAGA AGGTGCAGAG 12300 

TTGTCAAGTC AGTTGAAACT 12360 

CTGGTGGATT CTCGGCTGCG 12420 
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GATGAACCAG ATGGTTCAGC TAAGTTTATC GTCAATATCC TGCTTAATGA AAAAGTGCGT 12480 

GTGGCTATTG ATAGCTTTAT CGCCCGTGGT GGTTTGATTA TCGGTATTTG TAATGGATTC 12540 

CAAGCCTTAG TCAAATCGGG TCTCCTACCC TACGGAAACT TTGAAGCTGC TAACAGTACT 12600 

AGCCCAACCC TCTTCTACAA TGATGCCAAC CAACACGTGG CCAAGATGGT GGAAACTCGC 12660 

ATTGCCAATA CCAACTCACC ATGGTTGGTT GGTGTGCAAG TGGGCGATAT CCACGCTATT 12720 

CCTGTTTCGC ACGGTGAAGG GAAGTTTGTC GTGACGGCTG AGGAATTTGC AGAGCTCCGT 12780 

GACAATGGAC AAATTTTCAG CCAATACGTT GACTTTAACG GTAAACCAAG TATGGATTCT 12840 

AAGTACAATC CGAATGGTTC TGTCCATGCC ATCGAAGGAA TTACCAGCAA GAATGGTCAA 12900 

ATCATCGGTA AGATGGGCCA CTCAGAACGT TATGAGGATG GTCTTTTCCA AAATATCCCA 12960 

GGCAATAAAG ACCAACACCT GTTCGCATCA GCGGTTAAAC ATTTCACTGG AAAATAAGAC 13020 

TTACAGATTT TCTAATAGAT AGTATCAGTA ATGTAAAAGT CATGTAAATC TAGCTCTTGA 13080 

TGATTACAAA TGAAAATTAG GTATAAAAAA TGACATACGA AGTAAAATCT CTTAATGAAG 13140 

AATGTGGTGT TTTCGGTATT TGGGGACATC CAGATGCTGC TAAGTTGACC TATTTTGGAC 13200 

TCCACAGTCT TCAACACCGT GGTCAGGAGG GGGCAGGAAT CCTCTCCAAT GATCAAGGAC 13260 

AACTGAAGCG CCATCGTGAC ATGGGGCTTT TATCAGAAGT TTTCAGAAAT CCAGCTAATT 13320 

TGGATAAATT GACAGGAGCT GGTGCGATTG GGCATGTGCG TTATGCGACT GCTGGCGAAG 13380 

CTTCTGTAGA TAACATCCAG CCCTTCCTCT TCCGTTTTCA CGATATGCAG TTTGGTTTGG 13440 

CTCATAATGG AAATCTGACC AATGCAGCCT CTCTCAAGAA AGAACTGGAA CAAAGAGGAG 13500 

CAATTTTCAG CGCGACTTCG GACTCTGAAA TCTTGGCTCA CCTCATTCGT CGCAGTCATA 13560 

ATCCTAGCCT GATGGGCAAA ATCAAGGAAG CGCTCAGCCT TGTCAAAGGT GGTTTTGCCT 13620 

ATATCTTGCT GTTTGAGGAC AAGTTGATTG CGGCTCTTGA CCCAAATGGA TTCCGACCGC 13680 

TTTCGATTGG TAAAATGGCT AATGGAGCAG TTGTTGTATC TTCTGAAACC TGTGCTTTTG 13740 

AGGTCATTGG TGCCGAGTGG ATTCGTGATT TGAAGCCAGG TGAGATTGTG ATCATTGATG 13800 

ACGAGGGCAT TCAGTATGAC AGCTATACAG ATGATACCCA GTTGGCGGTT TGTTCTATGG 13860 

AGTATATCTA CTTTGCTCGC CCTGATTCTA ATATCCACGG TGTCAATGTC CATACGGCAC 13920 

GTAAGAGAAT GGGAGCGCAA TTGGCGCGAG AATTTAAGCA TGAGGCAGAT ATTGTAGTTG 13980 

GTGTGCCCAA TTCTTCCCTA AGCGCGGCTA TGGGATTTGC GGAAGAATCA GGCTTACCAA 14040 

ATGAAATGGG TCTGATCAAA AACCAATACA CCCAGCGAAC TTTTATCCAA CCGACTCAAG 14100 

AATTGCGGGA GCAAGGAGTG CGGATGAAAC TGTCTGCTGT TTCGGGTGTT GTCAAAGGCA 14160 

AACGTGTGGT CATGGTGGAT GATTCCATTG TACGTGGAAC AACCTCTCGT CGTATCGTTC 14220 
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AGCTCTTGAA AGAAGCGGGT GCGACTGAGG TTCACGTTGC CATTGGAAGT -CCTGCACTAG 14280 

CGTATCCATG TTTCTACGGG ATTGATATCC AGACCCGTCA GGAGCTGATT GCAGCCAATC 14340 

ATACGGTCGA AGAAACTCGC CAAATCATTG GTGCGGACAG TCTGACTTAT CTTTCAATTG 14400 

ATGGCTTGAT TGAGTCGATT GGTATCGAAA CAGATGCGCC GAACGGTGGT CTCTGTGTCG 14460 

CTTACTTTGA CGGTGACTAC CCAACGCCTC TTTATGACTA CGAAGAAGAC TATCGTAGAA 14520 

GTTTGGAAGA AAAGACCAGT TTTTACAAGT AGGCGACAGA TTCTCCATTA AAGAAAAGGA 14580 

AAAAATAAAT GACAAATAAA AATGCATATG CCTCACGTCT CACTACTGAC TAAAGGCTTA 14640 

AGCATTTAGT CAGTAGACGC TTTGTCCTAT AGGATCAAAG CTAGAGCCCT GACTAGTATT 14700 

TTTAGATAAA AAGATGGTTT ATCTAAAAAT ACGTCGCAGT CTTTCTCAAA AAAAGAAAAG 14760 

GAAAAATAAA ATGGCAAATA AAAATGCGTA CGCTCAATCT GGTGTGGATG TTGAAGCGGG 14820 

TTATGAAGTT GTTGAACGGA TTAAAAAGCA CGTGGCCCGT ACGGAGCGTG CAGGTGTCAT 14880 

GGGAGCTCTT GGTGGCTTTG GTGGTATGTT TGACCTTTCC AAGACTGGGG TTAAAGAACC 14940 

CGTCTTGATT TCAGGGACTG ACGGTGTCGG AACCAAGCTC ATGTTGGCTA TCAAGTACGA 15000 

CAAGCACGAT ACCATCGGGC AGGACTGTGT GGCCATGTGT GTCAACGACA TCATTGCTGC 15060 

AGGTGCGGAA CCCCTCTATT TTCTCGACTA CGTAGCGACA GGGAAGAATG AACCAGCTAA 15120 

GCTAGAACAA GTGGTTGCTG GTGTGGCAGA AGGTTGTGTG CAGGCTGGTG CTGCCCTCAT 15180 

CGGTGGGGAA ACGGCTGAAA TGCCGGGCAT GTACGGCGAA GACGACTATG ACTTGGCTGG 15240 

TTTTGCGGTC GGTGTGGCTG AAAAATCTCA AATCATTGAC GGTTCAAAGG TGGTAGAGGG 15300 

AGATGTTCTT CTCGGACTTG CTTCAAGTGG GATTCACTCA AATGGTTACT CTTTGGTTCG 15360 

TCGTGTCTTT GCGGATTACA CAGGTGAGGA AGTCCTACCA GAATTGGAAG GCAAGAAACT 15420 

TAAGGAAGTT CTACTTGAGC CGACTCGTAT CTATGTCAAG GCTGTCTTGC CGCTCATCAA 15480 

AGAAGAGTTG GTCAACGGCA TTGCCCACAT CACAGGTGGT GGCTTTATCG AAAATGTCCC 15540 

TCGTATGTTT GCAGATGACC TAGCTGCTGA AATTGATGAA AGTAAAGTTC CAGTGCTTCC 15600 

AATTTTCAAA ACCCTTGAAA AATACGGTCA GATTAAACAC GAAGAAATGT TTGAAATCTT 15660 

CAATATGGGT GTGGGACTTA TGTTGGCGGT CAGCCCTGAA AATGTAGAGC GTGTAAAAGA 15720 

ATTGTTGGAT GAAGCAGTCT ATGAAATTGG TCGCATCGTC AAGAAAGAAA ACGAAAGTGT 15780 

C ATT AT C AAA TGAAAAAAAT AGCGGTTTTT GCCTCTGGTA ATGGCTCAAA TTTTCAGGTG 15840 

ATTGCCGAAG AATTTCCAGT GGAGTTTGTC TTTTCAGACC ATCGTGATGC CTATGTGCTT 15900 

GAGCGTGCAA AGCAGCTCGG CGTTCTGTCC TATGCTTTTG AACTCAAGGA GTTTGAGAGC 15960 
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AAGGCAGACT ACGAAGCAGC CCTTGTCGAA CTCTTGGAAG AACACCAGAT TGACTTGGTT 16020 

TGCCTAGCAG GCTACATGAA AATCGTTGGA CCAACCTTAT TGTCGGCTTA TGAAGGTCGG 16080 

ATTGTCAACA TTCATCCAGC CTACTTGCCA GAATTTCCAG GAGCTCATGG GATTGAGGAT 16140 

GCTTGGAATG CTGGCGTGGG TCAGTCTGGT GTGACCATTC ACTGGGTGGA TTCGGGTGTG 16200 

GATACAGGCC AGGTCATCAA ACAGGTTCGT GTGCCACGAC TAGCTGATGA TACCATTGAC 16260 

AGATTTGAAG CTCGCATCCA TGAAGCAGAG TACAGGCTGT ATCCGGAAGT AGTGAAGGCT 16320 

CTATTTACAG ATTGACTTTT TGATGATTCA TATGATATCT TTGATTTTAA ATTGGAGTCA 16380 

GTGTTTGTTG AAGACGGCTT CAAACGGAGG TATTTGTAAT GTTAGAATCT AAAAAAACAA 16440 

CTCGATATGT ATTTTATGTC TATCTGATGT TATTAACTTG GGGAATCTTA TTTAAGTTTG 16500 

AAACAAATCC TGAATTTATA GCATTTTTCT TAGCTCCAAG GTATATCAAT TGGATTCCAT 16560 

TTTCAGAACC ACTAATAGTC GATGGAAAAA TTGTTTTTGC TGAAATGTTA TTTAATCTGA 16620 

TTTTCTTTAT TCCATTAGGT GTTTGTTTCC CTTTGATAAA AACTAATTTA TCTAGTTTAA 16680 

GAATAGTCGG GACAGGTTTC TTGATTAGTT TATTGTTTGA GTGCTTACAG TATATTTTAG 16740 

CAATAGGTAT AACAGATATA ACGGATTTGA CTTTAAATAC GCTAGGTGTC TGTGTAGGCT 16800 

TACTGATTTA TCAAATTTTT AT AAGAGTG T TCAAATCACA GACTAGAAAA TGGATCAATA 16860 

TCTTAGGTAT GCTTAGCCTT GGTTTTGCTT ATCTTGTTTT ACTGTTACTG CATTTACTTA 16920 

GTGTTTAACT AATGATTAAA AAGGAGAATA TAATGACTAA ACGCGTCTTA ATCAGCGTCT 16980 

CAGACAAAGC GGGCATTGTT GAATTTGCCC AAGAACTCAA AAAACTTGGT TGGGAGATTA 17040 

TCTCAACAGG TGGAACTAAG GTTGCCCTTG ATAATGCTGG GGTGGATACC ATTGCTATCG 17100 

ATGATGTGAC TGGTTTCCCA GAAATGATGG ACGGTCGTGT GAAGACCCTC CACCCAAATA 17160 

TCCACGGAGG GCTTCTCGCT CGTCGTGACT TGGATAGCCA CTTGGAAGCG GCTAAGGACA 17220 

ACAAGATTGA GCTCATTGAC CTTGTGGTGG TCAACCTTTA CCCATTTAAG GAAACTATCC 17280 

TTAAACCAGA TGTGACTTAT GCTGATGCAG TTGAAAATAT CGATATTGGT GGGCCATCTA 17340 

TGCTTCGTTC AGCAGCGAAA AATCATGCCA GTGTTACAGT TGTGGTAGAT CCTGCTGACT 17400 

ACGCTGTGGT TTTGGATGAA TTGGCAGCAA ACGGCGAAAC CTCTTATGAA ACTCGCCAAC 17460 

GTTTAGCAGC CAAAGTATTT CGTCACACAG CGGCTTATGA CGCCTTGATT GCAGAATACT 17520 

TCACAGCTCA AGTGGGTGAA AGCAAGCCTG AAAAACTCAC TTTGACTTAT GACCTCAAGC 17580 

AACCAATGCG TTACGGTGAG AATCCTCAAC AAGACGCGGA CTTTTACCAG AAAGCTTTGC 17640 

CTACAGACTA CTCCATTGCT TCAGCCAAAC AGCTCAACGG GAAAGAATTG TCATTTAATA 17700 

ATATCCGTGA TGCAGATGCT GCTATCCGTA TCATCCGTGA CTTCAAAGAT AGTCCAACCG 17760 
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TTGTGGCTCT CAAACACATG AATCCATGTG GAATTGGTCA AGCTGATGAC ATCGAGACTG 17820 

CTTGGGACTA CGCTTATGAG TCTGACCCAG TGTCTATCTT TGGTGGGATT GTCGTCCTCA 17880 

ACCGTGAGGT GGATGCTGCG ACAGCTGAGA AGATGCACGG CGTTTTCCTC GAAATCATCA 17940 

TTGCACCAAG CTATACGGAT GAAGCGCTAG CCATTTTGAT CAATAAAAAG AAAAACTTGC 18000 

GTATCCTTGC CTTGCCATTT AATGCTCAAG AGGCTAGCGA AGTGGAAGCA GAATACACAG 18060 

GTGTAGTCGG TGGACTTCTC GTGCAAAATC AAGACGTGGT CAAGGAAAGC CCAGCTGACT 18120 

GGCAAGTGGT GACTAAACGT CAGCCAACTG AGACAGAAGC GACTGCTCTT GAGTTCGCTT 18180 

GGAAGGCTAT CAAGTACGTC AAATCAAATG GTATTATCGT GACCAACGAC CACATGACAC 18240 

TTGGTGTTGG TCCAGGTCAA ACCAACCGTG TGGCTTCTGT TCGCCTTGCC ATTGACCAAG 18300 

CCAAAGATCG TCTGGACGGG GCGGTCCTTG CTTCAGATGC CTTCTTCCCA TTTGCGGATA 18360 

ACGTGGAAGA AATCGCCAAA GCAGGAATTA AGGCCATCAT CCAGCCCGGT GGCTCTGTCC 18420 

GTGACCAAGA ATCCATCGAA GCAGCGGATA AATACGGCTT GACTATGGTC TTTACAGGTG 18480 

TGAGACATTT TAGACATTAA GAAGATAAAA GGGAAGAAAA CAGTTTCTTT CCTTTTTTGG 18540 

CTTAAAATAC TAACTGAAAC AAGATTAAAA CGAACTTTTT TGATATAATG TTGGTAAATA 18600 

ATTCGCAAAA GAGGTTGAGG AATGAAACTG CTTGTTGTCG GTTCTGGTGG TCGTGAGCAT 18660 

GCGATTGCTA AAAAGTTACT TGAATCAAAA GACGTGGAAA AAGTCTTTGT AGCTCCTGGG 18720 

AATGATGGGA TGACTCTGGA TGGTTTGGAA TTGGTAAATA TCTCTATTTC CGAACATTAT 18780 

AAATTGATTG ACTTCGCAAA GACCAATGAT GTTGCTTGGA CCTTTATCGG TCCAGATGAT 18840 

GCCCTTGCTG CTGGTATCGT GGATGATTTT AACCAAGCTG GACTTAAGGC CTTTGGTCCG 18900 

ACTAGGGCTG CAGCGGAGCT GGAGTGGTCC AAGGATTTCG CCAAGGAAAT CATGGTCAAA 18960 

TACGGCGTTC CGACAGCAAC ATATGGCACA TTTTCAGATT TCGAGGAAGC CAAAGCCTAT 19020 

ATCGAAAAGC ATGGTGCGCC TATCGTAGTC AAGGCGGATG GCTTGGCACT TGGGAAGGGT 19080 

GTCGTCGTTG CTGAGACGGT TGAGCAAGCG GTCGAAGCCG CTCATGAGAT GCTTTTGGAC 19140 

AATAAATTTG GTGACTCAGG TGCGCGCGTG GTTATTGAGG AATTCCTTGA AGGAGAGGAA 19200 

TTTTCACTCT TTGCCTTTGT CAATGGTGAT AAGTTCTACA TCATGCCAAC GGCTCAGGAC 19260 

CACAAACGTG CCTATGATGG CGACAAAGGG CCTAACACGG GTGGTATGGG TGCCTATGCG 19320 

CCAGTCCCAC ACTTACCACA GAGTGTAGTT GATACAGCGG TTGACACCAT TGTCAAGCCA 19380 

GTTCTAGAAG GGGTGATTAA AGAAGGTCGC CCTTATCTGG GAGTTCTTTA CGCAGGGCTT 19440 

ATCCTGACAG CTGATGGACC GAAAGTCATT GAGTTCAACG CTCGGTTCGG AGATCCAGAA 19500 
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ACTCAGATTA TCTTGCCTCG CTTGACCTCT GACTTTGCTC AAAATATCAC AGATATCCTG 19560 

GATAGCAAGG AGCCAAATAT CATGTGGACG GACAAGGGTG TGACTCTGGG TGTGGTTGTC 19620 

GCATCCAAGG GCTACCCGCT AGACTATGAA AGGGGCGTTG AGTTGCCAGC CAAGACAGAA 19680 

GGCGATGTCA TCACCTACTA TGCAGGGGCT AAGTTTGCGG AAAATAGCAG AGCACTGCTC 19740 

TCAAACGGCG GACGAGTTTA TATGCTCGTT ACCACAGCAG ATACCGTCAA AGAAGCCCAA 19800 

GCCAGCATAT ACCAAGAACT ATACCAACAA AAAATAGAAG GACTCTTCTA CCGAACAGAT 19860 

ATCGGAAGCA AGGCAATTAA GTAAAGATAT AAGAATAACG CGCCGTAGTC GCCAAACACG 19920 

ATAATGGTCG TCGTGGTGAA AAGACCAGAA CAGTGAATGT TCTGGTCAGG GGGAAACTTG 19980 

GAGACCTTAG GCTCAAAGTT TAGGAATGAA ACCGAAGGTT TGCTTCCGCC TCCATCACCT 20040 

AAGACCATTA TCAAAAAGAA AAATAAAAAT TCACAAAATA CGTTAATGAT CGTATGGTTT 20100 

GCGAGCGTTA GCGAGCTAAT ATAGAACAAT CACCGCCGTT GTGAAAGAAC GATTGGATGA 20160 

TAATCCAATC GTTCAGGGAA ATTGGAAGAC CTTGGGTTTC CAATTTAGGC ATGAGACACC 20220 

TTTGGTGGCT GCTGCCGTCC CTCACAAGCT AAGGTGATTG TTGAAAAAGA GGAAAAAGGA 20280 

GAAGAAATGA AACCAGTAAT TTCCATCATC ATGGGCTCAA AATCCGACTG GGCAACCATG 20340 

CAAAAAACAG CAGAAGTCCT AGACCGCTTC GGTGTAGCCT ACGAAAAGAA AGTTGTTTCC 20400 

GCACACCGTA CACCAGACCT CATGTTCAAA CATGCAGAAG AAGCCCGTAG TCGTGGCATC 20460 

AAGATCATCA TCGCAGGTGC TGGTGGCGCA GCGCATTTGC CAGGCATGGT AGCTGCCAAA 20520 

ACAACCCTTC CAGTCATTGG TGTGCCAGTC AAGTCTCGTG CTCTTAGTGG AGTGGATTCA 20580 

CTCTATTCTA TCGTTCAGAT GCCGGGTGGG GTGCCTGTTG CGACCATGGC TATCGGTGAA 20640 

GCTGGAGCGA CTAACGCAGC TCTCTTTGCC CTCCGTCTCC TCTCTGTAGA AGATAAGTCC 20700 

ATTGCGGATG CACTTGCCAA CTTTGCTGAA GAACAAGGAA AAATCGCAGA GGAGTCGTCA 20760 

AATGAGCTCA TCTAAAACAA TCGGAATTAT CGGTGGCGGT CAACTGGGTC AGATGATGGC 20820 

CATTTCTGCT ATCTACATGG GCCACAAGGT TATCGCGCTG GATCCTGCGG CGGATTGCCC 20880 

GGTCTCTCGT GTGGCGGAAA TCATTGTGGC ACCTTATAAC GATGTAGACG CCCTCCGTCA 20940 

GTTGGCAGAC CGTTGCGATG TCCTCACTTA TGAGTTTGAA AATGTCGACG CTGACGGTTT 21000 

GGATGCCGTT ATCAAGGATG GACAACTCCC TCAAGGAACA GATCTGCTCC GCATTTCGCA 21060 

AAATCGTATT TTTGAAAAGG ACTTTTTGTC AAACAAGGCT CAAGTCACTG TGGCACCCTA 21120 

CAAGGTCGTG ACTTCTAGCC TAGACTTGGC AGATATCGAC TTGTCGAAAA ACTATGTCCT 21180 

CAAGACTGCG ACTGGTGGCT ACGATGGTCA TGGACAAAAG GTTATTCGTT CAGAAGCAGA 21240 

CTTGGAAGCA GCCTATGCGC TAGCAGACTC AGCAGACTGC GTCTTGGAAG AATTTGTCAA 21300 
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CTTTGACCTT GAGATTTCTG TCATCGTGTC AGGAAATGGC AAGGAGGTGA CGTTTTTCCC 213 60 

AGTTCAGGAA AATATCCACC GCAACAATAT CCTGTCTAAG ACCATCGTAC CAGCCCGCAT 21420 

TTCTGAAAGT CTAGTAGACA AGGCTAAAGC TATGGCAGTG CGAATCGCAG AACAACTCAA 21480 

CTTGTCTGGA ACTCTCTGTG TGGAAATGTT TGCGACAGCT GATGACATCA TTGTCAATGA 21540 

AATCGCCCCA CGACCACATA ACTCTGGGCA CTATTCTATT GAAGCCTGTG ATTTCTCTCA 21600 

GTTTGACACC CATATTCTGG GTGTTCTCGG AGCACCATTA CCAGTCATCA AACTCCATGC 21660 

GCCAGCCGTT ATGCTTAATG TCCTCGGTCA GCATGTCGAG GCTGCTGAAA AATATGTCAC 21720 

AGAAAATCCA AGCGCCCACC TCCACATGTA TGGTAAAATA GAAGCAAAGC ATAATCGTAA 21780 

GATGGGACAT GTGACTTTGT TTAGTGATGT GCCGGATAGT GTGGAAGAGT TTGGGGAAGG 21840 

GATTGATTTT TAGGACAAGT CTATGATACA AATTATCGTT AATACATTTA TTGAAAAGTA 21900 

TAAGACTGGA GCAGTTGTTG AAGTGTTGTA TGCCAGTGCT GACCAAGATA AGGTACAAGC 21960 

TAAATATGAA GAACTAGCTG CACAATACCC CGAAAATTAT TTAGCTATCT ATAATGTACC 22020 

GCTGGATACG GATTTGAATA CACTAGATCA TTACCCGTCT GTGTTTATTG GAAAAGAGGA 22080 

GTTTGAGTAG AAATCTTGGT TTACCTAGAT AGCTTATTCC CAACAGCTTA AGAAGAAAGG 22140 

AAAAATTAAC ACATGATCAA CCGTTACTCT CGCCCTGAGA TGGCGAATAT TTGGAGTGAA 22200 

GAAAATAAAT ACCGTGCTTG GCTTGAGGTG GAAATCCTCT CTGACGAGGC ATGGGCTGAG 22260 

TTGGGGGAAA TCCCTAAGGA AGATGTGGCT TTGATTCGCA AGAAGGCGGA CTTTGACATC 22320 

GACCGTATTT TGGAAATTGA GCAGGAGACG CGCCACGATG TGGTGGCTTT CACGCGTGCG 22380 

GTTTCTGAGA CTCTTGGTGA AGAGCGCAAG TGGGTTCACT ATGGGTTAAC TTCTACTGAC 22440 

GTGGTGGATA CTGCTTATGG TTACCTCTAC AAGCAGGCCA ACGACATCAT CCGTCGTGAC 22500 

CTTGAAAACT TCACTAATAT CATCGCTGAC AAGGCCAAGG AGCACAAGTT CACCATCATG 22560 

ATGGGGCGTA CTCATGGTGT GCACGCTGAG CCGACAACCT TTGGTCTTAA ATTAGCAACT 22620 

TGGTACAGCG AAATGAAACG CAATATCGAG CGCTTCGAGC ATGCGGCTGC TGGTGTAGAA 22680 

GCTGGTAAGA TTTCTGGTGC GGTTGGGAAC TTTGCCAATA TCCCACCATT TGTAGAGGAG 22740 

TATGTCTGCG ATAAACTTGG CATCCGTGCC CAAGAAATCT CTACACAAGT CCTTCCTCGT 22800 

GACCTTCACG CTGAGTACTT TGCGGTTCTT GCCAGCATTG CGACTTCAAT CGAACGTATG 22860 

GCGACTGAGA TTCGTGGTCT ACAAAAATCT GAGCAACGCG AAGTAGAAGA GTTCTTTGCT 22920 

AAAGGGCAAA AAGGGTCTTC AGCAATGCCT CACAAACGCA ACCCAATCGG TTCTGAAAAT 22980 

ATGACTGGTC TGGCGCGTGT CATTCGTGGT CACATGATTA CGGCTTATGA AAACGTCGCT 23040 
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CTCTGGCATG AACGCGATAT TTCTCACTCA TCAGCTGAGC GTATCATCAC ACCAGATACG 23100 

ACCATTTTGA TTGACTACAT GCTCAACCGT TTTGGAAATA TCGTCAAGAA CTTGACAGTC 23160 

TTCCCAGAAA ATATGATCCG AAACATGAAC TCGACTTTTG GTCTTATCTT TAGCCAACGG 23220 

GCTATGTTGA CATTGATTGA AAAAGGCATG ACCCGTGAGC AAGCCTATGA CTTGGTGCAA 23280 

CAAAAACAGC CTACTCTTGG GACAACCAAG TAGACTTTAA ACCACTTCTT GAGGCAGATT 23340 

CAGAAGTAAC ATCACGTCTC ACACAAGAAG AAATCGATGA AATCTTCAAC CCAGTTTATT 23400 

ACACCAAACG AGTGGATGAT ATCTTTGAAC GTCTTGGACT AGGTGATTAA TTAAAAAATA 23460 

AACAGCGAGC TTCAATCTCG CTGTTTATTT TTTATCGAAA AGACTTAGTC TTCTTTTCTT 23520 

TTAGTGAGTC CATAGGCTGC TAGTGTGGAC ATGAGTCCTG CGACTACTAG TCCTGCAGAA 23580 

TCGTGAGTTC CTGTTTCAGG AAGTTTTTTC TCTGTTACCA CAGGAGCTGG ATCTTGAGGA 23640 

AGAACTTTGC TTTCCTCAGC AGGAGCAGTT GATGGAGCTG GTTGGCTTGG GATTTCTAGT 23700 

TTTGGTTTTT CTTCAGCAAT AGCGGCTTGT CCGTTTTCAT CGCCTACATG TGTTACCATA 23760 

GTTCCGACTT CGACTATTTG AGTAACGGCT TCCTGTGCTA CGACACTATT TACAAGTGTT 23820 

TTCACTTCCT TACCATCGGC AGAAGTGCTC ACAGAGTAGA AGTTGCTACG ATGTCCATTG 23880 

ACGCCCTTAG TAATGACTTG TGTTTTTCCT TTGAGTAAGA GTGGATTTTC ACAAGTCACT 23940 

GTGGTAAATG GAATTTCTTC TTCTTGGATA TCCAGTCTAG GTTTTACCTC AGTAGTTGGT 24000 

GCAAGACCAC TTTCATCACC CTTGTGAGTT ACAGGAGCGC CAACTTCAAC CACTTGGTTT 24060 

ATAACTTCTT TGGTTACCTG GCTATCAAGG ACTGTTTCTG TTGTTTTTCC ATTTTCAGTG 24120 

AGTACAGAGA TGTAATGAGT TCGTTCACCT TTGACTCCTG CTGTGATAAT ATTTTCCTGA 24180 

CCGGCTGGGA GGTTAGGATT TTCTTTCTTG ATAACTTCAA ATGGAATTTC TTCAGTTCTT 24240 

GTGATGAGTT CTGGTCTGGT TTCAACATTG GCAGCCACTT CATTTTCATC TAGGCTTCCT 24300 

GAATGAGTTA CAGCTGGTTT GAGGCCTTGA AGAGOGGCTT TTAGGTTGGC TACAAGCGTG 24360 

TCAAGCTCAG CTTGTTTATT ACGGTTGAGG TTGTAATTTA GAGCTGTTTT AGCTGCGTCA 24420 

AGGGCCTCAA GACTTTCTTT ACTATATCCT TCTAAGTTTG TAGGAATTTT AGCTAATTCT 24480 

TCGCGGAGAG CATTATAATT AGCACGAAAG TAGTCTTTGT TGTGGTCTGC AAAGGCAGTC 24540 

ATGAGTTCAA AGATTTCCTC TTCCTTGTAT TCAGCGCTTG GTCTATCTGC CCAGATTGAA 24600 

AGCATACTTC CGACTGTTGG AAGATCTACT TCAGGATATT TGGTAGAAGC TAGTTGATTG 24660 

AATGGTGTTT TTCCAGTATT CTCAATAGCT TTCTTGAGGA AACCACCACC ATCTTCTGGT 24720 

TTTTGACCAA GAATGTAGTA CCAGTCACCG TTGGTATTCA AGAATTTATA GCCTTTGCTT 24780 

GCTAGGTATT GAGGTGATGC GAGGTTATAT CCCCACCAGC CTTTAGACCA GTAAGAAATC 24840 
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AAGACATCTT TGTCAAACTG AACATCGTCC TTGTCTTCAT AGTAGAAGCC ATCGTTGAAG 24900 

GCCATTGGTT GAAGCCCTCT TTCTTTGGCC ATAGCTGCGA GGGTGTTGGC ATATTCGGCA 24960 

AATTTGCCAT AGAGTTGATA CCACTTGAGG TAGTACCAGC CTTGGGCACT AGTCGCATCG 25020 

TTGGCGTATT CGTCAGTACC AAAGTTGAAA ATCTTTGTTT TACCTGCAAA GAAGTCCATG 25080 

TATTTACCGA TGAGGGCTTT TACAAAGTTC ATCGCTTCTT CGTTTTTCAA GTCCATAGTT 25140 

GTTTTTGAAA CTTTATCAAA GTGGGCTTGA GGATTTTTAA TACCTAATTT TTCCATGGCA 25200 

ACCAGCATAG CATCCATGTG ACCTGGACTG TTAATAGCTG GGATGAGACC GATGTGCTTA 25260 

GATTTAGCGT ATTCAATTAG CTCTGTTACT TCTGCCTGTG TTAGTGCAGT ACCGTTTGGA 25320 

TCGTCGTAGT AAGCTTTAGT TCCTTCGATA ATAGCTTTTT TAACGTCATC ACTAGCATAG 25380 

GTTTTTCCGT TGGCAGTAAT GGTCATATCA TCGAGTAGAA AGCGAAGTCC GTCATTTCCT 25440 

AGAAGGAGAT GGACATCAGA ATATCCGAGC TCACTGGCCT TGTCTACGAT GCGTTTGAGC 25500 

TGGTTCAGAG TAAAGTATTT GCGTCCAGCA TCGATTGAGA TTACCTTGTT TTTGGCAAGT 25560 

TTTTCAACCT CACGTTTAGC TTCTTCTTCT TTTTGAGCTT CAGGCGTGAG GGTCAAGTTG 25620 

TTGACAGTTT CTTGAAGTTT AGCAATGGCT TGATCAATCG TATCTTGTTG GGCACGGCTA 25680 

AGGTTGCTAT CGAGAGAGCG AATAGCTTTT TCAGCTTCTT TTACGGCCGT GACGCTTTCT 25740 

GCAGTATAAC GGTTCAGGTC TTTTGGTACC TCGTTAAGTG CTTGCTCTGC AGATTCATAA 25800 

TCAGCTGCGA AGTATTCAGC GTTGGCATTT GCAAAATGAC GCATGAGTTT GAAGAGGCGT 25860 

GATGGTGAAT AACGTGCAGA TGGAGTGTCA GCCCAAGCAG CTACCATACC ACCGATGATT 25920 

GGGATATCAG CTCCTTCTGT TTTTGGTACA GAAGTGATTG GTGTGTTTTT AATACCATTG 25980 

AGCCCCTGAT CGAGATTGTA CCAGCCTTGG CCATCAGCGT TTCGTCCAAG AACGTAGTAC 26040 

CAAGCATCAT TGGTATTAAG GATTTGGTGA CCTTTTTCAG CTAGTAGTTT AGAAGAAGCG 26100 

ACATCGTAGC CTCCCCAACC ACCAGTCCAC ATAGAAACGA TGATGTCTTT GTCAAAACTA 26160 

CCAAAGCTTG TGTCGCTATT GTAGTAGATA CCGTCGTTAA AAGCCATTGG TTTGAGACCG 26220 

TGCGATTTTA CAATACGAGC GAGGTCATTG GCGTAGGCAA TAAATTTTTC ATAGCCTTTT 26280 

ACAGGGTAGC CTTCGTTTGG ATAGTATTTA TCAGCTTGAA GCACACTCCA ACCTTTAGCA 26340 

TCTGTCGCAT CATTGGCATA TTCATCAAGT CCGATGTTGA AGATTTCAGT CTTTTTCGCG 26400 

AAATAAGCAG CATACTTGTC GATAAGGGCT TTTGTAAAAG CGACAGCTTG TTCX3TTGTCA 26460 

AGATCGACAG TACGGGCTGA TTTCTTCCCA AAATAGCTAA AGTTAGGGTT TTGGATTCCC 26520 

AATTCTTTCA TGGCATTGAG AATCGCATCC ATGTGTCCAG GACTATTTAC TGTCGGAATG 26580 
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AGACCGATAC CTTTATCTTT GGCATAGTTA ATCAGATCTG TCATTTGACT TTCTGTTAAG 26640 

TGATTGCCGT TTGGATCGTT GTAATAATCA TTTGTACCTT TTTCAATGGC GCGTTTGACA 26700 

TCGTCACTGG CATAGGTCTT GCCGTTAGCT GTGATGCTCA TATCGTCCAA CATGAAACGG 26760 

AGTCCATCAT TTCCGACTAA TAGGTGTAAA TCAGTGTAGC CATAATGTTT CGCTTTATCG 26820 

ATGATTTCCT TGAGCTGTTC TGGTGAGAAA TATTTACGTC CAGCATCAAT AGAAACAATT 26880 

TTCTTTTTCG CTAGTTTTTC ATTTACAGTT GCAGCACGTT CCTTTCCTGC CTCTGTTGCC 26940 

GGTTTGTCAG CCTCTGCTTT CGCTTCATCT TTTTTAGCTG GTTTATCCTT GTCAGTCTTG 27000 

TCTGTATTTG ACTCTTTAGA ATCAACCTCT TTCGCTTCTT CCTTTTTAGG GCTAGCTTCT 27060 

TCTGCCTTTT TATTAGCAGT TTCTTTTTCA GCAGAAGTTG GAGTTACCAC TTCTGCTTTA 27120 

TCACTAGGAG TTGAACTAAC TTCCTCTTGT GGTTTTTCTT CTGTTTTTGG AAGACTAGCT 27180 

ACCTTATCAG TAGCTGGAGT TTCTGTTTCT ACAGTTTTTG GAGCTTCTGG TTGAAGCACT 27240 

GCTTTAGGTG TTTCCTCAGT CCGATTTTCG GATGATTGAG GGGAATCAGA AACCGTATGG 27300 

ATGGTCGGTT GGTTTTCTGT AGTAGTAGGA GTAACTCCAT CGGCTGCAAC AGTCTGTGCT 27360 

TGGAAGGCAA ATCCAATTAG AACAGAAGCT GCTCCTACAG CGTATTTACG AATAGAAAAA 27420 

CGCTGTTGTT TTTCATGTTT CATTGCAAAA CCTCCTGATT GCATTGTTAT ATTGATAGCG 27480 

ATTATATAAA TCAACGCCTT TATTTTATTT CTTATATTAA TTTCTTATAT TAACGAGAGT 27540 

CAAGAGGAGA TGACAAAAAA CTATAATAAG TATAAAAAAA TATAAAATTT AAACTTAAGA 27600 

TTTCAGATTG GTCGGAAAAA ATACGTATAT ATATCTAGTA TAATTTTTGG TTCTATTTCT 27660 

ATAAAATATT CCACAAATTA TAGAATTTTC CAAAAATAGG TAAGCGCTAC CTTTTTGGTG 27720 

TAGTATAATA AGCATAGAAA AAGCCCAAGC GATTAGCTCA GGTTTTCTTC TTAGTGATCA 27780 

CGGTCACATG AGATAAATTT AATCTTGTAG TAATCAGATC GTTTGTAAGT TTCACTGTAT 27840 

TCTAAAACTT GGCCAGTTGA TTCGAGTTTG GTGATTTTAG TTTGTAGGAC AGTAGGGAAT 27900 

TGTTCATCGA CTCCGAGGAC TGAAGCTGCA TGTTCTGGAG TTGGAAAGAC TATTTCGTTG 27960 

ATTTCTTCAA AGTGTTCATC ATTCATGTGA ATGTGGTAGT CTAACTTGAA ACGATTATAG 28020 

ATAGAACTAT AGTATTCAAG GTTTGGATAA TTTGCGTTGA TATATTGTTC TGGGATGTAG 28080 

GATGTATGGT AGATATAAAC GACACCGTTT GATTCGCGGA TACGTTCAAT CTTGTAGTAG 28140 

AATTGATCGC CGCGTAGACC CAATTTTTCC AAGTAAACAA GCTTGTTTCC GCGTTCAATT 28200 

GAAAGAACAG TTACCTTATC ATCTTTAGCA TTGAAGAGTT CAATATCTGA AAACTCTACA 28260 

AGCTTGTGTT TGCGTGCACG TGAAACGAAG GTTCCTTTTC CTTGTTGGCG GACAATATAG . 28320 

CCATCTTTGG CAAGGTCGTT TAAGGCGCGA ACAACTGTGA TAGAGCTGAC ATCGTACATT 28380 
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GAAATGAGTT CTGCTTCAGT GTAAAATTTA TCTCCACTGC TAAACTGCCC AGAGATGATT 28440 



TTATTTTTTA ATTCGTCTTT TATGTATTGA TGG 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6749 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



28473 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

CCTGATGGGT GGTATGCGAG GATACAGTTC TGAAAATCGC CGTTACTTAA TTAATGGACG 60 

CGAAGTCACA CCTGAGGAAT TTGCTCACTA TCGTGCGACT GGTCAATTAC CAGGAAATGC 120 

AGAAACTGAT GTGCAAATGC CACAACAGGC ATCAGGTATG AAACAAGGCG GTGTCCTTGC 180 

AAAACTAGGT CGAAACTTAA CAGCAGAAGC GCGTGAGGGC AAGTTGGATC CTGTTATCGG 240 

ACGAAACAAG GAAATTCAAG AAACATCTGA AATCCTCTCA CGCCGCACCA AGAACAATCC 300 

TGTTTTGGTC GGAGATGCAG GTGTTGGTAA GACAGCAGTT GTCGAAGGTC TAGCGCAAGC 360 

CATTGTGAAC GGAGATGTTC CTGCTGCTAT CAAGAACAAG GAAATTATTT CTATTGATAT 420 

CTCAGGTCTT GAGGCTGGTA CTCAATACCG TGGTAGCTTT GAAGAAAATG TCCAAAACTT 480 

AGTCAATGAA GTGAAAGAAG CAGGGAATAT TATCCTCTTC TTTGATGAAA TTCACCAAAT 540 

TCTTGGTGCT GGTAGCACTG GTGGAGACAG TGGTTCTAAA GGACTTGCGG ATATTCTCAA 600 

GCCAGCTCTC TCTCGTGGAG AATTGACAGT GATTGGGGCA ACAACTCAAG ACGAATACCG 660 

TAACACCATC TTGAAGAATG CTGCTCTTGC TCGTCGTTTC AACGAAGTGA AGGTCAATGC 720 

TCCTTCGGCA GAGAATACTT TTAAAATTCT TCAAGGAATT CGTGACCTCT ATCAACAACA 780 

CCACAATGTC ATCTTGCCAG ACGAAGTCTT GAAAGCAGCG GTGGATTATT CTGTTCAATA 840 

CATTCCTCAA CGTAGCTTGC CAGATAAGGC TATTGACCTT GTCGATGTAA CGGCTGCTCA 900 

CTTGGCGGCT CAACATCCAG TAACAGATGT GCATGCTGTT GAACGAGAAA TCGAAACGGA 960 

AAAAGACAAG CAAGAAAAAG CAGTTGAAGC AGAAGATTTT GAAGCAGCTC TAAACTATAA 1020 

AACACGCATT GCAGAATTGG AAAGGAAAAT CGAAAACCAC ACAGAAGATA TGAAAGTGAC 1080 

TGCAAGTGTC AACGATGTGG CTGAATCTGT GGAACGAATG ACAGGTATCC CAGTATCGCA 1140 

AATGGAAGCT TCAGATATCG AACGTTTGAA AGATATGGCT CATCGCTTGC AAGACAAGGT 1200 

GATTGGTCAA GATAAGGCCG TAGAAGTTGT AGCTCGTGCT ATCCGTCGTA ACCGTGCTGG 1260 



WO 98/18931 



PCT/US97/19588 



678 

TTTTGATGAA GGAAATCGCC CAATCGGCAA CTTCCTCTTT GTAGGGTCTA CTGGGGTTGG 1320 

TAAGACGGAG CTTGCTAAGC AATTGGCACT CGATATGTTT GGAACCCAGG ATGCGATTAT 1380 

CCGTTTAGAT ATGTCTGAAT ACAGTGACCG CACAGCTGTT TCTAAGCTAA TTGGTACAAC 1440 

AGCAGGCTAT GTGGGTTATG ATGACAATAG CAATACCTTA ACAGAACGTG TTCGTCGCAA 1500 

TCCATACTCT ATCATTCTCT TGGATGAAAT TGAAAAGGCT GACCCTCAAG TTATTACCCT 1560 

TCTCCTCCAA GTTCTAGATG ATGGTCGTTT GACAGATGGT CAAGGAAATA CAGTAAACTT 1620 

CAAGAACACT GTCATTATTG CGACCTCAAA TGCTGGATTT GGCTATGAAG CCAACTTGAC 1680 

AGAAGATGCG GATAAACCAG AATTGATGGA CCGTTTGAAA CCCTTCTTCC GTCCAGAATT 1740 

CCTCAACCGC TTTAATGCAG TCATCGAGTT CTCACACTTG ACTAAGGAAG ACCTTTCTAA 1800 

GATTGTAGAT TTGATGTTGG CTGAAGTTAA CCAAACCTTG GCTAAGAAAG ACATTGACTT 1860 

GGTAGTCAGT CAAGCGGCTA AAGATTATAT CACAGAAGAA GGTTACGACG AAGTCATGGG 1920 

GGTTCGTCCT CTCCGTCGCG TGGTTGAACA AGAAATTCGT GATAAGGTGA CAGACTTCCA 1980 

CTTGGATCAT TTAGATGCTA AACATCTGGA AGCAGATATG GAAGATGGCG TTTTGGTTAT 2040 

TCGTGAGAAA GTCTAAGACA GAATTTTGAG GATAAAAAAG AAGGAGCCAG CTGAAAAAAA 2100 

CTGGTTCCTT TTTAGGTACG ACAGGCATGT CGTATAGTAG AAGTGTATTA TTCTAGTTTC 2160 
AATATACTAT AGTAGCTCAG AAGTCGGTAC TTAAACGTGC TATATCAAAA CCAGTCCTGG \ 2220 

AAAAACGTGG ACTGGTTTCG TGTTTGGATT ATTACCTTGA ACGACATGCG TTAAAAGTTA 2280 

GTTGAACCGC CGTATGCCGA ATGGTACGTA CGGTGGTGTG AGAGGGGCTA GAGATTATCC 2340 

CCTACTCGAT TTTAAATCAC ATGACGTTCA AAGGCATCAT CTGAAATCCC TTGTTCCAAG 2400 

ATGAGTTTTG CCCATTCTTT AGCAGAGAAG AGGCTGTGGT CCTTGTAGTT TCCGCAAGAT 2460 

TCGATGGTTG TCCCTGGGAC ATCTTCCCAA GTAGTAGTTT CAGCGATTTC CTTGAGCGAA 2520 

TCCTTGATAA CAGCTGCGAT TTTAGCACTG GTGTGACGTC CCCACATAAT CATGTGGAAG 2580 

CCTGTGCGGC AACCAAATGG TGAACAGTCA ATCATGCCGT CAATGCGGGT ACGGATGAGT 2640 

TTGGCTAAGA GGTGCTCGAT AGTGTGAAGG CCGGCAGTAG GGATAGAGTC TTCGTTTGGT 2700 

TGCACCAAGC GAATATCATA ATTGGAGATG ATGTCTCCTT TTGGTCCTGT TTCTTCCCCA 2760 

ATCAAGCGAA CATAGGGTGC TTTGACAATG GTGTGGTCAA GTTCAAAACT TTCGACAATA 2820 

ACTTCTTTTG ACATGGTAAA TCCTTTCAGT TTTCTTCTCT CATTATATCA TAAAGGTTGC 2880 

TCCTGAGACA GAGAGAAAAC CTCTCCGAGG CTGGAGAGGT TGAAATCTTT ACTTACGATA 2940 

TAAGCGGTCG TATTGGTAGT ATGGGTCAAA GGTTACGTTG ATACCCAGTT TACGAAGGAC 3000 

ATTCTTGTCT TCATCAGTCA AGATGATGGT TGAGTGGGCT TCGCTTCCTT TGAGGTTGCC 3060 
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GAGTTCTTCC ATAGCGCGGG CAGCATCAGG ATTTTCTGTA GCTGTGATAG CAAGTGCAAT 3120 

CAGGATTTCA TTTGAATGAA GGCGTGGATT GCGGCTACCG AGATGATCGA TTTTAAGACC 3180 

TTGGATTGGC TTAACAACTT CAGGCTCGAT TAGTTTTACT TCTTTAGCGA TGTCAGCTGA 3240 
TTTTTTGATG GCGTTGATCA AGGCAGCGGG TGTAGGACCA AAGAGTTCTG AGTTCTTACC v 3300 

AGTGATGATT TCCCCATTTG GCAATTCAAA GGCTAGGGCT GGTCCACCAG TTTCTTCTGC 3360 

TTTTTGGCGC GCAACGACAG CAACCTTACG GTCTGCAGGT GTGATACCGA GGTCGTTCAT 3420 

GAGCAACTCA ATTTTCTTGA CGGCAGCTTC GCCAACTTTT TCAGCTTTGA AGTCAAGAAC 3480 

TGTTTGATAG TAACGGCGGA TGATTTCTTG TTTAGAAGCT TCGACAGCGG CCTCGTCATC 3540 

TGTAATAGCG AAACCAACCA TGTTGACACC CATATCTGTC GGTGAAGCGT ATGGTGATTT 3600 

TCCGAGAATA CGTTCCAACA TGCGTTTGAG CACTGGGAAG ATTTCGATAT CACGGTTGTA 3660 

GTTGACAGTG GTTTCTCCAT AGGTTTGAAG ATGGAAGGGG TCAATCATGT TGACATCATC 3720 

AAGGTCAGCT GTGGCAGCTT CATAAGCCAA GTTAACTGGA TGATGAAGGG GAAGATTCCA 3780 

AACAGGGAAG GTTTCAAATT TAGCGTAGCC AGATTTGATG CCATTGATTT GGTCGTGGTA 3840 

CATATTGGAC ATACACGTTG CCAATTTTCC AGAACCAGGT CCAGGAGOGG TTACGACAAT 3900 

CAAGTTGCGA CTGGTTTTGA TGTAGTCGTT TTTGCCCATG CCTTCTGGGG AAATGATGTG 3960 

ATCCATATCC GTCGGATATC CTTTGATTGG ATAATGAAGA TAAGAATCAA TTCCGTTTTT 4020 

CTCAAGTTGA TTGCGGAAGG CATCTGCAGC GGGTTGGCCA GCGTATTGTG TAATGACAAC 4080 

GGAACCAACA AAAATCCCTA ATTCATTGAA TTTATCAATC AAACGAAGAA CTTCTTGGTC 4140 

ATAAGAAATG CCTAAGTCGC CACGTGCTTT GGAATGTTCA ATGTTGCTAG CATTAATGGC 4200 

AATCACAACC TCAACCTGCT CTTTCAATTC TTGCAAGAGC TTGATTTTGT TGTCAGGTTC 4260 

ATAACCAGGA AGGACACGAG CAGCGTGGAA ATCTTCTAAC ATTTTACCGC CAAACTCTAA 4320 

GTAGAGCTTG CCGTCAAATT GGTTAATGCG CTCCAAAATA TGGTCGCGTT GTAAATTCAA 4380 

ATATTGTTCA GAACTAAAAG CTTGTTTTTT CATTTTTTTA CCTCTGGACT CTATTATAAT 4440 

AAAAAATTGG AAGTTAGGAA ACTACGGAGC TAAAAAAGAA ATTAAAAAGA TTAAGCAAAC 4500 

GCTTGCACAA AATTTTAAAA AGTGCTATCA TAGACTATAG ATTATGAAAA TAATGAGGTA 4560 

AACAGATGCA AGAAAAATGG TGGCACAATG CCGTAGTCTA TCAAGTCTAT CCAAAGAGTT 4620 

TTATGGATAG TAATGGAGAT GGAGTTGGTG ATTTGCCAGG TATTACCAGT AAGTTGGACT 4 680 

ATCTAGCTAA GCTAGGAATC ACAGCAATTT GGCTTTCTCC CGTTTATGAC AGCCCTATGG 4740 

ATGATAATGG CTATGATATT GCTGATTATC AAGCGATTGC GGCTATTTTT GGAACCATGG 4800 
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AGGACATGGA TCAGCTGATT GCAGAAGCTA AGAAGCGTGA CATTCGTATC ATCATGGACT 4860 

TGGTGGTCAA TCATACCTCA GATGAACATG CTTGGTTTGT CGAAGCCTGT GAAAATACTG 4920 

ACAGCCCTGA GCGAGACTAC TATATCTGGC GCGATGAACC CAATGACCTA GATTCTATCT 4980 

TTAGTGGGTC TGCTTGGGAA TACGATGAAA AGTCAGGTCA ATACTATCTC CACTTTTTCA 5040 

GCAAGAAACA GCCGGATCTC AACTGGGAAA ATGAAAAACT TCGCCAGAAA ATTTATGAGA 5100 

TGATGAACTT CTGGATTGAT AAAGGTATTG GTGGTTTCCG TATGGATGTT ATTGACATGA 5160 

TTGGCAAAAT TCCTGACGAG AAGGTAGTCA ATAATGGTCC TATGCTCCAT CCCTATCTCA 5220 

AGGAAATGAA TCAGGCGACC TTTGGAGATA AGGATCTCTT GACAGTAGGG GAGACTTGGG 5280 

GAGCAACTCC AGAGATTGCC AAGTTCTACT CTGATCCAAA GGGGCAAGAA TTGTCTATGG 5340 

TCTTCCAGTT TGAACATATC GGTCTTCAGT ATCAGGAAGG TCAGCCTAAA TGGCACTATC 5400 

AAAAAGAGCT GAATATCGCT AAGTTAAAAG AAATCTTCAA CAAATGGCAG ACAGAGTTAG 5460 

GAGTTGAGGA CGGCTGGAAT TCCCTCTTCT GGAACAACCA TGACCTCCCT CGTATTGTCT 5520 

CAATCTGGGG AAATGACCAA GAATACCGCG AAAAATCTGC CAAAGCCTTT GCAATCTTAC 5580 

TTCATCTCAT GAGAGGAACT CCTTATATCT ACCAAGGTGA GGAGATTGGG ATGACCAACT S640 

ATCCGTTTGA AACACTGGAT CAAGTAGAAG ATATTGAATC TCTCAACTAT GCGCGTGAGG 5700 

CTCTTGAAAA AGGTGTTCCG ATTGAAGAAA TCATGGACAG TATCCGTGTT ATTGGACGTG 5760 

ACAATGCCCG TACCCCTATG CAATGGGACG AGAGCAAAAA -CGCTGGTTTC TCAACAGGTC 5820 

AACCTTGGTT GGCGGTTAAT CCAAATTACG AGATGATCAA TGTCCAAGAA GCGCTGGCAA 5880 

ATCCAGATTC TATTTTCTAT ACCTATCAGA AACTGGTCCA AATTCGCAAG GAGAATAGCT 5940 

GGCTAGTTCG AGCTGACTTT GAATTGCTTG ATACGGCTGA TAAGGTCTTT GCTTATATAC 6000 

GTAAGGATGG CGACCGTCGC TTCCTAGTTG TGGCTAACTT GTCCAATGAA GAGCAAGACT 6060 

TGACAGTAGA AGGAAAAGTC AAATCTGTCT TGATTGAAAA CACTGCGGCT AAAGAAGTAC 6120 

TTGAAAAACA GGTCTTGGCT CCATGGGATG CTTTCTGTGT GGAATTACTA TAAATATTTT 6180 

TTGCAGAAAA ATTTAAAATT GAAATCGTAT AAAAACAAGG GAGGACTGTA TAAAAGACAG 6240 

AAATCCTTTG TTTTTTATAA CCAAAGTTTA TAAACTTTCA TTCTTGAAAT TCAATTAACT 6300 

TTACAAATTC CCACTATTAA GGAGAAAGAA GATGAACATA AAGAAGCGTG TCCTTAGTGC 6360 

AGGCCTGACT TTTGCATCTG CTTTGCTTTT ACCCAAATCA TTCATACCTC TCTCAACTAG 6420 

ATGTAACTTA CAAAACCCCT GACCTCATGA GCCACTTTCT TCCTCCTCAT GAGGTCAGTT 6480 

TTACTTTCTG CTGTTCCAGT ATCGTTTTTC CTCGCTAGAT TTCCTCAAAA GGGCAGACTC 6540 

CTCCCTTGGT GCGTCACACG ATTTTTTCAT CTCGACTGTT CTTTAATGCA TCATTAACGA 6600 
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CGCTTTTCTT CTAGGTGGTT CATAAGGAAC AGGAAGATTC AGGTTGACTT TTCTAATCCT 6660 

AGAATAAAGT GCTGAAAACA ATTCGGAATA GGCATAGAGA CTAGACAATT TGAGGAGCTG 6720 

CTTGCGTCCT GTTCGAACAC ATTTTCCGG 6749 



(2) INFORMATION FOR SEQ ID NO: 85: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1842 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 



TCTACCCATG 


GACTTTGAGG 


CATTCATTGT 


TCCATCTTCT 


AGTGGCGAAT 


CTTTTGATAC 


60 


AAACGATTCA 


ATTCACTTGG 


ATAGTGAAAC 


TCTCCCGCAA 


ACATTTTTCT 


GGTTAACTCA 


120 


ATCCAGCTGA 


TATTTCTTTC 


AGCCAAAATA 


ATGGACAAGT 


TCTCCCAAAA 


TCGTTCAGCC 


180 


ATATTGCTTC 


TCCTTTAGTT 


AGATAAATAA 


TGTGTTTGCG 


CCATGTAAAT 


CAATTGTTTC 


240 


GTATCTCTTG 


GCAATAGAGC 


TCTAGCCTCT 


TCCAAATTCA 


GACTTGGATA 


AACTCGCTTA 


300 


TTTGAAACCG 


CAAGAGGAAG 


TCTGATGGTT 


AGTTCAGGAT 


TTTTTAAAAT 


TATCTCAACG 


360 


AAATCCGTTA 


ATCTTAGATT 


GTCACGGTTC 


TTAAATCGTA 


ATAAATTGGG 


AGATAAAAAC 


420 


TCAAAACAAT 


CTGAAGAATA 


GCTCATCATC 


TCAATTAATT 


TGTCCTTTGT 


CATTTCAGAA 


480 


ACTGAATGAC 


AAGATACCTC 


TATGCCATAG 


TTTTGGAAGA 


AATCTAAAAG 


AAGTTGATTT 


540 


CTTTGTCTAT 


TTTTACTTAG 


ATAGAGATCA 


ATCATGGGAG 


ACCTCCCAAA 


GATTCGGTTC 


600 


CATTTGATAT 


TCTGACACGA 


TTAAGGAATC 


TAATAAATTA 


AGGAATCTAA 


TAAATTTGCG 


660 


AAGTTAATCG 


GTTTCTTGTC 


TTCATCATAA 


GCTTTTACAG 


TTACTTGGGT 


TGTAAGTATT 


720 


CCCTCTTTTC 


CCTCGGCTCG 


ATAGCCTTGT 


CCATATAAAA 


CAAAAACGAG 


ATTTTGATGA 


780 


TCATCTACAA 


AGGCATCAAC 


CCCATTCTTT 


ATGTCTTGAC 


TTTCAAGGAA 


TTCCATAACG 


840 


TTTTGAAGAT 


AGGATTCGTA 


AAATAGTGGG 


TAGTTATGTT 


TTTTATGGTA 


ATCATCTAAA 


900 


AATGTCACTT 


CAAACTCACA 


TGGAGAGTAA 


TTTTGACTTT 


GAACAGCCTA 


AAAGTGCCAT 


960 


CAAATTTGAA 


TTGGAATAAA 


TCAAATAAAT 


AGCCCCATCC 


TCATCAATCC 


AACCTTTGCT 


1020 


CAAAGACAAC 


TCCAACCGAT 


CTTTTAAAAC 


TGAGTAAACC 


ACCTTAACCT 


CCAGTTTCAT 


1080 


ATTCTTATAC 


CGTTCACTCT 


CAAATAAAAG 


TTTGGGGAGC 


TTATAATAAC 


GCTCTGATGT 


1140 


CTGATATTGA 


TTAGCGGTAA 


TACGCTTCAT 


TATTGTCCCT 


CCAAGACTAA 


AATTCCAACA 


1200 
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TTTCCAAATT 


CATCAAATCG 


GATTAAACCT 


ACTTGTTCCA 


TTTCATCAAC 


TAACTGAGTT 


1260 


GCTTTTACCC 


AAATCATTCA 


TACCTCTCTC 


AACTAGATGT 


AACTTACAAA 


ACCCCTGACC 


1320 


TCATGAGCCA 


CTTTCTTCCT 


CCTCATGAGG 


TCAGTTTTAC 


A A X V- i u\f X VJ 1 




1380 


TTTTTCCTCG 


CTAGATTTCC 


TCAAAAGGGC 


AGACTCCTCC 


CTTGGTGCGT 


CACACGATTT 


1440 


TTTCATCTCG 


ACTGTTCTTT 


AATGCATCAT 


TAACGACGCT 


TTTCTTCTAG 


GTGGTTCATA 


1500 


AGGAACAGGA 


AGATTCAGGT 


TGACTTTTCT 


AATCCTAGAA 


TAAAGTGCTG 


AAAACAATTC 


1560 


GGAATAGGCA 


TAGAGACTAG 


ACAATTTGAG 


GAGCTGCTTG 


CGTCCTGTTC 


GAACACATTT 


1620 


TCCCACCACG 


TGAAGAAAAA 


GATGGCGGAA 


GCGTTTGATT 


GTTAAAGTTT 


GGAAGTCACC 


1680 


TCCAGCTAGA 


TGTTTGAGAA 


AAAGATAGAG 


ATTGTAGGCG 


ATACAGCTCA 


TCATCATACG 


1740 


AACTTCGTTT 


TTGATTAAGG 


TTGAACTATC 


CGTTTTATCG 


CCAAAAAATC 


CCTCCTTCAT 


1800 


CTCCTTGATG 


AAATTCTCGG 


CTTGACCACG 


TCCACGATAA 


AG 




1842 



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19390 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

TCATCTTTAT CTCCTCGAAA TTTTCTAATA TAGCCATTAT AACAGAATTT TGTGAAAATT 60 

CCTATTATAG TAAATCACTA TTTCAGTATA AAAAGAAAAA ACGAATCAGA CGATTCGCTC 120 

TTCTTAAAAT CTGAAAATAG CTTTCCAGAA AGGATTAGCC GATTTTTTGC AGATTGAGCA 180 

CTGCATCGTG ACTCATCAAG ACTTGACCAT ACTCTTGTAA GACTGAGCGA CTGATATCAC 240 

TATCGTCTGC AAACTCGCGC ATACGGGCCA ACAGCCAAGC TGGATATGGG CTTGGATGAT 300 

TTTCAATATC CACTAAAATG GTCAAATAAT AGCGCTCGTT CATTTTGTAG AGTTCAGAAG 360 

TTTCCATTTC AAAAGTCACT GTCTTGGCAA AAGCTACCAA GTCAGCCAAC TTAGCAAAAG 420 

AAAGGATGTA GTAGATGTAA GGTTCTTTCT TACTCTCAGC TTCTTGTTCA GCCTGCTCTT 480 

GCTCTTCTTC CTTGACTTCA ACTTGCTCAA GAGATTGAAT GGCTTCGATA TCATCCTTGG 540 

TTTTGTCTGC GATGCTTTTT TCCAGGGTTT TGATAAATTC ATCTGGAGAC ATTTGAGCCA 600 

ATTCTTCCAT ATCTGGCAAA TCCGATAAGT CTTCAAAATC TAGATTTTGG TCAATCTTTG 660 

ACTTGGTCAC AAAGACATCT ACCTTATCAG GTTTTGGAGT CACACGGAAG CTCAACATGC 720 

CTGTATCCAG AAAGCTATCA GGCATCTCTA GCTCATCCAA GATAGCATAA AAGAACTCTT 780 
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CTGTTTTTTC TTGAGGAACG AGAAAGTCAG CAATCTCCAT TCCACGATCC ATCAAATCCT 840 

CTAAAGATAT CGTGATTTTT AAAGTTGTAT CACTAATTTG TTTCATTTTC ATTGCTAGTA 900 

ACCTCATACT TTCAGTTCTA TCTATTATAC TAGATTTTTA CGATTTTATC AAAAGAAGGC 960 

TCCTCTATAC GGATAGATTT TCCCTAGGGT CTTTCTATAG GAGACTCCAA AAGAAAATTT 1020 

CTGCAGACAG ATAGAAAAAG CCTTCAAAAT CGGCTAAGAG CCGACTTTGA AGACCTTATA 1080 

CATCAGAATA CTTATAATTT AAAGGTTGCT ACACCGAGGA TAGAACGATT TAAGTTTCTG 1140 

AGAATTTGAA GACTTTGCTC AAATTTCTTA TAACGAGTCA CTCCGTACTC TTCAACAAGA 1200 

AGGACTGTAT CTCTTTCCAA AAGAGATGAT ACATCCTGTA AATCTACAAA ATGCATTCCT 1260 

TTTAAAGCTT CTTGACTCTG TTTCAATTTA TCTAAGATAG CTTTATTTGA GCTAACGATG 1320 

GTCAATTCCT GTCCAGTATT TTTGTATGAC AAAACATCTG CTAGGTTAGC AATTGTTGTA 1380 

ATCTCTGTTA CAAAATCAAT TTGATACTGA GAAAAATCAC CTACTCTATT GATTGTTGGA 1440 

TTAAAGAGAT AAACTAACAC ATTTCCCATC ACAACCAAAA TCACACAAAC CACTCCAATA 1500 

ACAACTAAAC GAAGAATCAG ATTTTTCACA TTTAAGCCAA GCGCTGTTTC ACCATTTGCG 1560 

TTCAATTCTT TAGAGTTGAT GGTTTCCAGT TTTTCAATTT TCACATTTGC ATAGGCATGT 1620 

TTAAATTTCT CAATCAACCC ATCAATTTTT TTCTCTAACA AGTTATTGGC ATCTTTACTT 1680 

GATGTCAAAA TTTTCACACC AACCCCTGCA TCGTCAATCA TATAGTAGAC GGTCAATTTT 1740 

TTCCACCAAT AGTCATTCGT TGAATTTTTC AAGGTTGTTT CTGTCGTGTC TAATTCACTG 1800 

GCAATTTTTT TCAACTCACT GGGTTCTACA TCATTGAAAA GATAAGCTCC ATTCAAATTA 1860 

CCATCAATCA ATTTCCCATA AAAATCACTA TAACCACCAA TTTGATGATT CAAAATCGTT 1920 

TTGTCCGACT CTTTTGGAGG AGTGATTTTA TAGATAAGAT AAGTTGAATA ACTTGTTGTA 1980 

TCTTTGACAG TGTTTTTATT CCTAACTGCT TTAATTGTAA ATGGTACAGC AATGAGAGCA 2040 

AATAAAGCGA TGAGAGCTAA AATATTTGCT TTTCGCTTTT TATAAAGATT TGCAAACAAA 2100 

TCAGCTACTG AATAATGTTC AAACATGATT TTTTTCTCCT TTGTTTAGTA GATACTAGTT 2160 

TTCCTTTGTA AGCATTTTTG CTACAAATAT AATCACAAGA ACAATTCCCC AGAATTGCAT 2220 

TGTAAATAAA TTGAAGAAAC TTTCTGAAAA GCTGCTTCTT GGCATAAAGA ATAGATTATT 2280 

CAAGATGAGT AGGGATAAAG CAAATAGGAT TGTCCTTGAG CGATAGGCTA CTTGCAGCAT 2340 

GGCTATAAAT AATACGCCGA GTAAGAAACT AAGCAGAAAG ACTCCAATCA TACCATAGTC 2400 

GGTATACAAC TCCATGATAT AACTACTTCC GATACCATGC CCTTTCAAGT ATTCCTTGTT 2460 

CAAGACAAGA TAGGATAGAT TGTGGGCATA ACTATTACTA TCAATAGCTA GTTCCACACT 2520 
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ATTGGTTGTA TGTTCAAAGG CTTTTCCTCC GAAAATGGCT CCCAAACTCC CCCTTGCAAA 2580 

ATAATCAAGA ACAGGACCAA AAGTAAAATT ACGGAAATCT CGGTAAGGGA GGCTACTGTT 2640 

AAATAGAAAA CCTCGAGCCA GAACACCAAA ACTAGTCCCT TGTTTATAGA TAAAGTCAAG 2700 

TAAGATATCC CAGAAACCTG TATGGGAAAC TTGGACATTA TCCCGTACAT AATTGAGTAC 2760 

TCCCATCGCT AACATGAGAA TAGGAGAACC TACAAAAATC GCTAACTTTT CTTTAAACCC 2820 

AATCCATTTT CCTTTTTCAG TTTGCTCCCG CATAAAGTAA TAAACAAAAG CAAATiaAAAT 2880 

ACTTAAAATA AAGGGATTTC GTGTCCCAAT TGCCAAATGA ATAGTATTAG CTGCAATAAA 2940 

GGAGACAAGC ACTGCTGTGG CCTGCAATTT CTTTGGCTTG GTTGCCAGAT ACATACACAT 3000 

TGCATAGACC GTAAAGGTAG ACAAAATGTA GGTAAAATAA GGCAGTTTAC TTTCAAAATT 3060 

TGCATAGTAG GCATAGTAGG AAGTCTGCAA ACGATACAAG AGCCGTTCAA ATAACCGAAT 3120 

GAAATAGAAA GGATAAGTTA GAAGAAAAAC TCCTAGTGAT ACAAAGCGTA ACCGCTTGAT 3180 

ATAAACCTCT TTTAGAGAAT TTCCTATATT TGCTACTTTT ATTTTCTTCC TAGCTATGAA 3240 

GTAACGAGCC AGAATGCCTC CTGTGGTCAA GCCCAGAATC GAAATCATGA CAACTATAAA 3300 

GGCAAAACGA TAGGCTATTG GATGATAGGT ATCCAAAGCA CCATCCCTAA AATAATCAAT 3360 

GGTCGGTCTT GATACCAGAA ATACAAAAAT GGTTAAATAG AAAATAAAAT GGATTAAGTA 3420 

ATACTTGATA TCATTCCAAC AAGCAATTAA GCTACTAACC AACAAGAACA ATAAAGTAGA 3480 

AAGTAAGCTA ACATTATTAT TATTAAACAG ATACACAATT CCACTTACTA GCGTCAAGGC 3540 

ATAACTGACT ATGGTCAAAC TAAATAATAA TCGTTTCCCA TCAATCACTT GGTCACCCCC 3600 

GTTCTAATGT AATTTTTTAG ATTTTTCAAT ATTTTTCAGT AATAAGAATC GATATAAGGA 3 660 

AATATTTATG AATAGGGCCA AAGCACTAAT TCTTCTCCCC TTACGGAAAA TTGGATTCCT 3720 

AGAAATAGCA AAGGCATGGC CTTTTAAAAA ACGATGAATC TGAGAATAGG CTTCAAACTG 3780 

TTTATACTGA TCATCTAGCA ACATCTTATC CAGAATAAAG AAGTGGGCAT AGGCCAATCT 3840 

GAAAAAAGCG ACCTCTTTCA AGTCAGGATA GTTTTTCACA ACTTCATTAT AAAACTTTTG 3 900 

GTAGATATCA ATATAGGCTA AATCCTTCTC TGCATAGGGT TTGGTCGTAA TACTATCCCC 3960 

TCTATGGAAA TAGTAATAAT AGGGTTTAGT ATTAACCACA TACTTCTTGG CCAACTTGAT 4020 

TAAATCAAAA TGGTAATAGG CATCTTCGTA AATCAACCCC TTAGGAAAGG ATAGGGCAGT 4080 

TGCAATCTGT CTCTTGATTA GCTTATTGCA AATCGTCCCA GGTATTTTTT CACCTATGAG 4140 

GTATTCCTTT AGAAATGTTT GAGAATCACA GACAAAATAG TCATCCTGAT TGGCTGACTG 4200 

TGGGCTTTCA TCATTAGCAT AGACATTCAT GACACCACAG CTCGAAACAT CCGCATCTTC 4260 

TTGAACTAAT TGCTCATATA AGCTCTGAAT CATTTCTGGA TGGATATAAT CATCTGAGTC 4320 
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AATAAAAATC AGATAATCCC CGTGAGCCTG CTTCATCCCA TCATTTCGTG CTTGCGACAA 4380 

TCCTTCGTTC TTTTTATGAA GCACTGACAC CCTGTCATCT TGTTCAGCGA TTGAATCACA 4440 

CAAGCGACCA CTTTCATCTG TTGCACCATC ATCAACAAGA ATAATTTCCA GATTTTGATA 4500 

GGTCTGCTTC TGAATGGAAG CTATCGATTT TTCTAGGTAC TGCGCCACAT TATAGACTGG 4560 

CACAATCACA CTAATTAATG CAGTTTCCAT GCTACTCCTC TAATAGTTTT TCTACTTGTT 4620 

CGATTTGTTT TGTAATTGTA AATTGTTGAA TGAATTGGCT AGCCTCATCG ACATCAAAGT 4680 

TTGAGGCAGA AGTCATGTAA TTAGTAATCG CCTGAGCTGC CTCTTGATTG CTCTCAATGA 4740 

TTTGTCCAAA TCGTCCTTCT TGGGATAATT CCTCAGCCCC TCCAACGTCC GTAGAGATAA 4800 

AAGGGAGTCC CAGACTCAAG GCCTCCACAT ACACTCCAGG AAAACCTTCT TGTTTAGACA 4860 

TAGACAAAAG AACTTTCGTC TGAGATAGAT ACTGATAAGG ATTTTTTTGA TAACCAAGGA 4920 

AATGTACATA GTCCTCAATC CCATACTCTT TGACTCGTTT TTTCAGTTCC TCTTCCATAT 4980 

CACCAGCCCC GATAAAATAG AGATGATAGT TTTTTCCCTC TTGGTGTAAT AATCGTATCA 5040 

CTTCCACTAC ACGGTCAGAA CCCTTATTTT CCTCAATCCG TCCGATAGTA CAGATACTTT 5100 

GAGGAGCAAT CTCGATATCG ATCTTCTCTT GAGATTTTTC TAGAATAGTC TGAAAATCAT 5160 

ATCCATTGTA GATTGTCTGT AATTTAGAAG TATAATCTGG ATAAACTTCC TTGATAGAAT 5220 

TGCTGGTCTT TTTTGAAATC CCTACAATTG TATTCGCAGC ATCCAACTGG CTTCTATGTG 5280 

ATTCTCTTTT AGAGCTATCC TTAAGAAGTT CTTCAATACT TCCATGAATC CAAGATATCT 5340 

TCTTGACTTC TCTTCTTTTA GAGAACAACA GTGGTGGATT CATAATGGTA AAAGAAACTT 54O0 

CAACATCATA ATCATCTTTT ACAAGCAAAC GACGAGTCAG TCTTGGAAAA TAAATTCTCA 5460 

TTCTCCACAA AAAAGCTCGT AACCATCTGG TTTGGCGATA ATCTTGAAGG GATTTTAAAA 5520 

TGCGTACATG CTTTGGAACA GATTCATATC CCTTGTCAAA GTGCTCCATT TCAAGAATAT 5580 

CAATATCATA CTTTTCTGGA TCCAGATTTG AAACAATGGT TGATAGAATC TTCTCTGCAC 5640 

CACCTCCAAG AGAAAAAGAC CACATAAAAA ATAAGATTTT TTTCTTAGCC ACCATATTCT 57 00 

CCCTTGTATT CTGTATAAGA CTTATCCATA TCAGCGATGA CAGCATCATG ATGCGGTACC 5760 

TGCTTGTCTG CTGGTGGAGG CGTCATATAA TCCCCAAAAG CAGTTCTGAG ATAGACATCA 5820 

TAGCCGATTG GAATAGGCAT CTCTGTTCCT TCAAATGGCA AGAAAAGATT GTCTTCAAAA 5880 

GATGTGATTG GGTACTTGTT TCTCATGTAG CCAGGACCTG AGCATAATTC TGTAATGCCA 5940 

TCACAATCAG CCAAATCATA CTTAGTCATT TCTTTCTCAG CTTTTTTCCA GATGCGATAA 6000 

CGGAGAGATT TTGGAGTCAA ACCCAGTAAA ATGCGACTTC CCCATTTCAT GAGATCACCA 6060 
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TGCTTTTCTG GAATAGTTTG CGCACAAAAG AGTGAATAAA TCAAGGCCCA ACGAACCTGT 6120 

TTTTTCCGCT CAGCTGGATT TTTCGGATAA TAATCCAAAG GCAAAACATC CAAGGCCAGA 6180 

CCATGTGGCA AATCCAAATC CTGCTGATAA GGCTTGATAC AGGTGGTTTT CTTGTCACGA 6240 

ATGGTAATAA AAAGATTACG ATCAACAAAA TCCTTGTGAC TCTTTGACAA GAAATAACGT 6300 

TCATCTGCAT AACGAGGCCA TAATTCTGCT AATTTCTCAT AATCTTTACG AGGCATAAAA 6360 

AAGTCTAGGT CGTCGTCCCA AGGAATAAAT CCCTTGTTTC GAAGGGCACC AATAGCGCCT 6420 

CCGCCACAGA GATAACAGAG CAAATCATGT TCTTTACAAA AGGCCACAAA ATATTCAGCC 6480 

ATCTCCAGAC TACGAGCCTG AATTGCTTTT AAATCAGTCA TATTGTTCAT TATTCTTTCT 6540 

ATCGTATCGT TTCATTATAC CACAAACAAG GGGTGAAAAT CTATTGCAGA CTGTAAAAAA 6600 

TCAAAGCCTG ACTGCTATCC AAATAGCTAT CAAACTTTGA TTTTTCTGTC TTATACTCTT 6660 

CGAAAATCTC TTCAAACCAC GTCAGCTTCA CCTTGCCGTA GGTATAGGTA ACTGACTTCG 6720 

TCAGTCTTAT CTACAACCTC AAAACTGTGT TTTTAGCAGC CTGCGGCTAG CTTCCTAGTT 6780 

TGCACTTTGA TTTTCATTGA GTATTATCTT ATCTTAAGCC CATTTGAGCG AGCTTGGTTT 6840 

GATATTTGTT TTGATCAACC AGCAGGCCCA AGCCCCCATA AACATCATAG GCATCTACCC 6900 

AGTCACCCAG TTCTGGAATC GTCAATTTTT CAATACCATT TTTTGCTCCA TCCAAAACAG 6960 

ATAAACCGTT TGTTAGGAGG AAAGTATAGG GTACGTTGGT TGAGGTCATA GCAAAAACCT 7020 

TTCCAAGAGC TTCAGAACCA GTGAAAAGTT TAGTGGGATC TTTAATTTGC TCTAAAATTG 7080 

CTGTTAAAAC TTGTTGCTGT CTTTTTGTAC GGCCGTAATC TGCCTCATCA TCATCACGGA 7140 

AACGAGCATA ATTGAGCAGG GTCGAGCCAT TCATCTGCTG TTTTCCGACT TTAATGGTTT 7200 

GGGTTGGAGA CTCAGTCTCG GTAGCGTATA AATCATCTCC GACTGTAGCT TCTGTTAGGG 7260 

GACGCCCATT CAATGTTGAA AATTGAGCAT CAATCGTCAC CCCATCAGGG AAAAGCGTGT 7320 

CAATCGCTGT GGCAAAGGCC TGGAAATCAA CCAAGGCGTA GTACTTAATG TCCAAGTCAA 7380 

AATTATCTTT CAAGACTTGG CGAACCATTT CTGCCCCTTT TTGCCCCTCT TGTTCTCCTA 7440 

ACTCGTAGGC TACGTTTAAC TTGTTATCTG TCTGTTTTCT ACCATTAATC ACTTGACTAT 7500 

AACCATCTAT ATAGACCAAA TTATCACGCA TGAAACTGAC TAGCTTCATT TTCTTATCTG 7560 

AGCCCCCGAC ATTTAATACC ATAATAGAGT CAGTTCGTGT CTCAACACTG TTCTGGCCGA 7620 

TTCGACCATC AGTACCCATG ATTAAAATAT TAACTCCATC TCTAGTGTCC TGACCATTAA 7680 

AGACTTCTAC TTGAGCTGCC CGGGCATCAG CAGTTTTCTT TGCGCTAGCA TCTTGGTAAC 7740 

CACGCAAAAA CATGAATACC ATGGCCAAAG CCACACAGAC CAAAAGTGAA AAAATCACCA 7800 

TAAAAATTCG TTTAAGACGG AGCTTCCGTC TTTTCTTTTT TGGAGGGAAA GAGAGTGCTT 7860 
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GTGATTTGGA TTGTGAGCGA CTCCGGTTCG CATAGCTTGG TAAGTCAACC TGCTCTTCTC 7920 

TTTCTTGTTC CAAGCTAGAG CTACTATTTC CCCTAGCAAG AGTTAGCTTT TCTTGCAAAT 7980 

AGGCAAACTC ATTTTTTTCT CTCTCATTGA GATAGTGAAT ATTTTTTAGC AAATAATCAT 8040 

AACGCAACTG CTCATGATGA CTTAAGGGAT TTTCTTTACT CATCTTCTCT CCTTTCCATG 8100 

GTCTGATATT GGATAAATAG GATAGGCACC CAGAATTTTA TACTGGATTC CAATCGCTTC 8160 

TAATTCTTTT TGGGCAAAGT GGACCAAGTC CTTATCGGTA TAATCCACAT CGATAATGAA 8220 

AAAGTATTCA CCCAGTGCTG TCTTGAGTGG ACAACTTTCA ATTTTTGTCA AGTCAATTCC 8280 

TCGCCAAGCA AAGGTCGACA GGGCCTTATA AAGTGCACCT GGAAGGTTGT CAGGTAATGT 8340 

CAAGGCCAAA CTCATCTTTT CAGTTTGTGC TTGCAAGGGA ATACTAGGCT TTTCAGCTCC 8400 

TAGAACCCAG AAACGTGTGA AATTGGCTTC CATTTCCTGA ATATCCTCGG CAATCAGTTC 8460 

CAATCCATAT TCTTCAGCAG AACTTCTAGG TGCAACTGCT GCAAAGGGCT GGTCTGGATG 8520 

TTCGGAAATA AAACGGGCCG CATAAGCTGT ACTAGCTGTT ACCTCGATTT GAGCCTCTGG 8580 

ATATTGTTCA TCGATGAATT TCTTTCCTTG AGCCAAGGCC TGTGGATGTG AAAAAATCTT 8640 

TTCAATCTTA GTATGGCCTG GAACCACCAT CAACTGCTGA TGAATAGGCT GAACGATTTC 8700 

TGCTACTGCT TGGATGTGAG CCTGATGAAA AAGATAGTCC AAGGTTTCAT GAACACTACC 8760 

CTCAATAGAA TTTTCAACTG GCACCACAGA ATAGTTCACT AATCCTTGCT CATAAGCCTT 8820 

GATGACATCT GTAATGTTGG CAAAAGCCTG CAATTCCTCA TGAGGAAAAG CTGTCTGCAC 8880 

AACGTGGTGT GAAAATGATC CCTTGGGACC TAGATAAGCA ATTTTCATCT TAGTTCCTCT 8940 

ATAATTTCCT CTGGGCTTAG CTTGGTCACA TCCAAAACCC GACTAGCCAC TTCCTCATAC 9000 

CAAGCCTGTC TTTCTTGGAA AATAGCTACT AGTTCTTCCT TGCTATTATT TAGAAAAAGC 9060 

GGTCGCTGAT TGTCCTTATC AGCTGCGATA CGTTGGTAGA GGGTTTCAAA ATCTGCTCTC 9120 

AGGTAGATGT TATCTGTATT AGTCTTGAGT AAGTCACGAT TTCTCTGAGA AATAACCACT 9180 

CCTCCTCCAG TTGACACGAC TTGGTCTGTT TGTAGTAAAT CAGCTAGGAC TTCTGATTCT 9240 

ACCTGACGAA AGGCTGTTTC TCCCTTTTCA GCGAAAAAAT TCGCAATGGA CATACCTAGG 9300 

CGATTCTCAA TCAGAGCATC CATATCAAGG TAATTAGGGT CCAAGCCTCT TGCAATAGTC 9360 

GATTTTCCAG CCCCCATAAA CCCTAATAAC ACCTTAGCCA TGAATCAAGC TCTCCAAATC 9420 

ATCAAAGAAA CTAGGATAGC TGGTATTGAT GGCTTCTGCA CGGTCAAGCT CCACCTCTCC 9480 

ATCTGCAACC AAGAGGGCTG CGATAGCTGT CATCATGCCG ATACGGTGGT CACCAAACGT 9540 

ATTGACTCTA GCACCGTGAA GAGCTGATTT TCCTTTGATA ATCATCCCAT CTGCCGTAGG 9600 
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AGTAATATCT GCTCCCATAC TATTTAAGGC GTCTGCCACA ACCTGAATAC GGTCTGTTTC 9660 

CTTGACCTTG AGCTCCTCAG CATCCTTGAT AACTGTTACA CCTTGGGCTT GGGTCGCAAG 9720 

CAGGGCAATA ATGGGCAATT CATCAATCAA TCGTGGAATC AAAGCGCCAC CAATCTCTGT 9780 

TCCTTTCAAG TCAGAAGACT CAACAATCAA GGTAGCAGAT TTAGCGACTG GATCGATTTC 9840 

AGTTATTTCC AATTTTCCAC CCATGGCACG AATGACATCA ATAATAGCGG TGCGAGTTTC 9900 

GTTGATCCCC ACATTCTGCA GCACTAGACG AGAATTTGGA GCAATCAAAC CTGCGACTAA 9960 

CCAAAAGGCT GCACTGGAAA TATCTCCTGG TACGACCACC TTCTGTCCTG TCAATTTTTG 10020 

TGGCCCCTGG ACTGTGATTT TCTTACCATC CACACTTAAA TGACCACCAA ATTGTTTCAA 10080 

CATATCTTCA GTATGATTAC GGGTGTACTC TTTTTCGATA ATAACTGACT CCCCCTTAGC 10140 

TTGTAAGGCT GCAAACATCA AGGCTGACTT GACTTGGGCA GAGGCAATTG GCAACTCATA 10200 

ATGAATAGGT CTTAGGTTTT TCGTCCCTTT TAAGCGAAGG GGAGGCAAGT CTCGTTCAGT 10260 

TTGCCCTGAA ATGCTGACGC CCATTTTTTT CAGTGGAAGG GTCACACGGT CCATAGGACG 10320 

TTTGGAAAGA CTATCATCTC CAAACATCTC TACTTCGAAA TCTGCACCAG CAAGGACACC 10380 

TGAAATCAGG CGAATCGAGG TGCCAGAATT TCCCATATTA AGGGCATTTT GTGGCGCTTT 10440 

TAAGCCAGCC ATGCCTACAC CTTGAATGGT AATAACCCCA TCTTTATCCT CAATTTCAAC 10500 

ACCAAGGTCA CGAAAAACCT GCATGGTCGA AAGAACGTCT TCACCTCGCA GAATATCATA 10560 

AACCTTGGTC TCACCCTCAG CCAAACTTCC AAAGATAATG GAACGGTGGC TGATAGACTT 10620 

GTCACCTGGG ACGCGGATAC TACCATGTAA ATGGCGAATG TTTGTTTTTA GTTTCATACT 10680 

GGACCTCATA CTTGCAATAC TTTTACCTAT TTTATCATAA AAAGCCAGAA ATTCCTTAAA 10740 

AATTCCTGAC TTTAGGATCG TTCTTTTCTT ATTTCAGCAA TTCTGAAACT GGTTCAAAAA 10800 

CAATTTTTTC AATATCAGAA AGGTAAATGG CCAATTGTTG TTGCTTGGTA AAGAATTCTG 10860 

ACAAGAGGCT ATTTCCTTGA ATCTGTTTAC CAAAGCCTTC CATCTTAGCT TGGAAGGACG 10920 

CATCTGGCAT TTGACCTGTC TGTGCTAGTT TTTGAATTTC CTCTTGAAAG GCAAGATAAT 10980 

CTGTAAAGAT TTTGCTTGCC TCAGCATCTG CTGCAATCGC ATCTTTAGCT GCTTTAACAG 11040 

CCTTGTATTC TGGTAATCCG CGTAGACCGC GACTGAGTTC GTTTGCACTA TCGTAAATAT 11100 

TTGACATGTT CTTCTCCTTA TTTGATGACG ACTGTATAGT CAGTATTTTC TGTTATGAGA 11160 

TGCTCAGCTC TTTCCAAGTC TTGAGCATTT TTAAATGAAA TTTGTAGGAT TCCGTGAATA 11220 

TCCTCACGAT TTTCCTCGTT GATGTGGATA TTAACCAAGG AAGTTCCACG TAGCAGTTCC 11280 

AAAATCCGCA GGATGACATC TTCTTCATCA GGAACGTCAA CATAGAGGTC GTAAGAGCTA 11340 

TCCACACCAC CACGCTTATG GATTTCCATG GTCTGGCGTT GTTCACGCGC TTGGTTAAAA 11400 
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AAGTTCCAAA TTTGCTCTTC ATCTCCCTTA CTAATGGCCT GACCAATCGC TTCCAAACGT 11460 

TCCTTGAAAT CCTCAATTCT ATCCAGAATG ATCTCGCTAT TGGACAAGAG AATGGAGGTC 11520 

CACATTCCTG GCTCGCTTTC CGCAATTCGG GTCATATCTC GAAAACCACC TGCCGCAAAG 11580 

CGCCTTGCCA TCTCATGCTC TTGAGCATAG ACCGCAGTCT GCTCCATGAG ACTAGAAGCC 11640 

AAAATATGAG GAAAATGGCT AATCTGAGAA GTGACACGAT CATGCTCCTT GGCATCAATC 11700 

TCGATAAAAC GAGCATGAAG ACCTGAAAGC AGATCCTTCA TTTCCTTAAG CGTGTCCTGA 11760 

CTTGTCAGGC TTGAAGGTGT AAAGATATAA TAGGCATTTT CAAAAAGATT GACATCTGCC 11820 

GAAGCAGCCC CTGTCTTGTG ACTACCAGCC ATGGGATGGG CCCCGACAAA GCGAACAGAC 11880 

TTGCCAGCCA AATACTGCTC CGCCGCATCC ACAATGGTTG ACTTGGTCGA ACCAGCATCT 11940 

GAAATAATAA CGCCTTCTCG CAAATCCAAA TTGGCCAACT CCTTAATGAA AGCAATAGTT 12000 

TGTTTGATTG GCAAGCTGAG GATAATGACA TCTGCCAAAG GAGCAAAACT AGCAAAATCA 12060 

TCCGTTGCAC GGTCAATCAT ACCTTCTTTC AAGGCGATAT CTCTCGAAGC TTGACTACGA 12120 

TTATAACCTA AAATTTCATA ATCTGGATGA TCGCGTTTGA TACCAAGTGC CATAGAGGCT 12180 

CCAATCAACC CAAGACCTGC GATATAGATT GTTTTTGCCA TAGGAACTCC TTAATAGTTC 12240 

TTTGTATAGT CTCGGTGTTT GGCTACCGCT TCTTTTAGTT CCTCAAGATT ATCTGATGAG 12300 

AATTTTTCGA GGATTTCTTG CGCCAGAACC GTTGCTACAA CTGCTTCCAT GACCATTCCT 12360 

GCAGCTGGAA GAGCAGTCGG ATCACTTCTC TCCACGGTTG CCTTGTAAGG TTCGTGGGTT 12420 

TCGATATCCA CACTCATAAG AGGTTTATAA AGAGTAGGAA TGGGTTTCAT GACCCCACGA 12480 

ACAACGATGG GTTGCCCATT AGTCATACCA CCTTCAAAAC CACCTAGATT ATTGGTACGG 12540 

CGAGTATAAC CGTCTTCTTT AGACCAGAGA ATTTCATCCA TAACTTGGCT GCCTTTACGA 12600 

TAACCAGCCT CAAAGCCAAG ACCAAATTCC ACCCCTTTAA AGGCATTGAT AGAGACAACA 12660 

GCTTGAGCCA ATCTTGCATC CAATTTTCTA TCCCATTGGA CATAGGAACC AAGACCAACT 12720 

GGAACGCCTC CGACGACTGT CTCCACAACC CCACCGATGG TATCACCATC ACGTTTGATT 12780 

TGGTCAATAT AGTCCTTGAT TTCCTGTTCT CGTTCTTGGT TGACAATAGA AACTTCAGAC 12840 

TGGGCAGCTC TTTGCTTAAT TTCAGCGACT GTCAGATTTT CAGGAACATC GATTTCCTTG 12900 

CCACCAAAGA CCACGACATG GTTGGCAATC TCCATATCCA GCTCAGCCAA GAGGCGTTTG 12960 

GCTACTGCAC CAACTGCCAC CCGCATGGTG GTTTCACGAG CTGATGAACG CTCCAAAGAA 13020 

TTTCGCAAAT CATCAAAACG GTACTTAATC CCCCCAACCA AATCGGCATG ACCTGGGCGA 13080 

GGATGAGTAA TTTTCCGCTT GCTTTTAAGG CGGTCTTCAA TGTCCTCCGC AGACATGATG 13140 
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TCCAGCCATT TCTGGTGGTC CTTATTGATG ACATCCATAG TAATAGGCGC CCCTGTCGTC 13200 

TTCCCGTGGC GAACGCCCGA AGTAAAGACA ACCTGGTCAT TCTCAATCTT CATACGACCA 13260 

CCACGACCGT AGCCACCCTG ACGGCGTCTA AGGTCCTCAT TGATATCCTC AGCTGTCAAT 13320 

GGAAGTCCAG CTGGAATTCC CTCAATAATA GCTGTTAGAC GGGGGCCGTG TGATTCTCCT 13380 

GCAGTTAAAT ATCTCATACA CTCTCCTTAT TTTACCAAGT AGTCTTTCAT CTCTTCCAGA 13440 

GAAACTGGGT GAATGGTCGC TGAACCAAGC TCTGGCACCA AGACCAATTT CAAGGTGTTA 13500 

CCACGCGCTT TCTTGTCATG AGTAAGAGCC TGATAAAGCT TGCCAACTTC CCAATTTTCA 13560 

TAGTCAACAG GCAAACCGAA TTTCTGACAC ATCTCTGTGA TAGATTGGGT AATGCCAGCT 13620 

GGCATGAGGC CTTTTTCCTC AGCAACCTTG GAAATCTGTA CCATTCCCAT GGCAACAGCC 13680 

TCTCCATGCA TGACCTTGCC ATAACCGGCA GTCGCTTCGA TGGCATGGCC AATAGTGTGG 13740 

CCAAAATTGA GGTAAAGACG AATACCATTG TCCAACTCAT CTTCAACCAC CATCTTGCGC 13800 

TTCACCTGAC AAGAATGTTC AATCAAGGTC TCTGCATGTT CCAAAATACT CTCAACAGAA 13860 

CCATTCAGTC CCGTCAAGAG AGCCCACAGT TCTGGATCCT CAATCAAGCC ATACTTGATA 13920 

ACTTCACCCA TCCCTTCAAT CAACTCTCTT TTTCCGAGGG TTTCAAGAAC AAGTGGATCA 13980 

ATCAGAACCC CATCTGGTTG GGCAAAGGTC CCCACCATAT TTTTAGCAAA TGGTGTATTA 14040 

ACGCCTGTCT TTCCACCGAT AGAAGAATCA ACCTGAGCTG TCAAACTAGT CGGAATCTGA 14100 

ACAAAGTGAA TACCCCGCAT ATAGGTAGAG GCTACAAATC CAGCCAGGTC CCCAACAACG 14160 

CCACCACCAA GAGCAACGAT TCCATCGCTA CGAGTCAGAC CTTGCTTGAC TAGAAATTCA 14220 

TAGACTTTCT GAACAGTAGT TAAATTCTTT CTTTCTTCAC CTTCTAAGAA ATCAAAAACA 14280 

GCTACCTGAA AACCAGCATC TTCTAGGCTG AGCTTGACCT TCTCTGCATA GAGAGAGGCT 14340 

ACATGGTTAT CTGTCACAAT GACTACCTTT TGCGGTTGCC AGAGTTCTCG CAACCACTGA 14400 

CCAGCCTGGG CCATACAACC TTTTTCAATC TGAATATCAT AAGGATGGTG AGGAATATCG 14460 

ATTCTGATTT TCATAGGAGA GTCTCCCTTT CTTTATTGGT ATTTTTCTGT TAAAGACTGC 14520 

CAAATCTCTT CTGTCGGCAT TTCCTTGCCT GTCCACAGTT GAAAAGCTTC TGCAGCTTGA 14580 

TAGAGTAACA TTCCCAGACC ATTGACTGCT GGATTGCCCT GACTTCTAGC CCATTTCAAA 14640 

AACGGTGTTT CAAAGGGTTG GTATATGATA TCTGCAACTA AAAGAGTTTC TGGTAAGACT 14700 

ATGTTTTCAG GAACAGGAGA GGATTGGCCA TCCATGCCCA CACTGGTGGC ATTAACTAGC 14760 

AAATCCGACT CGGCAATCCT TGCTTGCAGT TCAGAAACAT ATTCTAAAGC ACACAAATCC 14820 

ACTTTAAAAC CTGTCTGCTC CTGTAACTTG TCTAGGTAAG GTCTTGTTTT TTCCATAGAA 14880 

ACGGAACGAA CAAAGACCGA AATCTGACTG ACGCCATCCA AAATAGCCTG TGCCAAGATT 14940 
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GATTTAGCCG CACCACCTGC ACCCAGCAGG GTCATCTTTT TACCTGAAAT TGTAAAAGAA 15000 

GGCAAGCACT TAAAAAATCC CTTGCCATCT GTATTATATC CAATTAAATT GCCATTCTCA 15060 

TTGACAACCG TATTAACCGC ACCAATCAAG CGCGCTTCAT CGCTCAGCTT ATCCAAATAA 15120 

GGAATCACCT GCTCCTTATA GGGCATGGAC AGATTGATGC CAAACATCTG GTAGCGACGA 15180 

ATATTGGCCA CTGTTTCTAC CAAGTCACTC GCTTCAATCT CCCAAGCCAC ATAAGCACCG 15240 

TTGGTAGCTG TCGCCTCAAA GGCTCTATTG TGGATGAAGG GAGAAATAGA ATGCTTAATA 15300 

GGATTGGCAA CAACTGCAGC TAAACGTGTA TAGCCATCAA GCTTCATCCA AAATCTCCCT 15360 

GATTTTTTTC ATGCTAGCTA GAGAAATCTG CCCAGGGGCA CTAACCTCAT CCAGACTGGC 15420 

AAAAGACCAA CTCGAACCAG TCACATCCGC AGTGATACGA GAGACCTTGC CCACCTTACC 15480 

CATAGAAATG GTCACATATT CCTGTTCAGG ATTGAGGGTT TTAAAGCCTC GTGTATAGTT 15540 

CATC AAGTCT AAGACATCCT GCTCCGTGTG AGCCATCACC GCAACCTTAA CAAGTTTTGG 15600 

ATTTAGGATC GTCAACTCTG ACAAGATTTC CATCATGTTC TCAGGTGTTT CTTGGAAATT 15660 

ATGGTAACTC AAAACAAGAT TTGGGAAGTC CAGCATTTCC TCAAAAACAT CCTTGTAGCT 15720 

ATAGTACTCA AAATCAATAT AGTCTGGTTG ATAGAGTTGC GCAACTTCCT TGATTAGATG 15780 

GATATACTCT TCTGGAGAAA GGTCGATTTC TCCACCTTCG GAGCGAGTTC GTAGCGTGAA 15840 

AACCAACTCA CGGCCTGCGA ATTTTTCAAA AATGGCTGGA GCTACCTGCA AAATCGCTTC 15900 

TTTAGGCAGA TAGTCGGCAC GCCATTCAAT GATGTCGGCA TCCAGGTACC TCGTGGCATC 15960 

CAGAGCCTGA GCCTCCTCTA AACTTCTTGG CATTACTGAA ACGATTAATT TCATTTACTA 16020 

ACCTTCATAC TAATCACCTT GAGGTAATTA CTACTTTCAT CTTTTTTATT ATAGGCAAAA 16080 

TCTGCTGGAA GACCATATTT GTTTAAAATC TGGTAACTTC TTCCTGCAAA ACCTTTATCA 16140 

ATTTGTTCTG TAAATTTCTG ACGGGAAACA TTGGCAGCAT TGGTACTGGC AATGATAATC 16200 

CCTCCCGGAT TTAAAATCTC AAGACTCTGG GAAATCAACT TGTGATAATC CTTGGCCACA 16260 

GAGAAAGTTT GTTTTTTATT CCGAGCAAAG CTAGGCGGAT CTAGGACAAT CACATCGTAG 16320 

GTCAAGTCTT TGCGTTTGGC ATATTTGAAA TACTCAAAGA CATCCATGAC TATAAAACGA 16380 

TGCTCGTCTG TGCTGAGCCC ATTTGCCTGA AAATGCGCTT GAGACAATTC TCGTGAACGT 16440 

TTGGCTAGAT CAACAGAAGT TGTATGGCTA GCTCCTCCCA TGGCCGCAGC TACTGAAAAA 16500 

GCCGCTGTGT AGGAAAACAT ATTGAGTAAG GATTTACCCA TAGCCAAGCC GTCAACTAAA 16560 

CTACCGCGAA CCTCATGCTG GTCTAGGAAA ATTCCTGTCA TCAAGCCATC ATTCATAAAG 16620 

ACTTGATACA GGACACCATT TTCTAAAACA TTGAAAAAGT CAGGTGCTTC TTGACCATAA 16680 
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ACATGGGCAG ATTCATAGTC CAAACCCTTA AAGCGGATTT TCTCATAAGC TCCTAAAACC 16740 

TCAGGGAAAA CCTGTCTAAA GGCTTCTGAT ATAGTCTGAC GAATCTGATA AACATAAGAG 16800 

TTATACCAAG AAAAGACGGC GTAGTCGCCA TAAAGGTCCA CTGTCAGACC CCCAAAGCCA 16860 

TCTCCCTCTT GATTAAAGAG ACGAAAGGCA GTTGTCAAAT CATCTTGATA GTAGGCGTTT 16920 

CTCTTTTCTT TGGCTTTTCT AAACAACGTT TCAAAGAAAG CTTGATTGAA GGCCACCTTG 16980 

TCTTTGCTGA TAAACCAGCC CAAGCCCTTG TTTTGCTGAG AAAGGTAGGC AGTCCCAAGA 17040 

AAGTTTCCTT CCTGACCCTG CACCTCTACT TCCTGATCCT TAAGATTGAC ATTCTCAAGA 17100 

TCACTGGCTT CTAGTAAAAC TAGCCCCTTA GCAAGCTTCT TTTCAACCCT TTTGCTGACT 17160 

CTTATTCTAT TCATAACTAC CATTATATCA AACTTTTAGA CAATTCTCAA AAAAGAAACT 17220 

ACCCTTGCTT TTTTACTCTT CTTTTAAAAA ATGGTATACT AGACTTCCTG CAAAACTAGG 17280 

AAGTAAATGT GTAAGAATCA CAGTAAAAAA TGCTCTTCCG TCTTGGAGGA GCATTTCTTT 17340 

TTATCAACGA AAATCAAATA GCAAACTATG AAACTAGCCT CAGGTTAACT GTGAGATTAT 17400 

AGGTAGAGAG GTTGTATCAG CAATATGTGT CTGTCAAATT TAGTGACAAA GGTAGTAGAA 17460 

GAAAGATAAA GAAATAAATC AGCTTCAGTA GGTATCTGGA AAATTTGATT TTATAGAGAA 17520 

GCCTTTTGTT ACAAACTCAA TATACTATCA ATAAATAATA TTATAGAAGC AACAATAATT 17580 

ATAATTTCAC CTATCTGCAT CATTCTATTT CGAACTCTAA ATATATGTTC TATCAAAAAT 17640 

ACTTGGAACA CACACATTAT AGGAATTAAC GTTTTTGAAA TTGAAAAATA TCCAAATAAA 17700 

TAAACTATAA ACAACAAAAA TAGAACTATG TTATATTTCT TATTCAAAAC ATTCCTCCCT 17760 

ATATATTTTT GATTACCAAT CTTAATCATT TACAACTACA TTCTAACAAA CTATAAAAGC 17820 

GTTTGTCGAA TTGAATTTAT CAAGCAAGCG ACCAACCAGT TCATCTTTTT TCTATTTCTG 17880 

CCAATATGCG TGACAGGTAA TAATGATAGC CAAAAATAGC AAGAGCAAGC AAGACGATAA 17940 

GAGCTCCTAC TCCCAAGCTG ATGGCAAGGA TAGGGGAGAG AGACTGAACC AAGAATATGC 18000 

TCCCAATTAC AAGGGCCATC AGGATTGCAC TATAAATAAA CAATAAAACT ATGGCGACT A 18060 

TGCCATTTGA ACGATTCACC AGGTCCGTAA TGCTACTCCA ATTGGTTGAC AGATTTTTAA 1.8120 

CGTCCTTAAA GTAATGGTGG CAAGAAAGGA TGACACTGGC AATGATCCAG ACTACAAGAA 18180 

GGTAAATCAT CGAAATGATG GGCAAGCCTA GATATAGAGA AAGACCAAGC AAAGTCAGAA 18240 

CTGGTAAAAA GGACTGGACA GCATATATAA TCCAAAATTT CACTTTCACA TAACGAGCAA 18300 

AGTCAAAGGG TAAACTCTTA AGAAAATCAA CATTTTCCCT CTCCAAGGAC AAGGCAATTG 18360 

AATGCAGGCT GGTGATATTG TTATTGACAA CTGCTATAAA GAGAGCTATA AAAAACAAGG 18420 

GTAACCAGTA TGGAGGATGA ATGTCTGGAA CTATCTGAGA ATCTCGGATT TTGGAAATCA 18480 
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GACCGATCAT CATGAGATAA GGAAGGAAAG CACTTGTAAA AAGCACTGTA ATCAGGCCAG 18540 

TCCCCTGTCC CAAGAGGGTG AGGTGGTAGC GTAAAACCAT GCGAAAAAAT CCCTTTTTAG 18600 

TGGTTGAAAT TCTCTCCTTG CTGCGACGTT CTTTTTTGAC CTTCTCCTCA CTATTAAGCA 18660 

GGATCACGTC ATAAAAACGA GGAAGGACCT TCTTTTTGGT CAGATAAAGC AGGAAGAGAG 18720 

TTAGTCCTAT CCAAGCGAGC AGACCCACTA AGGCTTCTGT CGAAAAAGGC TCCACTGCTA 18780 

TTTTGTAAAA GATATGAAGA GGATAAAGGA GAAATGGAAT GTCTCTAACT TTGTCAACAA 18B40 

TACTTCCAAA AGTCGACTGA AGAAAGAAGA TAAATATTAA AGGTATGAGA ACTCCTATCC 18900 

CAATCATCAC ATTCGAAAAA ATAGACTGAT ACTTTCTGAA GACCCTAGTT TGAGCCAAGA 18960 

AATGCACTGC CACTACCATC ACTAGAGCCA C AG AG AC AAA TAATAAGGTC AAGGACAGTA 19020 

GCATCAAAGG CAAACCCAGC CATAGAGAAG GAGCTAGCCT AATGTAGAGG ACCAGAAAAT 19080 

AAGCTAGGAT TGGTACAATT CCAGTTAGAG CTGGCAAAAG GACAGACAGT CCTTTAGCAA 19140 

TTATAATCTC TGATTCTTTA AAGGCATAGG GCCTATACGA TACCAAATCC TTACTCTCAT 19200 

AAAAGACATT GTAAAAGGCC GTTAAAGAAG TTGAAAAGGC AATCACTAGT AAAATAGCAA 19260 

TCATCGAGCT AAAATAAATA GGTATTTCCT CAAAAGGAAA ATGAATGGCT ATATTACTAA 19320 

AACAGATGAT CATCAAGAGA CTGGAAAAAA TGTAAGAACT TAAGACTCTA GCGGAAACAT 19380 

TTACTTTTTT 19390 



(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 18436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

CCGAGCGTCG TTACAGACTT TATCAAGATT GGACGCAAGA AGAAATTCAA CATATAAAGG 60 

AAAATATGGC ACAATCTCCA TGGCATACTC ATTACCATGT TGAGCCAAAA ACAGGACTTC 120 

TCAACGACCC AAATGGCTTT TCTTACTTTG ATGGCAAGTG GATCCTCTTT TACCAGAATT 180 

TTCCTTTTGG TGCAGCCCAC GGTTTAAAAT CTTGGGCACA GCTAGAAAGT GATGATTTGA 240 

TTCACTTTAA AGAAACTGGA ATCAAAGTTT TACCAGATAC TCCATTAGAT AGCCACGGTG 300 

CCTACTCTGG TTCTGCCATG CAATTTGGCG ATAACTTATT CCTATTTTAT ACAGGAAATG 360 

TTCGCGATAA AAACTGGATC CGTCACCCAT ACCAGATCGG TGCTTTGATG GACAAGGAGG 420 
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GTAAGATTAC AAAGATTGAC AAGATCTTGA TTGACCAGCC AGCAGACTCT ACTGACCACT 480 

TCCGCGATCC ACAAATTTTT AACTTTCAGG GTCAATATTA TGCCATTGTC GGOGGACAAG 540 

ACTTGGAGAA AAAAGGTTTC GTTCGTCTCT ACAAGGCTGT CAATAACGAC TACACAAACT 600 

GGCAAGCAGT TGGCGACCTT GACTTTGCTA ACGACCGTAC TGCCTACATG ATGGAATGTC 660 

CTAATTTGGT CTTTGTAGAG GAACAACCTG TCCTTCTCTA CTGTCCACAA GGATTGGATA 720 

AGAAAGTTCT AGACTACGAT AATATCTTTC CAAATATGTA TAAGATCGGG GCTTCCTTTG 780 

ACCCTAAAAA TGCCAAAATG GTAGATGTGT CTCAACTTCA AAACATGGAT TACGGTTTCG 840 

AAGCCTATGC AACTCAAGCC TTCAACGCTC CTGATGGGCG TGCTCTAGCA GTTAGCTGGC 900 

TTGGTTTGCC AGATGTTTCT TACCCATCTG ACCGTTTTGA CCACCAAGGA ACCTTCTCTT 960 

TGGTCAAGGA ACTCACTATC AAAGACGACA AGCTCTACCA GTATCCAGTC GCTGCTATTA 1020 

AGGACCTTCG TGCTTCTGAA GAAGCCTTCT CAAACCGTTC CCAAACCAAG AACACTTACG 1080 

AACTTGAACT CAACTTGGAA GCTAATAGCC AGAGCGAGAT TGTCTTACTT GCTGATAAAG 1140 

AAGGTAAGGG ACTTTCAATC AACTTTGACC TTGTAAACGG TCAAGTAACA GTGGATCGTA 1200 

GCCAGGCTGG AGAACAGTAT GCCCAAGAAT TTGGGACAAC TCGTTCTTGC CCTATCGAGA 1260 

ATCAGGCTAC TACTGCTACA ATCTTCATCG ATAACTCTGT CTTTGAAATT TTCATCAATA 1320 

AAGGAGAAAA AGTATTTTCT GGTCGTGTCT TCCCACATGC GGACCAAAAT GGTATCCTGA 1380 

TTAAATCTGG AAACCCAACT GGAACTTACT ATGAATTAGA TTATGGTCGC AAAACTAACT 1440 

GATGTCGCCA AACTTGCAGG CGTCAGTCCT ACTACCGTTT CTCGGGTTAT CAATAAAAAA 1500 

GGGTATCTAT CTGAGAAAAC CATCCAAAAA GTCAATGAAG CCATGCGAGA ATTGGGCTAT 1560 

AAACCCAACA ACCTGGCTCG TAGTCTGCAA GGAAAATCAG CTAAGTTAAT CGGCTTGATT 1620 

TTCCCCAATA TTTCCAATGT TTTCTATGCA GAATTGATTG ATAAATTGGA ACACCAACTC 1680 

TTCAAAAATG GTTACAAGAC CATCATCTGC AACAGTGAAC ATGATTCTGA GAAGGAACGC 1740 

GAATACATCG AAATGTTGGA AGCCAATCAG GTGGACGGCA TCATTTCTGG TAGTCACAAC 1800 
CTAGGAATCG AAGACTACAA TCGTGTGACA GCGCCGATTA TTTCCTTTGA CCGAAACCTA . 1860 

TCGCCAGACA TCCCTGTCGT CTCCTCTGAC AACTATGCTG GTGGGGTTCT TGCTGCCCAA 1920 

ACCTTGGTCA AGACAGGTGC CCAGTCTATC ATCATGATTA CAGGGAATGA CAATTCTAAT 1980 

TCGCCAACCG GACTGCGCCA CGCTGGTTTT GCATCCGTAC TCCCAAAAGC TCCTATTATC 2040 

AATGTTTCCA GTGACTTTTC TCCCGTCAGA AAAGAAATGG AAATCAAGAA TATCTTGACC 2100 

CGGGAAAAAC CAGATGCCAT TTTTGCTTCG GATGATTTGA CAGCTATTCT GGTCATTAAA 2160 

ATCGCTCAAG AATTGGGCAT TTCTGTCCCA AAAGAGCTCA AGGTCATCGG CTATGATGGG 2220 
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ACCTACTTTA 


TCGAAAATTA 


CTACCCTCAA 


TTGGCTACTA 


TCAAGCAACC 


TTTGGAAGAG 


2280 


ATTGCTTGTC 


TCACTATTGA 


TCTTCTCTTG 


CAAAAGATTG 


AAGGCAAGGA 


AGTCGCCACA 


2340 


ACTGGTTACT 


TCTTACCAGT 


TACGCTATTA 


CCAGGAAAAA 


GTATTTAAAC 


ACAAGAAAAC 


2400 


TCAGACCGAT 


TCGTCTGAGT 


TTTTATGATC 


TTAAATTTTC 


GAGATAGCGC 


TGGGCTGTCT 


2460 


CTAGGTTAAA 


GGTTTTATCT 


GAGATGAGGC 


GCTCTACTAG 


GGGAGCAACT 


TCAGATTCAC 


2520 


TAGCCCCAGC 


TAGGAGAGCT 


AGGGATTTGG 


CCTGTAGTTT 


CATGTGGCCT 


TGCTGGATGC 


2580 


CCGTACTTAC 


CAAGGCTTTG 


AGGGCTGCAA 


AATTTTGAGC 


AAGACCGATG 


GACACGATAA 


2640 


TCTGGGCTAA 


TTCTCTGGCA 


GAAGGATTTC 


CTAGTAGATC 


ATGACTGAGA 


ACTACACGTG 


2700 


GGTTGAGGCC 


GATAGAGCCA 


CCCTTAGTCG 


CTACAGGCAT 


GGGCAGGGTC 


ATCTCACCGA 


2760 


CCAATTCTTC 


TCTTTCAAGG 


TCCAGCGTCC 


AGCAGCTAAG 


ACCTTGATAG 


CGTCCATCTC 


2820 


GACTGGCAAA 


GGCATGGGCC 


CCAGCTTCGA 


TGGCACGCCA 


GTCATTACCA 


GTGGCAATCA 


2880 


AAATCGCATC 


AATACCATTA 


AAAATTCCTT 


TATTATGAGT 


AGCAGCTCGG 


TAAGGATCAG 


2940 


CCTGCGCAAA 


CTGACTAGCC 


AACGCAATTT 


TCTCCGCAAT 


CTCTCGTCCT 


TGATCCTTTT 


3000 


GGCGGCTCAA 


GTAGCGAAAG 


GCGATGCGAC 


AGCTTGCAGT 


CACCAGAGAA 


TCGGTCGCGT 


3060 


AGTTGGACAG 


GATTCCCATG 


AGACTCTGTC 


CCTGACTGAG 


TTCTTCTAAG 


ACTGGTTTCA 


3120 


AGGCTTCCAG 


CATGGTGTTG 


AGCATATTGG 


CACCCATGGC 


TTCCTGGGTA 


TCGACATGAA 


3180 


TATAAACAAC 


GAGAAAGTCT 


GGTTCGCCTT 


TTATCTGCTC 


GACATGCAGA 


TCACGCGCCC 


3240 


CACCTCCACG 


TTTAACGATA 


GAAGGATAGG 


CTTGATTGGC 


AAGCTCCAAG 


AGCTCCGCTT 


3300 


TCTTGCTGGC 


AATCTTCTCT 


TGCGCTAGTT 


TAGGATTAGC 


AACTTGATAA 


AGGGCTACCT 


3360 


GCCCAATCAT 


CTGTCGCTGA 


TGGACTTGTG 


CAGTAAAACC 


ACCTGCACGC 


TTGATGATTT 


3420 


TGCTGGCATA 


GCTGGCCGCC 


GCAACCACAG 


AGGGTTCTTC 


TGTCACATAG 


GGAACGGTGT 


3480 


ATTCCTGACC 


GTTGACAAGT 


ACCTCCGGAA 


CCAGTGAATA 


AGGCAGAGAA 


AAAGTTCCCA 


3540 


CTACATTCTC 


ACTCAGCTGG 


TCTGCCACAG 


TCACGCTCAT 


CTGTTCATCC 


TTCTCCAGAC 


3600 


TAGCTTGTCT 


CTCAGGACTA 


AGGAGCGCCT 


GAGCTTTTAA 


CAGCTCGAGG 


CGCTCTTGGT 


3660 


ATGATTTTTT 


AGAAAATCCA 


TTCCAACTTA 


TCTTCATTAT 


TTTTCAACCT 


TGCTATAACG 


3720 




TCGAGAATTT 


CAACCAAGGC 


AAAATCTTGA 


TTTTCATAGC 


CAGCAAACTG 


3780 


GGCAGAGTTA 


GTTTCATCCA 


AGTTTACTTC 


CTCAAAAAAG 


ACCTTTTCAT 


AGTCTGCAAC 


3840 


GGATAGGGCA 


GTTCGTTGGT 


TGAGCTTGTT 


CAAACGGTCT 


TTATCCAAAT 


AAGCTTCATA 


3900 


TCCTTCAACC 


AATTCACCAC 


TGAAGAACTC 


AGCCACAGCT 


CCACTTCCGT 


AACTATAAAG 


3960 
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GGCGATTTTA TCCCCAGCTT TCAAGCTATC TGTATTTTCC AAGAGAGACA AAAGTCCAAG 4020 

GAAAAGTGAA CCTGTGTAGA TATTCCCCAC CTTTTGACTG TAGAGAATAG ACTGGTCAAA 4080 

ATGCTTTTGT AAGAGGTCTT TTTTCTCTTG AGGCAGGCTC TTATCCATGA TT TTTTTCAA 4140 

GCCTTTTAGC GCTAATTTAG GATAAGGCAA GTGGAAACAA ACAGCCGCAA AATCATCCAA 4200 

AGTAAGCTGG TAGCGTTTTT GATATTCAAG CCAAGTCGTT TTCAAACTAT CCAAGTATTG 4260 

TTGGGTAGAA TAGACACCAT TTACATAAGG AGTTGTCGAG TAATTTGGTC GCCAGAAATC 4320 

CATGATGTCA CGGGTCTGAG CTACATTGTC ATTATTAAAG GCCATCATGC GTGGATTTTG 4380 

TGTAATCAAC ATAGCTACAC TTCCAGCACC TTGAGTTGGT TCTCCTGGAG TTTCAATACC 4440 

GTATTTGGCA ATATCACTGG CAATGACCAA GACCTTGGAC TCCGGAGAAT TTTCCACATG 4500 

CAATTTGGCA TAATGGAGGG CAGCAGTCGC TCCGTAGCAG GCTTCTTTAA TCTCGAAACT 4560 

ACGAGCAAAG GGCTGGATGC CCAGCAAGCC ATGCACAAAG ACGGCCGCAG CCTTACTCTG 4*20 

GTCAATTCCT GACTCGGTCG CCACAATGAC CATGTCAACT TCTTGTCTTT CTTGCTCAGT 4680 

TAAAATAGAG TCACTAGCAC TGGCCGCCAA GGTCACGATA TCCTCAGTTA GGGGCGCAAT 4740 

ACTCAATTCC TTGAGTAAGA GTCCTTTACT TAATTTTTCA GGGTCAATTC CCCTCGCTTC 4800 

TGCTAAGTCT TGTAATTTCA AGACATATTG AOTGGTCGCA AAACCAATCT TATCAATACC 4860 

GATTGTCATA TTTACCTCTG TTTTATCATT CATGTAAAAA ATCGTTCTAT ACTATTTTAT 4920 

CACAAATGGC AGTAAAAGAG AGAAAAAAGA CTTGATTCAC CAAATCAAGC CTCTTATTGG 4980 

TCATCATTTT AAAGAATGAT TAGTTGCTAG AGAGTTCACC GATATAAGTA GCTTTATAAG 5040 

CTCCATTCAC AGTTATCAGC TCCTGGAGGA TCAAATTTCC TGAGTAAGTC CTTCCCATCT 5100 

CATCTACAAA TTTTTGATAA AACTGACTGG TCGGAATTTC TCTGACATCC TTATCAAATG 5160 

TCTTATCAAG TGTTTTACTA ACCTTCTCAG CAATCAATTG ATGCTCTTGC CATCCACTTT 5220 

GAAACTCTGA GCCCGAACTA GAAACCATGA CTGGGATAAA CAACAAGGTC AGTAGATTTA 5280 

CAGACAATAA GGAAAGTAGT AGACTTCCTG CAAAACTAGA ATCCTAGTTC ATGATTGATA 5340 

ATACCAGCAA TCAAATTCAT TCGTAATCCG AAGCGTTTAC GATGATTTCG ATAGGTTGTT 5400 

GAAAACATTT TAAACGTTTT TACTTTGGCA AAGATGTTCT CAACCTTGCT TCTCTCCTTA 5460 

GATAGCGCAT GGTTACAGGC TTTATCTTCA GCTGTTAGCG GCTTGAGTTT GCTGGATTTA 5520 

CGTGGAGTTT GTGCTTGAGG ATATATCTTC ATGAGCCCTT GATAATCACT GTCAGCCAAG 5580 

ATTTTACCAG CTTGTCCGAT ATTTCTGCAA CTCATTTTGA ACAACTTCAT ATCATGACTA 5640 

TAGTTCACAG CGATATCCAA AGAAACAATT CTCCCTTGAC TTGTGACAAT CGCTTCAGCC 5700 

TTCATAGCGT GAAATTTCTT TTTACCAGAA TCATTCGCTA ATTGTTTTTT AGGGCGATTG 5760 
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ATTTTTACTT CCGTCACATC AATCATTATC GTGTCCTCAA AGCTGAGAGG AGTTCTTGAA 5820 

ATCGTAACAC CACTTTGAAC AAGAGTTACT TCAACCCATT GGCTCCGACG GATTAAGTTG 5880 

• CTTTCGTGGA TACCAAAATC AGCCGCAATT TCTTCATAAG TGCGGTATTC TCGCACATAT 5940 

AGAAAGCGTT ATCAATTTAT TTATCTCATT TTTCAGAAAA TTCTTTTATT TCTGTAAAGT 6000 

CTACGATACT CGATGTGTTT TTATATAATG ATAGAGTCTG AGAATCACTG TTCCGCTAGC 6060 

CATTCCAATA GAGATTACCA AAGCCAACAT GACAACCAAG GTCGCACTTG CCAGTGCTTT 6120 

ATTATAGTCC CCTGTCACAA AAAAGGCAGT TGTTCGGTAG GAGAGATAAC CTGGAACCAG 6180 

CGGTGCCAAA ATGGCCAAGA TAAAGACCAC AGCAGGTGTC TTATAAAGAA TACTTAAAAT 6240 

CTGGCTGACA CAAGAACCAA TAATGGCTGC AATGAAGGTA GCTACAATGA CATTGGTCGG 6300 

TTCCTTGAGC AAGAGATAGA TTAGCCAGAC AGTCATGCCC AAAATCCCTC CAGGTAAGAG 6360 

CATAGACCGT TGCACATTGA GTACGATTAA AAAAGTGATA ATGGCAAGAA AACTTGCTAC 6420 

TGCTTGTAAT AAAAAGGTTG TTAGTGTCAT ATTAGTTCAT CAATACCAAG GCGACAGAAG 6480 

TTCCTGCCCC TAAAGCGAGG GTAATGAGCA GGGATTCAAA CATCTTACTC ATACCAGAGT 6540 

TTATGTGGTT GGTCATAATA TCACGGACCG CATTGGTCAA GGCAATACCT GGTACAAACG 6600 

GCATGACCGC ACCAGCTATA ATCAAATCTG CCGTTGAAGG AAAACCTGTG TAGCGAGCCC 6660 

AAAACTGGGC AATTATCCCA AAGACAAAGG CTCCAGCAAA GGCTGTCACA AAGGGAATTC 6720 

GGATAAATTT TTCCACATAG AGGGAAAAGG CAAAACCAAA TAAGGTCGCC ACTCCTGCCC 6780 

CAAGTGCGTC GTAGATATTT CCGCTAAACA TAACTGAAAA GAAAGGAGCA CTAAAGGTCG 6840 

CAGCCAGAGT TACCTGCAAC TTAGTATAGG GAAGGGGTTT GGCTTGCAAG GCCGTCAATT 6900 

GCTTAAAGGC TGTTTCTAAG TCAATCTGCC CCCCAACTAG CTGACGAGAA ATCTGGTTCA 6960 

CATCGCAGAC TTTTTCGATG TTATAAGAAG AGGAGGTCAC GCGCTTCATG CGCAAATATT 7020 

GGTATTTTCA ATAGAGAAAA AGATAGCGGC AGGCATGGCA AGGACATTGC AATCCACAAT 7080 

CCCCTGCGAA TGCGCGATTC GAATCATGGT ATCTTCTACA CGATGGATTT CTGAGCCACT 7140 

TTTAAGGAGA ATAGTCCCCG CTAGCATAAT CACATCAATG ACGGCATTTA ATTCTTTTGA 7200 

TTCTTCCATG CTTTCCTCCT TTTATCAACT CCCTCTATTC TATCACAAAT CCGGACTCAA 7260 

AAAAAATCTT TGCCATGAAA TCATGACAAA GATTGATTAC TCATTTTGAT TATCCATCTG 7320 

CTTTTAAGGA GTAGCTGAAG TTGTTTTAGG TTTGTAGATT GAAATCTTGA CTCTAGTCTT 7380 

ATTGAGGTCT ACCTTTTCAC CTGCTCTAGG ACTTTGTTCA ACAACCATGC CTTCTGCACT 7440 

ACCTGCAGGC GCTGTCGTCA CTTCTACAAC TTCTATATTA GCTTCCTTAA TCCCAACAAT 7500 
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TTGAATCAAA TTGTTCTTAG TAAACTCCAA GCTAGAACCA ATGTAACTCG GCATGGCAAC 7560 

ACTTGTAACT TTTTTAGCTA CTGTCAAGAC AATTTGAGTA GGTTTACTCA CATCATAAGT 7620 

CGTTCCGGCA CCTGGACTTT GTTTCATAAT CGTTCCTGGT TCGCTTTCGC TGGACTCTTC 7680 

TTCCTCTATC TTAATCAAAT TCTCAGGAAC CTTCTTCTGC TTGAGTTCTG AGATTACTTC 7740 

TGTAGAGTTC CGTCCAATAT AGTTCCCTAA TTGAATCGTC GTAGCTTTTT TAGCTACTGT 7800 

CAAAACAATT TGAGTTGCCT TGCTCAAGTC ATAGGTCGTA CCTTCTGGTA GACTTTGCTT 7860 

CAGGACCGTT CCAGCCTCAC TCTCATTCGA CTCTTCTTCC TCAATTTTAA TCAAATTATC 7920 

TGGAACTTTT TTCTCTTTTA ATTCCGCAAT GACATCAGAG GATTTCCGAC CGACATAATT 7980 

ACTAATTTGG AAAGATTGCT TGCCTGATGA GACAACCAAA TTGATTTTCG TTGCTTCTTT 8040 

TCGACCAGTT CCAGCGCCAG GATCTGTACG GATAATCCGC CCTTCTTCCA CCTTTTCACT 8100 

AGCCTCTGTC TTCTCCTCAC CAATCTCAAA ATTGGCTTTT TTGAGCGTTG CCTTGGCCTC 8160 

TGCAACTGTC TGACCTGCCA CATCTGGAAT GGCAATGGTT GCAGGAGTTC TGGATAGTAT 8220 

CCAAATAAGA GAAGCTGCCA CCAATACAAG GCTGGCCAAC AAAATCAGGT AACGCATCTT 8280 

AAATCTATGT TTTTTCGGTG CTTGTGGTTG GTAAGTTTCC TCTGTCACAG CCTGGCTTGG 8340 

GTTTTTGATT GATTTGTGTT CTGTTTGCGC TTGAACCTTA GGAATAGATG TCAAGGTACT 8400 

CTGAGAAACC TTCGGCAAGG TCTTGGTATC TGCCTTGCTC GTTTCATCAA AGATTAACTT 8460 

ACTTTCATTT CTACGATTGT AGGACAAGCT ACTAGACAAG TCCACATACA TCTCTGAAAC 8520 

CGAGCGGTAG CGATTGGTCA ACTTTTTAGC AGTTGCCTTG ATAATAACAT TTTCTAAAGC 8580 

CTGAGGTACA GATGGATTTT CTGCAATAAC GGACGGCAGG GGTTTCTGGA AATGCTGGAG 8640 

GGCAATGGTC ACCGCGCTAT CCCCGTCATA AGGGATATGG CCTGTCAGCA TCTCATAGAA 8700 

AATAATCCCC ATGGCATAGA TATCACTCTG CACAGTCGCC TTCGAACCAC GCGCCTGCTC 8760 

TGGTGACAAG TAATGAACTG AGCCCAACAT CGAGTTAGTC TGGGTCAGAC TTGTCTCTGC 8820 

AAAGGCTACA GCAATCCCAA AGTCTGTGAC CTTGGCAGTC CCATCTGGTG TCAAGAGGAT 8880 

ATTTTGAGGT TTCAAGTCCC TGTGAACAAT TCCTCGAGTA TGGGCCAAGC GCATAGCCAA 8940 

GAGAATTTGT CCCATGATAC GGACTGCTTC TTCATTAGAA AGAGGATAAT GTTCCTTGAT 9000 

ATAGCGTTTG AGGTCCAGTC CAGCCACATA CTCCATAGCT AGGTACTGTT GACCGTCTTC 9060 

CTCGCCAATA TCTGTTATCC GAACGATATG AGGATGGTCT AGATCTGCCA TAGCTCTCGC 9120 

TTCACGCTGA AAACGAGCTA CAGCTATCGG GTCCGTCTGG TAGTTGGTCC TCAGAACCTT 9180 

CACTGCCACT TCTTCCCCAT CTAAGATTAA GTCTTTGGCT AGGTAGACAT CCGCCATACC 9240 

TCCTCGACCA ATCTGTTTGA CAATCCGATA GCGTCCGGCA AAAATCTTGC CGATTTGGAT 9300 
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CATTCTGCAT CCTCCTCGTT CATAGAAACA AGGGCAACCG TAATGTTGTC TAAACCTCCT 9360 

GCATTGTTAG CAAAACGAAC AAGTGTCTCC GTTTTATCTG CTAAAGGAAT ATCACTGGTT 9420 

ACAATATCAC GAATCTCACT GCCTGAAATC ATGTTGGTCA AGCCGTCACT ATTGAGCAAG 9480 

AGATAGTCAC CTGACTCAAG GATAACTGTC CCAAAATCAG GCTGAATTTC ATCTTTTTGC 9540 

CCAATAGACT GGGTGATAAT ATTTTTTTGC GGATGAGCTT CTGCCTCTTC TGGTGTCAAT 9600 

TGACCAGCCT TGAGCAATTC ATTAACCAAG GAATGATCGC TCGTCAACTG ATGGTATTCT 9660 

TCTCCACGAA TCAAGCCGAT ACGCGAATCA CCAATATGAG CATAGATAGC CTGATTATCA 9720 

ATAATAGCAA GGACTTCCAA AGTAGTTCCC ATGCCTCTGT AAGCTTCATC CTGACCAAGC 97 BO 

TGGTGAATCT TTTGATTTTC AATTTCTAGG TAATGGGCGA ACCATTCACG CACTTCATTG 9840 

ACTGTATCGA TCTGGGTATC AACCCAAGCT ACACCCAGGT CTGTGACCGC CATTTCACTA 9900 

GCGATATTCC CTGCGCGATG ACCTCCCATC CCATCAGCTA AAATAATCAT GGTACGTCCA 9960 

GCTCTATTGA CATAGTGGTT GACATAGTCT TGGTTATTTG TTCGTTTCTG ACCAACATCT 10020 

GTTAATAATG AAATTTCCAT GTGTCAGTTC CTTCCTAATC CGATATCTTG CGAAATTGAC 10080 

TGATGAAGAA TCCATCACTT CCATACAATT CAGGTGTAAT GAGGATACAG CCGTCTTTCA 10140 

TGATATCCTT ACATTCATGT TCTAGTTTTA CCTGCTCGAA CTCGGGATGA CTCTCTAAAA 10200 

AGGCCTTAAC GACTTGAAAA TTCTCCTCTG AGACGATAGT GCAGGTGCTA TAAGTTATTA 10260 

TACCACCTTT GCCTAGTATT TGACAAACAC TACCTAATAT TTCTAACTGA ATTTCCTGCA 10320 

AGGACGCGAA ATCTGCCGTT TCTTTATTGT ATTTGATATC TGGTTTTCGG CGCAAAAGAC 10380 

CGATTCCTGA ACAAGGAGCA TCCACCAAAA TCTTATCAAA GGAATCCTGG TCAAAAAACT 10440 

CATGCACCTT TCTGGCATCC AATTTTTGAG TTTGAACCCG ATCTTCAACT CCCAGACGTT 10500 

GGGCATTTTC TTGAATTAAA TCCAACTTGT GGTCGTACAA GTCCAGAGCA GTAACCTGAC 10560 

CTGTCGTAAG ATAAGAGGCT ATATGGGCTG TTTTCCCACC TGGAGCCGCA CAGGCATCAA 10620 

GCACTCGCTC ATCACCTTGT AAATCAAGCG TCGGAGCAAC CAGCTGACTG GACTCATCTT 10680 

GGATGGTAAT GGCTCCATCC GCAAACAAAT TATGCCCTGC AAAATGCCCC TGCTCCTTAA 10740 

CCAGACCAGT GGTTGCTAAA AGGGAATTAT TCGCCTCCAA CAAGGCTTGG ATTTCCTCTT 10800 

TTCGACTTAG GTCTGTTACA CGAATACTGG CTTTGTTTCG CACTAACAGG CTTTCAAAGA 10860 

TGGCTTTTGC TCTCTCCTCT CCGTATTCTT CCTTGAGTTT GGCAACTAGC CAAACTGGGA 10920 

GAGAATAGGC AATGGAGTCA CGCTTGTTTT TTCGCTTGAT GCTAGCAATA TCTGGCCAGC 10980 

CTTCACGCAA GATACGGCGA AGGACAGCGT TGACCAATTT TTCACTGCCT TTTTTACGGA 11040 
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GTTTGGCCAA 


TTCCACTGCT 


TCATTAACCA 


CAGCATGATC 


TGGAATCTTG 


TCCAAATAGC 


11100 


GGAGTTGGTA 


GGCACTCATG 


AGAAGAAGGA 


CATAGAGCCA 


GCTGTCTAAC 


TGGTCTCTGT 


11160 


CTTCGATAAA 


GTGGGATAGG 


TACCATTCCA 


GAGTCAGTTT 


ACGGGCTACC 


GTTCCATAGA 


11220 


CCAGCTCGGT 


CACTAAGCCC 


TTGTCTGCTG 


CCAAAAGTTG 


ACTTCCCTTT 


AGATGCTTAT 


11280 


TTAAGGCGAT 


ATTTGAATAT 


GCTTGGTTCA 


CAAAAACATC 


CTCTAGCACT 


GCTAGAGCTA 


11340 


AACTTCTAGC 


CGTTTCTACT 


TTAGTCACCA 


AATCGTTCTC 


CTACAGTCAA 


TGTACGTCCA 


11400 


ACTCCGTTGA 


GGAAGGAAGC 


AATGTCCATC 


TTAGGCTTAC 


CAGCTGGCTG 


CACTTGTTTG 


11460 


AGGGATAGAG 


CCCCTTCAGC 


CGTTGCGACA 


ATCAATTCTT 


TCTTGCCGAT 


AGAGAGAATf* 


11520 


TCACCTGGAT 


TTCCCTGACC 


TTCTACTGGT 


AGGGCTTCAT 


AAATCTTAAA 




11580 


TTAAGGAAAG 


TATGGGCAAC 


AGGCCAGGGG 


TT CATTCC AC 


GAATTTGCtTT 




11D4U 


CGATTGGTTT 


TGTTCCAGTC 


CAGTTTTTCT 


TCCTCTGGCT 


TTATATTTT?n 

X X *i « #» ill 




1 1 *?nn 
11 f uu 


ACCTGACTCG 


TATCCTGCGG 


TTCAGGTTTG 


ATATCACPAfS 


PAATATAHHP 




1 1 Ten 
11 / ou 


TCCAAAAGCA 


AATCACGACC 


AACTAGCGCC 


4 A 4 4 A WU\ 


ACAAGGTGPP 


AiriT'iYi'prp 


i toon 


TCATCTGTGA 


TCGGAATGCT 


GCGACGAGAA 


ATCATATCTC 


CTGCATCCAT 


1 1 \- I. 1 1 iUtl. 


11880 


ATTTCCATGA 


TGGTCACACC 


AGCTTCCTCA 


TCCCCTTGAA 


TCAAGGCATA 


ATGGATARfif' 


11940 


GCACCACCAC 


GGTGTCTAGG 


AAGGAGGGAG 


GCATGAACGT 


TGACAGCAAA 


GTCCATGCTA 


12000 


TCAAGGAGTT TGCTTGGGAG 


AAACTGCCCA 


AAAGCAGCAG 


TCACAATTCC 




12060 


AGCTTCATAA 


GATCTTCCAT 


CTCTGGACTT 


CCAGATAATT 


TTTCAGGTTG 


GTAGATAGAT 


12120 


AGTCCTGCTT 


CCTTGGCAGC 


CTGCTTGACT 


GGGGTTTCTT 


GGATAACTTT 


TTTACGACCA 


12180 


ACAGCACGGT 


CTGGCTGGGT 


CACAACGGCT 


AGAATTTCGT 


AACGGTCATC 


TGTCAAAAGT 


12240 


CCTTTTAAGA 


CTGTTGCTGA 


AAAGTCGGGG 


GTCCCCATAA 


AGATTAGTTT 


TGTCATATCT 


12300 


TCTCCTTCTT 


ATAAAAATTG 


CTGCGGCTCA 


TGGTCAATGC 


TGAGACGGAG 


CTCACTATTT 


12360 


TCCCGTTCTT 


GAGTCAAGGC 


TAAAACCTGG 


TTGAGGGTCG 


ACCCCAGCTC 


ATCTTCTAAA 


12420 


CGGTATTTAA 


TTAAAATCTG 


GTAATGATAG 


AGGTTGTGGG 


TACGGGCAAT 


CGGTTTTGGC 


12480 


GTTGGCCCCA GAATGGGACT 


GGTCTCTGAC 


AAGCCTGACC 


GCAAAATGTT 


CATGACTTCA 


12540 


TAGGCACGTT 


TGAAAACCTC 


TTCTTCTTTC 


TTGTGAGAAA 


GGGTAATACC 


AATCGTGAAA 


12600 


TAGTAAGGTG 


GATAGCCGAG 


TTGTCGTCTG 


ATTCCCATTT 


CATAGGCATA 


AAAGCCTTCG 


12660 


TAATCTTGAT 


CCTTGGCAAA 


TCGAATAGCA 


TAGTGCTGCG 


GATTGTAGGA 


CTGTATCAAG 


12720 


ACTTGACCTG 


CCTTTTCAGC 


CCGACCTGAT 


CGACCTGCCA 


CCTGAGTCAA GAGCTGGAAG 


12780 


GTTCTCTCAG 


AAGAACGGAA 


ATCAGGCAGA 


TTCAAGGCCG 


TATCCGCATT 


TAGAACTCCG 


12840 
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ACTAGGGTAA CATTGGGAAA ATCCAAACCC TTTGCAATCA TCTGAGTACC AAGTAAAATA 12900 

TCCGCTTCCC CTCGCCCAAA CTGGTCAAGC AAGGCTTGGT GACTGCCTTT CTTTCGAGTC 12960 

GTATCCACAT CCATCCTCAG AATGCGAGCT TGGGGAAAGA GTTCTGCTAG CTCATCATAA 13020 

GCCTTCTGAG TTCCCGTCCC ATAGTAACGA ATACTGCGGC TCTTACAGTT AGGACAGACC 13080 
TGAGGAATAT CCTTCGAGAA ACCACAATAA TGGCAGTTCA TAGTCTTGGT ATCCATATGC . 13140 

AAGGTCAGAG AAATATCGCA GTTGGGACAA GTATCCACCG TCCCACACTC CCGACACATG 13200 

ACAAAGCTAG AATAACCACG GCGATTGAGC ATGAGAACCA CCTGCTCTTT TTTAACCAGA 13260 

CGGTCTTGGA TAGCCTCTAG CAAAGGAGGC GTAAAGTTTG ACGTCTCATT TTGTCCGATA 13320 

TAGTCTCGAA AGTCAATCAC TTGAACCTCA GGGATTGTAG CCAAAGGATT GGCACGTTGG 13380 

GTTAGACGTA AGTGTTGATA GACGCCTTTG CCAGCACGTG CCCGGCTCTC TAAGCTCGGC 13440 

GTTGCAGATC CAAGTACCAG AGTTGCTTGA TTATACTGAG CCCGTAAAAT AGCTACCTCT 13500 

CTGGCATGGT AACGGGGATT GCTGTCCTGC TTATAAGCCG CTTCATGCTC TTCATCAATA 13560 

ATCATGACAC CCAGATTTTT CAGAGGAGCA AAGATAGCAG ATCTGGCACC AACAACAACT 13620 

TGGGCATCGC CACGCTCCAC CTTGCGCCAT TCATCATACT TTTCACCATT GGATAATCCT 13680 

GAGTGAAGAA TGGCTACCTT GTCCCCAAAA CGTGCTATAA AACGCTCGGT CATCTGAGGA 13740 

GTCAAGGAAA TCTCAGGTAC CAGCAAAATA GCTGTCTTGC CCTTATCCAG GGCACCTTGG 13800 

ATAATCTGCA AGTAAACCTC GGTCTTCCCA CTTCCTGTAA TCCCTTGAAG TAGAAAGGGA 13860 

GGTTGAGAAC TGCCAATAGA ACTCACAACC GCATCACGCG CCTGTCTTTG TTCTGGATTT 13920 

AACTCCAAAG GTCTACTTGC TTCAATTCCT TCAAAATAAG CAGCCGAGCG TTGAACTTCC 13980 

TTTTGGACTA TGGTAACAGC ACCTTGATCC ACAAAGAAGT TGACTTGCTC TCGCGAGTAG 14040 

GACTCTAACA AGCTAGCCAA GGAAGCGCTC TCTGGATGAG ACAGCAGATA ATCTCTCAGT 14100 

TCCAACTTTT TCTTGGCACG TGTAGAAATC TCAACACCTT CTAATTGAGC ATGGTCAACC 14160 

TCATACCAAG ACTGGGTCTT GACCTTCTTT TGATCGACTC CCTGATATTC CAGACCAAGC 14220 

AGGCCTTTTC TAGTCAAACG CATCATTTCA GCTTGCTTGG CAAGGTCTAG TGAAGAAAAG 14280 

GCTAGCGAAT CTTCTGAACC AAACAGGCGC ACTCGTTCTT CCTGACTCAA GCCTTCCAGA 14340 

GGATAGAGAA TCTTGTCATA GCTAGAATTC AGAAACCCTG GAAGCATGGC CTTGAGGATA 14400 

GAGATTTTGT AGGAGAAGAC AGATTTGCGT AACTCCTCAG CCAGCCAGAG TTGTTCTGGC 14460 

GTGAGAACAG GAGAAAAATC CAGCACCTCT GCAATATCTT TTAAATCTTG CTCCATCTCT 14520 

TCTCCATCTG ATTGGGACTT CAAACCAAGA ACAATCCCTT GAATCAGGCG ATTACCCTTA 14580 
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CCAAAAGGCA 


CATGAACCCG 


CATCCCAACT 


TCCAGCATTC 


CCTCAAATTC 


CTCCGGAATC 


14640 


CTGTAACTAT 


AGGGCTGGTC 


CGTCTGCATC 


AAGGGCACAT 


CTACGATAAT 


CTTAGCTAGG 


14700 


GCCATCTTCT 


CACCTCCTCC 


TTGTCAGTAC 


ATTCTTGCAA 


TAGAAAAAAT 


AAGATTGAGT 


14760 


CCCCCCAACC 


TTAAATTTTT 


TCACCATCTT 


CTTTTTCTTT 


AGCAATTTGC 


TCTTTGATTT 


14820 


TCTTTTCTTC 


TTCTTCTTTG 


CGGCGTTTTT 


CTTCTTCGAT 


ACGGCGACGC 


ACTGCTTCAC 


14880 


GTTTTCCTTC 


TGGATCTGGG 


TGAATTGTAA 


CGTTTCCTGA 


TTCGATTTCT 


TCTAAAGCGC 


14940 


GAAGAGTTGA 


TTTTTCAGAC 


TTGAAACCTT 


GAGTTGCTGG 


GGCACCTGCT 


TCCAATTCGT 


15000 


GGGCACGTTT 


TGCTTCCAAG 


ATTACGAGTG 


AATATTTTGA 


AGGAACCTTG 


TCGAGCAAGG 


15060 


TATCAATAGA 


GGGTTTTAAC 


ATCATTTGCT 


TGTACCTATT 


TTCTAAATTT 


TATCGGGTAG 


15120 


TTGGAGATTT 


TGGTAACATC 


TCCTGATAGT 


GACCAATGAC 


ACGATCCACA 


CAGAAGTGTT 


15180 


CTGCTTCAAT 


CACACATTTG 


ACACGTTCAG 


CAGCTAGGGG 


TACCTGATCG 


TTGACAATCG 


15240 


CATAATCATA 


CTCACGCATG 


AGGGCAATTT 


CTTCCTTGGC 


CTTTTCGATT 


CGTTGGGCAA 


15300 


TCACTTCTGC 


ACTATCTGTT 


CCACGACCTA 


CCAAGCGATC 


TTGCAATTCA 


TCCAAATCTG 


15360 


GTGGTGTCAG 


GAAGATAAAG 


ACAGCATCTG 


GAACCTTTTT 


CTTGACCTGA 


AGAGCACGCT 


15420 


GAACTTCAAT 


TTCAAGGAAA 


ACATCGATTC 


CCTTGTCCAA 


GGTTTCATTG 


ACATAGGTCA 


15480 


GAGGAGTTCC 


ATAGTAGTTA 


CCGACATATT 


CTGCGTATTC 


CAACATCTGT 


CCTTGACGAA 


15540 


TCAGCTCTTC 


AAATTCTTCA 


CGAGTACGGA 


AGAAATAGTC 


AACACOGTCC 


ACTTCTCCAG 


15600 


GACGTTGTGC 


GCGTGTCGTC 


ATCGATACAG 


AATATTGAAA 


TTGGTTTTCA 


GAACTCTCAA 


15660 


AAATCTCTCT 


TCTAACCGTT 


CCTTTTCCAA 


CCCCTGAAGG 


ACCAGAAAAA 


ACGATTAGTA 


15720 


AGCCTCGGTC 


TGCCATTGTG 


TCTCCTTTTA 


GTCAATCTGT 


GAAATAACAT 


TTCTCTAGAA 


15780 


TAATGGCAAA 


AAGCCAGATT 


ATCCTTTACA 


GTCTTTCTAT 


CTAGTGTAAC 


AAAAAAGCAG 


15840 


TAATTTTTCA 


ACTGCTCTTT CTTATTTATT 


T Af2P AT A ATP 




nVA. 1 V_Ljt-o/\>\ 


i conn 


TCACGGTTAC 


CTTGATATTT 


CCTGGATAAT 


CGAGATTGTT 


TTCAATTTTC 


TTACGAACTT 


15960 


TGTGAGCCAA 


GATTGTGACT 


TTGTCGTCCT 


TGATTTTTCC 


TGGATTGACC 


ATGATACGAA 


16020 


TTTCACGTCC 


TGCTTGAAGG 


GCAAAGCTAG 


TTTGCACTCC 


TTCAAAGCCG 


TTAGCAATTT 


16080 


CTTCCAAATC ATGGAGACGC 


TTGATGTAGC 


TTTCAAGAGA 


CTCACTACGA 


GCACCTGGAC 


16140 


GGGCTGCGCT 


CAAGGCATCT 


GCTGCAGCGA 


CGATAACTGC 


TATCACGCTC 


TCAGCTTCAA 


16200 


CATCTCCGTG 


GTGACTAGCA 


ATCGTATTCA 


CCACAACTGG 


GGGTTCCTTG 


TACTTACGGG 


16260 


CCAATTCCAT 


ACCGATTTCA 


ACGTGGCTAC 


CTTCAACCTC 


ATGGTCAATG 


GCTTTCCCGA 


16320 


TATCGTGAAG 


GAATCCAGCA 


CGACGGGCAA 


GAGCCGCATT 


TTCACCAAGT 


TCGCTCGCCA 


16380 
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TGATACCAGC CAACTTAGCA ACCTCAATCG AATGGCGCAA AACATTTTGT CCATATGAAG 16440 

TACGGAACTG CAAACGTCCC ATAATCTTCA TCAAGTCTGG ATGAAGGTTT GGOGCACCAA 16500 

TTTCATAGGC AGCAGCCTCA CCGTATTCAC GAATCTTATT GTCAATCTCT TGACGGTTTT 16560 

TCTCAACCAA CTCTTCGATA CGAGCTGGAT GTATACGACC ATCTTTGAGC AACATTTCCA 16620 

TAGTCATACG GGCAATCTCA CGACGAATCG GATCAAATCC TGACAAGGTC ACCACTTCTG 16680 

GTGTATCGTC GATAATCACA TCGACCCCTG TCAAACTTTC AAAGGTACGA ATGTTACGAC 16740 

CTTCACGACC AATAATGCGT CCCTTCATAG TATCGTCTGG CAGATGAACT GTTGAGTTTG 16800 

TTGACTCCGC TACATATTCA CCAGCGATAC GTTGCATAGC TTGAACCAAG ATGTCCTTGG 16860 

CCATTTTGTC AGAACGTTCC TTGACCTCTT GCTCAGCTTC GCGAATGCGA CTGGCAATCT 16920 

CCCTGGTCAA GTTTTCCTCT GTCTGAGCCA AGATAATATC TCGTGCTTCT GCCTGAGACA 16980 

GCGCACCAAT ACGCTCTAGT TCTGCTTCTT TTTGTCTTTC GACTTCCTCT AATTGCTCTT 17040 

CACGCGCATC AAGGTTTTTC GCTCTATCAG AAATACTTTG TTCTTTTTGT TCAAGTGTTT 17100 

GTTCTTTACT CGTCAAATTG TCGTCCTTAC GGTCAAGGCT AGTAGCTCTC TCTGTCAAAC 17160 

GACTTTCGAT TTGTTTGAGT TCTTGACGTT CTGATTTGAA TTCAGCGTCC ACTTCTTCAC 17220 

GGTATTTTCT GGCTTCTTCT TTGGCCTCCA ATAGTGCTTC TTTTTTAAGA GACTTGCTTT 17280 

CACGTTTGGC TTCATTAACA AGTAAATCCG CTTCACGCTC AGCTTGTCCA CGTAAATTAG 17340 

TTGCTTCTTG TTCAGCATTT AAAAGCATCA ACTCTGCAGC TTCCTGAGAT GATTTCATCT 17400 

TAGCTGAGAT GCTGACATAT CCAATGACTA AACCAATGAT GACGGCAAAA ACAGCAATCG 17460 

CAAGCGACAT GATTTCCATG TTTTTACCTC ATTTTATTGT TATTCCGAAT GACATACATT 17520 

CTTTTACATT CTACCATAAA AAAGTGATTT TCACAAACCT AAAATAGAAT ATGTTTTGAG 17580 

GAATTTGGAA CACATTTACC AAAATAAACT TGTTGTTTAG AAATAGTAGT TTAGTAGAGA 17640 

CTTGAGAAAA AGCCTACCTT TCAATAGACT TAGTAATGAT CTTTAAAGGA CAAGAAAGCC 17700 

ACGCTATCTC CATCCATCAT ATAAATCAAG CGATTTTCTG CATCAATACG CCGTGACCAG 17760 

GCTCCTTGGT AATCATATTT GAGTGGTTCT GGTTTACCTA TTCCTGTAAA GGGATCACGT 17820 
TGAATATCCT TGATTAGTTT ATTGATTCTT TTTAACGTTT TCTTATCCTG ATTTTGCCAG . 17880 

TAGCAATAAT CTGCCCAGGC ATCTTCTGTA AACTTGAGCA GCATTTCTTA CTCCTCAATA 17940 

ACATGGACCT GAGTACTTCC AGCACGAACT TGAGCCATTC CTCGCAAAAC CTTATCAGAA 18000 
AGTTCCTTAT TTTGAGCAAT TCTCAGGGTT TCTTGGATAC TATCCCACTC ACTCTTTGAA . 18060 

AGGACTACAA TGTCCTCATC TGGATTTTTA TTGACCACCG TCAAAGGCTC AAATTCATCA 18120 
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TTTACCTTCT TCATGTAGTC CTTTAAATGA TTTCGGAATG TTGAGTAAAG GACTGCTTCC 18180 

ATAACCATAC CTCGTTTTAG CTCTTTTCCA CTATTATACA CGAAAAGAAA GAAATTGTCA 18240 

GGAACTTGTA CAAGATTTTC TTTTCTATCT ATTTATACTC AATGAAAATC AAAGAGCAAA 18300 

CTAGGAAACT AGCCGCAGGC TGTACTTGAG TACGGCAAGG CGACGTTGAC GCGATTTGAA 18360 

TTTGATTTTC GAAGAGTATT ATTCGTAAAA AATCTCAAAA AGCCTACCTT TCGGTAGACT 18420 

TAGTTTGTTT CTATTC 18436 



(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7001 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

ACGTAGAAAA ACTATTTCTA TCACAGATAA TATTCCGTAT GTTGTTGGAG GTATTGAAAT 60 

AAACGTCCTA GGTATCTTTC TCAGTCTATG TGACTTACAA GGGAAAACTC TTTTCGAGAC 120 

AGAAATTTTG AATGAAGATT ATCCTATTTC AGAAATCAAT TCCACCATTA CCAATATGAT 180 

AAAAACAGCT ATAGAGTACG TCCCTTTGGA AACAAAATTA CTTGGATTTG GCTTATCAAT 240 

ACCTGGACAT TATAACAAAG ACTCCGGAAG TATCATTACA AACAACCCCA TATGGGAATC 300 

TTTTAATTTA TTAAATGTAA TTAAAAGATT CAATTTTCCT TTTATTGTAA AAAATAATAT 360 

CGATTGTATG GCTATAGGAC AATACCTTTT TAATCCACAC AATACCCCCG ATAACTTTAT 420 

TTTCCTACAC GCTGGATTAG GTATTTACAC TTCCTTTTTC ACAAAAGAAA AAATAGGAGC 480 

CTCTAAAAAT CCTTATATCG GAGAAATTGG ACACACCATT GTCGAATTGA ATGGGCAATA 540 

TTGTGAATGC GGAAAAAAAG GTTGTTTACA AACATATATT TCGGATGCTT GGTTAATCAA 600 

ACACGCCCAA TTATTATTTA AAAATTCCCA ACTAACTGTA CTAAAAAGCC TTGTAAAGAC 660 

TGAAAAAGAC ATTCATTTAG ACACCCTTTT AACGGCTTAT AATTTAGGCG ACTCCGCTTT 720 

ACGTCAACAA ATTGATAAAG GAGTCAATTT ATTAGCCACT TCTATTGCAA ATCTCCTCCT 780 

CATCAATCCT GCTGATAAAA TCTATATCAA CAGTCAATTG CTTAATTATC AACCTTTCAC 840 

TCATGAAGTC AGGGATAAAA TCCAAGACCA GCTCCACTTC GTTCCCTTTA CTCGTAATAT 900 

AGAAATTGAA ATTTTACCTT ACAACAAACA TCGTGGAAGT ATAGGAGCTT GTGCATTAGC 960 

TATCGTCGCT TTTTTCATAG AACATAGCAA TGTATTACAA GATATTATTT CACCTTAATA 1020 

TATTAGAAAT CTATAGACCT GTTTAAATCA ACTATAACCT GTAGTAGATA TCTCGTATTT 1080 
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AGACAATATG AAAACAAGAC GACTTCCATA TAGGAAACCG CCTTCTCGCT ATGTTGAGTG 1140 

ATTTATATTA AAATAACTTT TCTTCTAGCT GCATTTTATT ATTATAAAAA CATTCATCAT 1200 

AACCCCCAGA ACTTAAATAA CAATTTTTAT TCAAGATACA TACTCCTAGA ATAAACTTTA 1260 

TATGAAATTC TCATTTTTGT TTTTACAATT CTCCTTAGTT AAATCTTGTT TAATATATGT 1320 

TTTACATATA GTATTTAGCG CCACATAGTA CTGAACTCTC TCCAAAAACG GTTATTCCTC 1380 

TTTGAATAGG GCGTTATCAC AAGAAAAGCA TCTCCACGTT TCAACTTCAT ATGGCTCAAA 1440 

AACAATCAAT TGATGCTAAA ACCTGTACCT AGATGTTTCG GTTCATAAAA CCATGAAACT 1500 

GTAAAAGTGG ATGAAATTGA TAGCGATAGT CAAATCAAGA GGCATCATAA CTCTAAAAAG 1560 

TCACAATATA TAAGTTCATC CTCGGAAAAA TATCATTCTA ATTGTTGAAA TGCCTACATG 1620 

AAAAGAAACG TCAAATGCTC ATGAAACAAC GAATACAGGT ATCAAAACTA TGACAAAACA 1680 

AATCCCTAAA TTTACTAAAG ACACTGCTCA ACTTTACACC TGTAAATGGT TGTTGTATAA 1740 

TAAAGTTACA AAGATGTACG ACCACACTGT TGTAAATCAT AGTGTTCGCG AATATATTAC 1800 

TGATAGCATT TCTACAAATA CAAGTAAAGA GAGCGGATGA GATTCAAACG AAATATGTCA 1860 

GTGCTTTGGC ATTCCTAGCC TTCATATCAT TTAAAGAATT CTATAGACAA AATTTTTTCC 1920 

AATACAGACA CTCGTAACAA CTGCTTCATT TTTCTACCAA CATATTTAGG AACAGGATAA 1980 

GATACAAGAG TATTAATCCA TAGCTCAGTT CTATACCAAT CTAAGACAAA TAAGCTAAAA 2040 

AAACGATTGA TAATAAGCAA ATAGATTCCA AATTTTCTCT ATCTGCTCAT TTTAATAAAC 2100 

AATACTAGTG TAACTATCCT TCCAGTCAGA AGCTTGTCAA ATCACACCGA AAATTCTTCT 2160 

AAAATTTATC TCGTTAGGCA ATCAAGCAAA AACTCGACGA TAGTACAAAC ATTATCATAC 2220 

AGGATTGACT TCCTAAATTA TATACTTTAG TAAGGTTTTC GGATAAGAAA AAAGGTTCAT 2280 

TTTACATTTC TAAACATTCT TTTCTAAGAT GAAAAACAGA ATTTTTCGAT TGTGATTTAA 2340 

AGCAACAAGA AGATTTTCAG TATCATCCTA TAGATACGAG CTAATTAAGA AAAACTACAT 2400 

TTTTGAATAT AAACTACAAT AATATAAACT AAATTTTATA GGAGGAAGAC AATGGATTGG 2460 

TACGATTATA TGATACAGGC ATCCAAACAA TCACAATTCA ACGCAAGCCA TTGGTTTCGC 2520 

TATTTGCGAA AAGTTATTTT TGAAGACTAT TCTTATTTAA CAAACCAAGA TGTAGAAAAG 2580 

TTGCTAGACT CCAAAGAACT AACCCGTTTT CAAAAAATTA GCTTGAAGTA TGCCTTTCAA 2640 

GAGCATACTC CAACTCATAA ATATGTGATT TCATTAAATA AACCTGCTAA GTTAACCAAT 2700 

GTTCAAAAAT TGATGGAGAA ATACAAACAT GGATAAAATG AAACCGGTCT TCCAAGCCCT 2760 

AAATAAGGAA TTAATTCAGG AAAATCTGAC TTTAACAATT ATCTGTGTCG GTGGTTATGT 2820 
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CTTAGAATAT CATGGTTTAC GTGCCACACA AGATGTTGAT GCTTTTATGG CTCTATAATA 2880 

TTTGTAGTGG GTAAATCCCC TATGGATATT ATGGAGCCTA TTTTTGTGTA GAAAAAAAGT 2940 

CCCATATGAC CTATAATGAA AAGCGACAAA ACAACTCATT AGAAAGAATC ATATGGAACA 3000 

ATTACATTTT ATCACAAAAT TACTAGACAT TAAAGACCCT AATATCCAGA TTTTAGACAT 3060 

CATCAATAAG GATACACACA AGGAAATCAT CGCCAAACTG GACTACGACG CCCCATCTTG 3120 

CCCTGAGTGC GGAAACCAAT TGAAGAAATA TGACTTTCAA AAACCGTCTA AGATCCCTTA 3180 

CCTCGAAACA ACTGGTATGC CTTCTAGAAT TCTCCTTAGA AAACGCCGTT TCAAGTGCTA 3240 

TCACTGTTCA AAAATGATGG TCGCTGAAAC TTCTATCGTC AAGAAGAATC ATCAAATTCC 3300 
TCGTATTATC AACCAAAAAA TTGCGCAAAA GTTGATTGAG AAGATTTCTA TGACCGATAT . 3360 

TGCTCATCAG CTGGCCATTT CAACTTCAAC TGTCATTCGC AAGCTCAATG ATTCTCACTT 3420 

TGAGCATGAT TTTTCGCGTC TTCCTGAGAT TATGTCCTGG GACGTTGAAA CAGTCCGGGG 3480 

AGTGACTGTT TCAATCGGGA GATGGAGATG AGCTTTATTG CGCAAGATTT TGAAAAGCTC 3540 

GATATCATCA CTGTTCTTGA AGGTAGAACA CAAGCTGTCA TCCGAGATCA CTTTCTTAAA 3600 

TATGATAGAG CCGTCCGATG TCGCGTCAAA ATTATTACTA TGGATATGTT TAGTCCTTAC 3660 

TATGACTTAG CTAGACAACT TTTCCCGTGT GCTAAAATCG TTCTTGATCG CTTTCACATT 372 0 

GTACAACATC TTAGCCGTGC TATGAGTCGT GTGCGTGTCC AAATCATGAA TCAGTTTCAT 3780 

CGAAAATCCC ATGAATACAA GGCTATCAAG CGCTACTGGA AACTCATTCA ACAGGATAGC 3 840 

CGTAAACTCA GCGATAAACA TTTTTATCGC CCTACTTTTC GTATGCATTT AACCAATAAA 3900 

GAGATTTTAG ACAAGCTTTT GAGCTATTCA CAAGACTTGA AACATCACTA TCAGCTCTAT 3960 

CAACTCTTGC TGTTTCACTT TCAGAATAAG GAACCGGAGA AATTTTTCGA ACTTATCGAG 4020 

GACAATCTTA AGCAGGTTCA TCCTATTTTT CAGACTGTCT TTAAAACCTT CCTCAAAGAT 4080 

AAAGAAAAGG TTATCAACGC CCTTCAACTA CACTATTCTA ATGCCAAACT GGAAGCGACC 4140 

AATAATCTCA TCAAACTTAT CAAGCGCAAT GCCTTTGGTT TTCGAAACTT TGAAAACTTC 4200 

AAAAAACGGA TTTTTATCGC TCTGAATATC AAAAAAGAAA GGACAAAATT TGTCCTTTCT 4260 

CGAGCTTAGC TTTTTTTCAA CCCACTACAG TTGACAAAGA GCCGGAAAAA GGAACAGCCT 4320 

TAGCTTTCCT TTCATTTCTT TTTATTTCCC TCGTAGTAAA CGTGCTAGCT TCCACAAAAC 4380 

AAACAGGATT CCCAGAAATG CCAGTACCAC TAGCCCACGG TACAACCATT GAGAGGTTGC 4440 

AACACGCGAT ACAGATTGTC CTTCTTTCGT AAAAGCAACC CTCGCAACTG CAGCTGTTTG 4500 

TGGATCTGAT TTTTGATAAA CAGCGACTCG TTCAAAATTC ACTAATAAGC GTTTATTAAA 4560 

GGTAGGAATC GGATCGCAGG TTATCAAGGT CATGATATTT TTAGAGCTAA CCGATTCTAA 4620 
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TTTTTCCCAT TCCGACGGTA AAATAATCTC TGTGTCCATC ATCTGATATT CTACAATTTC 4680 

CTGGCCATTA TCATAATAAA GAGCATCTCC AACTTTTAGC TGATCCAAAT GGCGGAAAAA 4740 

GACATGGCTT GGCTCTGCAC GGTGCCCAGC AATCACTGAG CGAATCCCTG TACCATCCAG 48O0 

AGGCAGCGGT GTACCATCCA CATGAGCCAA GCCCATCCCT AAATGATGAT AATCTGCTCC 4860 

CAAATAAACC GGCTCCATGA TTTCCAAACT TGGAATAGAC AAGTAACCAT AGACTGCATC 4920 

AGGGTCGTCA GACACTTGGT AATTGACCTC ATATCCCTCC GCCAAAAAAG GATCTACAAT 4980 

GCGATTTTGC GAAGCCAAGC GTTGATTGTA GGCGAGAGAA TGGTTCTGTT GTTCTTGGTA 5040 

CATTTCAGTT GTCATGGATT TCACAAATGT AGCATGACCT TTCACCTGTC CAAGAGACTG 5100 

CAACACCATC TGTCCAAAAC AATAAATAGG AATCAAACAG GCTACCAACA TCAACAAGTA 5160 

TCCCAATAAG GCTCGTAGTT TAGTCCTTGA CATGACGCCC CTCCAATTGC TTTTCTAGTC 5220 

CTTTGACAAT CCGTCGATTA CGATACACGC GATACAGCAA GAGAAGGATG ACCGCCATCG 5280 

CTCCTAGTAA TAACCACAAC CAGAATTGCC CACGCTCTCT CACCGCTCGA TTCCGCTCTG 5340 

CAATTGGTGC CGTATACGGA ATCCGCTTCC CACGTACCAA CAGACGATGA CTGTTAATCA 5400 

TATACGGTGT ACAAGTCAAC AAGGTCGCAT AATCTTCCCC ATGTTGAATC AAGACAGGCT 5460 

CAAAGTCATT CGGCTCCACC GTCACTATCT GATCCACTTG GTAGGCCAAC ACCTGATCTA 5520 

AAACGTGAAG ATAAAAGATA TCCCCTTTTT TCATCTTATC CAATTGACTG AACAATTCTG 5580 

CCGTTGGCAA TCCTCTGTGA GCAGTGATCA CTGTATGGGT ATTTTCACCT CCAACAGGCA 5640 

GCGAAGCCCC TTCTAACAGC CCTGCCCCTT TCTGAAGAAT GTCCTCACTC GTTCCGACAT 57 00 

ACATCGGAAT TTCCTGATCA ATCGCAGGAA TTTCCACATA GCCAATCCGC TCATGGACCT 5760 

TTAGCATATT GGCATATTCT GAGACGCCTT TCTTTTTCTC TTGCTCTGTA AAAGGATCAA 5820 

GAATTTCAGA TGGTTTCAAG GTCGCATTGA AGGCTTGAGC CAAGCGCCAA CGCTCCTCAA 5880 

GTTCTGCCTT ATCCATCTGG GAAACCGTCT CATCAAACTC TTTAATAACC TCGTTTGACT 5940 

CAATACGATA ATAATAACGA GACACCAATG GATATATCGC AACGGCGAAT CCTACTAAGA 6000 

AAATCAGAAG AAGGATCAGC GGATGTTTCT TCTTTTTTGT GCCTTTTTTT CGTGAACGTC 6060 

TACTGTTGTC CATCCTCCAC CTTCACTTCC TTCCTTGCTG CTTTCAGCGC CTTCAAAGCC 6120 

TTTTCCGGTT GTTTTTTCTT CTTGCGCAAG CGTCGAATAA TCCATAAAAG AATCACAATC 6180 

AAACCAACTG CCACATAAAA CAGGTAGCGA TAGAGATGAC TGAGTTTGTT TGCTGCAATA 6240 

AATTCTTCCT CAACCTCTGC TACGTACGGT ATCCGATGCC CCCGAACCAA TAGACGATGG 6300 

GTATTGATCA TGTATGGCGT ACAAGTCAGC AAGGTCACAT AATCATGACC TGGTACAATC 63 60 
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AATAAATCAT CAAAGTTCGT CGGCTCAATC ACCTTTACTT GATCCACTTG ATAGGCCATC 6420 

ACTTCCTTGA TATTGTGCAC ATAAAACTTA TCCCCAACTT TAAGTTTGGT CAAATCCGTA 6480 

AACATCTTAG CTGTTGGCAA ACCTGTATGT GCCGTAATCA CCGCATGGGT CGAATTGCCT 6540 

CCGATCGGCA GAGAAGTTCC CTCTAGATGC CCAGCCCCTT GCTGCAATAC CTCTTCAGCA 6600 

GTACCAGCAT AAACCGGCAA ATCCACGTCA ATAACGGGGA TTTCCACATG CCCCATCCGC 6660 

TCATGGATTT CTAACATACG TGCATACTCT GCTCGCCCTT TTTTCTTCAT TTCTTCCGAC 6720 

CAAGGATCGC CACTCACTAC . ATT ATTC AAA GAGTCATTGA AGGCTTGTGC CAATTTCATT 6780 

CGTTCATCAA TGTCAGCCTC ATCCAACGTT GCTTTTTCCT TATCAAAGTC AGCAATTTGT 6840 

TGATTTGATT CCACTCGATA ATACAAGCGA GACACCAGCG GATACGCCAT TACCGCCATT 6900 

CCAATGAAAA ATACCACTCC TAATAGGAGA TTATTTCGTT TTTGCTTTTT TGTTTTTACC 6960 

ATTTTTATCA GCATCCCTTT ATCTTCAAAC TTCAGGGTAT C 7001 



(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10411 base pairs 

(B) TYPE: nucleic acid 
tC) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 89: 



GAGGGAGCTT 


AAGAAGTTAC 


CACCGTCCTC 


TAGCGCCTTA 


TCCGCATCAA 


AGTTAAGGTT 


60 


GATATTTTTA 


AAACTGTCGC 


CAGCTTGTGA 


TACGATGCTT 


TGTTTAAGGT 


CATTTAGGGT 


120 


TTTAGTGAAA 


TCTGCATTGC 


TGAGGATATC 


ACTCTTTGAG 


AGATTCAAGG 


CAAAATTGAT 


180 


GATGATATTG 


ATCTGGTTTC 


CTGTTATGAC 


CTGATCAAGT 


TTGTAATTTT 


TTAAGGTATC 


240 


TTCAACAATC 


TTGCGGATAT 


CTTCTTCTGT 


CAGATTTCCC 


TTACTTTCTT 


TAGCTTTGGC 


300 


GAGTCCTGAC 


TTGATATCAG 


CTAGGGCAAC 


GTTTAATTTA 


TTAGCATCAT 


AGCCTGATTT 


360 


GTCCTTGTTT 


TCAGCATTGA 


TATCTGACAA 


AGCTTTTAGC 


TCTTCTTGAG 


CCAAATCTTT 


420 


ATTAGCTTGT 


GGCACCTTGG 


CTCCATTAGC 


CTCTAGCGAA 


TAGTAAATCC 


CTGCTAAAGC 


460 


ACTTTCTCCT 


GTAACTGGAA 


TAGGGGCTGC 


TACAGTGATT 


TTGGCATGTT 


CCATACCCAG 


540 


CGTTACTGCT 


GCGTTTCGGT 


ACATATCCTG 


AGTCACCTTA 


GTGATATTTT 


CTGGTGTTTC 


600 


AATCTTGACC 


TCAAGTGGCG 


ATTTGTCACC 


TAGCTTTTGA 


ATCTTGGCTG 


ATGAATACAA 


660 


CTGTAAGCTA 


GAGTCATTGG 


CCACATTCAT 


GATTTTAGAA 


TAAACATCAG 


GTGTCATGGT 


720 


CTTGAGTTCT 


TTGGTATCTG 


TTGAGGCATT 


GTAGCCCAGT 


TTTTTAAGAG 


TTTGATTTTT 


780 
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TTGGTCTTCA GATAGGGAGG AACCTAGGAC ATATTCAGGT TGGACATAGG TTTCATCGAT 840 

AACTTTTTGA ACATCTGTTG CTGCATGGAC GCTATTCATA GCTGTTACTG CCCACAAGAT 900 

CGCAGCGCTA GTCAGAAAGA GTTTCTTTCT CATAGGGAAT TTCCTCCTTT ACTTCTTTAG 960 

AGTAATATAT CTATCTTAAA GAAAACTTAT AACAAAAACA CCTGGTCTAG CCAGATGTTG 1020 

AAAAGAGAGT GAAACATTTG ATGATGTAAA GGTTAAGTCG TACCTGTCTA GAATAATAAT 1080 

AGTTTCCTCC ATTTACATAG AGTTCAGCAC CGTGAAAAAT GGAAATGGGG TGAATATAAC 1140 

TATAAGTCTT TCCAGTCCTA TTACCAAGCA AGGGGGCAAC AGTCTCACGA GAGTACTGTT 1200 

TGGCTAGAGC CAGGGTATTT TCCTTGCCAT TTTGGGCGAT AAAATCGATA TAGGCAGGTC 1260 

CAAAATTATA GGCTTGAACA GCTGTCCAGA TATCTACCCC CTTCTTCTGC GCCAGATAGA 1320 

GATTGCCTGT CAGAGTTTGA ATGCCTTGCC GAATGCTAGA GGCATTATCA TTGATGGTGT 1380 

TGGTGGAACC ACTTGCAGAC TCACTAGACT GCATAACATC GCCTTCTTTT CCTTTTGTTT 1440 

CAGTATAAAT CATAGCAAGC ACAAGCTCTT CGTTTGCTGG GGTGTCTTGT TCACTCAATA 1500 

TTTCTCGCAC CATGGGTTGA TAGGTCATGA CTTGTTTGAC ATCTTGATGA ACGCGGTAAG 1560 

CTTTATAGCC AGCAAAAAGG AAGACTGCTA GTACAAGCAC TCTTCGAATT CGTTTAAACA 1620 

TTATTTACTT TGGATATCCT CGATATTTTT GATTAAGATA GAGTAGGTTC CATTTTCGTT 1680 

TTGGATAAAC TCAACAGACT CGGCGTCTTG ATAGACGTTA TTGGGAACGA TGAGCTCAAT 1740 

TCCATTTGAT AAGGAGAGTT TTTGGTTTTC AAATTTCTTT AATTGGCGAC TGGCATCAAT 1800 

TTCATCAAAT TGAACAGGTT CTGGTACGGC TTCTTTGACT TGGTCAATAA AGCTCAAACG 1860 

AGCCGTCAGA TTGTTGTCAA AAAGGTCATT AGCCAATTTC TCAGGTGACA ATTCATTGCT 1920 

TTCTTCTAGG TTGTTGAAAA TAGCTGATTT GACCTTGGAT TGAAATTGAA AATCATCTGT 1980 

GTTAAAAGAT TTAGCAATTC TCTGGGCTGT TTTTTCCAGT TCCTTGATAG ATTTTTTAGG 2040 

AGAAATCTTA GGAGCGACAG CAAGAAGATT ATCTGAAAAA TAGTTCAAAA AAGTCCCGTT 2100 

GTACTTGATT CGTTTTTCAA TCAGGTGATA CTTGCTACTC TGAAGATTGA CCACCAAGGC 2160 

CTCATCAGCT CCTGTTCCAA ATCCAGGCAG GTTATTCTGA GTTAGCTTGA TTGGATTATC 2220 

AACTTCTCCT CCGAGGTGGG TCAAGGTCTC CCGCAGGGCA ATTCGCAAGA AAGCGAAATG 2280 

TTCTACACCT TCTTTAGAAA ATTGCACAAA AATCAAGTCA TTGGTCTTGA GATTTTCAGA 2340 

AATGCTAAAC TCCTCTTTCC AGAGATTAGC CAGCGTTACT GATGTCTCCA ACAAATCGTC 2400 

TGTAATATGA TTGAAGAAGG GATTTTCTTC TTCGAAAATC CCAGTCTTGG CTTCATCTGA 2460 

ATACACATGT TCAATTTTTT TACGCAGGTA TTCTTCGATT TTTGGAGTAA TATTGAGAAA 2520 
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CTTATCTGCT 


AAGAACAGTT 


CGGTATCATC CGGACTGAAC TGGTGAATAA 


TGGCTTTCTT 


2580 


AATATAAATG 


TCCATAAAAG TTTTAGTCCT CGTATAATGG GAAGGCATCT GTCAATTCTT 


2640 


TGACTGCACT 


TCTCACTTCT 


TCTAATACAG 


CCTCATTTTC 


TGAATTCTTA 


AGGGTTTTAA 


2700 


TGATGAGTTC 


AGCCACTTTG 


CGACTTTCTT 


CTTCACCAAA 


TCCACGTGCA 


GTAATGGCTG 


2760 


CTGCTCCGAT 


ACGAATCCCA 


CTTGTCTTGA 


ATGGTGACAA 


GCTTTCGTAA 


GGGATTGAGT 


2820 


TTTTATTTAA 


GGTAATATTG 


ACTTCATCCA 


ACAAGTTTTG 


AGCAACTTTG 


CCGTTTTCTA 


2880 


CAACTTTAGT 


CACATCAACA 


AGGAAGAGAT 


GGTTTTCAGT 


TCCACCTGAA 


ATAATACGGA 


2940 


AATCAGGGTC 


TTGCAAGAAG 


ACATCTGCCA 


TAGCCTTGCT 


GTTCTTAATT 


ACATTGGCAG 


3000 


CATATTCCTT 


GAAGGCTGGA 


TCCAAAACTT 


CTTTGAAGGA 


AACTGCCTTA 


GCCGCCACAA 


3060 


CATGCTCTAA 


AGGACCGCCC 


TGAATACCTG 


GGAAAATAGC 


TGAATTGATT 


TTTTTAGCAA 


3120 


GTTCTTCGTC 


ATTGGTCAAA 


ATCAAACCAC 


CACGAGGTCC 


ACGAAGGGTT 


TTGTGGGTCG 


3180 


TTGTTGTTGT 


GATATGAGCG 


TATGGAACTG 


GGCTTGGATG 


AAGGCCAGCC 


GCAACCAAGC 


3240 


CAGCGATATG 


GGCCATGTCC 


ACCATGAGCT 


TCGCACCGAC 


AGCATCTGCG 


ATTTCACGGA 


3300 


ATTTTGAAAA 


ATCGATAATT 


TGAGAATAGG 


CTGAAGCACC 


AGCTACAATC 


AGTTTTGGTT 


3360 


TTACTTCTTG 


GGCTTGTTTC 


AAGATAGCAT 


CAAAGTCTAA 


GAGTTCCGTT 


TTAGGATCAA 


3420 


CACTATAAGA 


AACAAAGTTG 


TAGGTTTGAC 


CAGAGAAGCT 


AACAGGAGCC 


CCATGAGTCA 


3480 


AATGACCACC 


TGATGCCAAA 


TCCATTCCCA 


TAACCGTATC 


ACCTGGCTCA 


ATCAAGGACA 


3540 


TGTAAGCCGC 


ACAGTTAGCT 


TGGCTTCCTG 


AATGTGGTTG 


AACATTGGCA 


AATTTAGCAC 


3600 


CGAAAATTTC 


TTTTGCGCGT 


TCAATAGCAA 


GAGTCTCTAC 


AACGTCTACT 


ACATCAGTTC 


3660 


CACCATAATA 


ACGGCGTCCT 


GGGTAACCCT 


CGGCATATTT 


ATTTGTCAAG 


ATAGACCCTT 


3720 


GAGCTGCCAT 


AACAGCCTTG 


GAAACTACGT 


TTTCCGAAGC 


AATTAACTCG 


ATATTATTTT 


3780 


GTTGGCGTTC 




GCAATAGCAT 


TCCAGAGATC 


AGCATCATAT 


GCTTTAAAAT 


3840 


CATCTTTGTC 


AAAAATCATA 


GGTCTTCTCC 


TTTATTGTGT 


GACTAGTCCA 


TTAGTTTGAT 


3900 


TTTACAATAA 


GAAAATCAAA 


CTAACAGATG 


CGAATAAACC 


GTTTCTGCAT 


TTTATCACAA 


3960 


GTATAGCCAA 


CTTTTTCATA 


AAATGCATGA 


GCACCCAGAC 


GATGATTGGC 


AGAATTTAAG 


4020 


CGGATAAACC 


CATAACCACA 


TCTTTTTGCT 


TCTTCTTCCA 


ACCCTTGTAG 


TAAACTTTTA 


4080 


CCAATACCTT 


GACCTTGCGC 


TTGAGGTGAA 


ACTGCTAAAG 


CTAAGATATT 


AAATCCTGCT 


4140 


TTGGAATAGA 


GTGATTCGTA 


AACTTCAGCG 


TGGACATATC 


CAAGTAAGAC 


ATGATTAGCT 


4200 


GCATCCTCAT 


AGCCAAGTAG 


GAAATGATGG 


GAATCCTGAG 


ACAGTCTAGC 


TAGTTGGCTA 


4260 


GCCGTTTCCT 


CTGGACTAAA AGTATAACCC 


AAAGCCTCTT 


GGTTGATGTC 


ACATATAGCT 


4320 
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TTCACATCAG TTTCTCTTAA ATCTCTTAGC ATCTCATTCC TCCTCAAAAG AAATCTTTGG 4380 

CAACCGAGCA AGAATATCTT CTCGCTTAAT GGCCCCTTGA CGTAAGATTT TCACCTTGTC 4440 

TCCCGACAAA TTCAAAATAG TTGAATCCTG TCCAGTTAGA AAAGCATCGT CTTCCAGACC 4500 

CAGAACCTCT TGGTCAAAAT CCTCTAGAAT TTGATTAAAG GTCACTCCAC TCGCCTGACC 4560 

TGAGATATTG GCAGACGGCC CAATCAAGGG ACCTGTCTCT CGAATCAAAT CAAGGGTAAT 4620 

GGGATGACTA GGCATCCGAA ATCCAACAGT TGCAAGGCCA GAATTGACCC AATAGGGAAC 4680 

TCGGTCATTA GCTTCGAGAA TAATGGTCAA GGGACCTGGT AAAAAGATCT CTACAAGTTT 4740 

TTGAAGATAA GTTGGCTGAT TCTTTGAAAA GTACAAGATG TCCTCTAAAG AGGCAACATT 4800 

GAGATTGAGC GCCTTGTCTC TACGTCGACG TTTAAGCTGG TAAACATGGT CAACTGCTTT 4860 

TTCGTCTAGC GCCTTAGCAA AGAGACCGTA AACTGTCTCT GTAGGCAAAA CGACAGCTCC 4920 

ACCATTTTCC AACTCTTGTC TAATCCTGTC CATCATCAAC GACAACCATC CTATCTTGAC 4980 

CAAATTGGTC CTTGAGTGTT CGTACTCGCT TTTCAGGAAG ATGTTTCCTA AAAAGTTCAG 5040 

GAACACTTTG ACCTTGCTTG TATCCAATTT CAAGGTAAAT CTTACCACCA TCTTTGAGAT 5100 

AGTCTTTTGC ATCTTCCGCA ATTCTACGGT AAATAGCTAG GCCATCCTCA TCTGCAAAGA 5160 

GAGCTAGATG AGGCTCCGAA TACAAGACAT TCAAGCCTAC CTCTGACTCA TCTTCACGAG 5220 

AGATATAGGG TGGATTGGAA ACAATTATAT CATATTTTTC AGAAATTTCT GTAAAACAGT 5280 

CAGATTTTTT TAAAAATATT TGAAGATTTT GATTTTTAGC ATTTTCGCTA GCTACATCTA 5340 

AAGCATCTTG GGAAATATCT GCTGCCGTCA CTGACCAATC TGGTCTGTTT TTTGCTAGAG 5400 

CGAGAGCAAT AGCTCCACTA CCTGTTCCGA TATCTAGGAC CATAAGATTT TTCACAGGAT 5460 

TTTCAGCCAG GATAAGCTCC ACCAACTCCT CTGTTTCTGG ACGAGGAATC AAAACCCGTT 5520 

CATCCACCTT TAAATGCATT CCATAAAAAT CTGCCTGTCC AATGATGTAC TGAGCTGGCT 5580 

TGTGAGCTGC TAGTTGCTGG TAAATATCTT CTACAAATTG TTTTTCTTCC TCTGTTGTCA 5640 

CCTCCTGCTG GAGGGCAAAA ATAAAGTCTG TAAAAGATAG ATTTTTCAGA CTACGATAGA 5700 

CAAAAGAGAG GCTTTCCGCT TCCTCTCCTT GTCTTATCAA CTCTTCTTCA AAATTTGAAA 5760 

ATAATTGAGC TAATTTCATT ATTTGTTTAA TTCTTCTAGT TTTTGTGTTT GGTCATAAAG 5820 

CACCAAGGCA TCCACAACTT CGTCCAATTT ACCAGACAAA ATCGTATCTA GTTTTTGGAG 5880 

GGTCAAGCCG ATACGGTGGT CTGTGACACG GTTTTGTGGG AAGTTATAAG TTCGGATCCG 5940 

TTCTGAACGG TCACCAGTAC CGATTGTCGA CTTACGCTCA GGGTCCTGCT CATCTTGAGC 6000 

AATCTGAGCA AAGTGGTCAG CAACACGGGC ACGGATGATT TTCATGGCCT TCTCACGGTT 6060 
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CTTCTGCTGG GTACGTTCTT CCTGCATCTC AACCTTGATA TTGGTTGGCA AGTGAACGAT 6120 

ACGAACGGCA GTCGCAACCT TATTGACGTT CTGTCCACCA GCACCAGAGG CGTGATAGAT 6180 

GTCGACACGA AGGTCTTTTG GATCAATGTC GTATTCAACC TCTTCAACTT CTGGCATAAC 6240 

AAGAACTGTC GCTGTCGAAG TATGAACACG GCCTTGGCTT TCTGTCACAG GAACACGTTG 6300 

CACACGGTGG GCACCTGATT CATACTTAAG CTTAGAGTAT ACAGACTGAC CTGAAACCAT 6360 

AGCAACCACT TCTTTAAAAC CACCGACACC ATTCATAGAG GCTTCCATGA CTTCAAAGCG 6420 

CCAACCTTGG GCTTCCGCAT ACTTTTGGTA CATAGTTAGC AAATCTCCAG CGAAAAGTGC 6480 

CGCTTCGTCT CCACCAGCTG CTCCACGGAT TTCAAGGATG ATATTCTTGT CATCGTTTGG 6540 

ATCCTTTGGA AGGAGCAAAA TTTTCAGTTT TTCTTCATAT TCTTCTTTTT CAGCCTTGGC 6600 

ATCTTTGAGT TCTTGCTTGG CCAATTCTTC CAAGTCCGCA TCTCCGCCTG ATTCCTTAAT 6660 

CATCTCTTCG GCATCGACGA TATTTTGAAG GACTTGTTTA TACTCACGGT AGGCTATTAC 6720 

GGTGTCACGA TTGGAAGCTT CTTCTTTTGA AAGCTCCATA AAACGCTTGG TGTCTGAAAC 6780 

GACATCAGGG TCACTCAGCA ATTCTCCTAA TTCTTCATAA CGGTCTTCTA CAACTTGTAG 6840 

TTGATCATAG ATGTTCATTT TTTCTCCTTA TTTCTCAATT GTTAAATCAT AGATTGCTAC 6900 

TACTTCATTC TCGGATATTT CCCCAGTTTC TTTAAATCCA TAACTGAGGT AACAAAATCT 6960 

TGCCTGTTCA TTTTCTGGTT CATArGACAA CCAAAGTTTA TTGCTTAAAC CTGCTGGCGC 7020 

TGTTCGAACA TAGTCTAGTA CTTTATCCAT AATTGGTTTA AAATATCCTT GATTTTGAAA 7080 

ATTCTTATCA ATCATAAAAC GAAATAGTAA ATAATTTCCA CTACTAATTC CGATCTTTTT 7140 

ATCATAAGCT ATCATCACAA AACCTATAAT TGCATCATTA TCATAAACTG CCAATGGAGC 7200 

TACAAAATCT CCATTTTTAG TGTAGACGTA TGCTTCAGCT AAACTAATTG CGTTGGTTGC 7260 

AATGAATTGT TTTTGATATT CCTTGACATC CAAATTTAAA ACATCAAAAT AATTTTCCAT 7320 
TGTAACATCT CTTAGTTCAA TTGTCATAGT TTTGCTCCTT GTTAGAGGTT ATCATTGGCG . 7380 

CAAAATAATG TTTACGGCAA ACTGAGATAT AGGTTTCGTT ACCACCAATC TGGATCTG'l'T 7440 

CTCCATCGTA AACGGGCAGT CCATCCTGTG TTCGCAACAC CATGGTCGCC TTTTTCTTGC 7500 

AATACTGACA GATGGTCTTG ATTTCGTCAA TCTTGTCTGC TAAAAGCAAG AGATATTTGG 7560 

AACCTTCGAA CAATTCATTG CGAAAGTCAT TTTTCAAGCC AAAAGCCATG ACGGGTATGT 7620 

CTAACTCGTC CACAACACGA GCTAGGTCGT AAACATGGTG GCGTTTGAGA AACTGGGCTT 7680 

CATCGACCAA AACACAGTAA GGTTTTTCTG GTAGGTCTCG GATATAGCCA AAGATATCCG 7740 

TTGTTTCCTC AATCGCAAgG GCAGGGCGTT TCATGCCAAT TCGACTCGAC ACATAGCCAA 7800 

CGCCGTCACG CGTATCCAGA GCCGAGGTCA TAATCACAAC ACCTTTTCCT TGCTCCTCGT 7860 
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AGTTATAGGC CACTTTGAGA ATCTCAATCG TTTTACCAGA GTTCATGGTC CCATAACGAT 7920 

AGTACAACTG TGCCATGTTT CTTGCTTCAC GTCCATTTCT AAATTTTTGC TACATTCTAG 7980 

TATATCATAA TTTTCTTAAG CTTTAAACGG CAAAATGTGG TAAAATAGAA GAAATCAAAA 8040 

ACTAGTGGAG GAAGCTATTA TGCCATTTGT ACGCATCGAT TTATTTGAAG GACGCACGCT 8100 

CGAGCAAAAG AAAGCTCTTG CTAAGGAAGT AACGGAAGCA GTTGTCCGCA ACACTGGAGC 8160 

CCCTCAATCT GCTGTCCATG TCATCATCAA CGACATGCCA GAAGGAACTT ACTTCCCACA 8220 

AGGGGAAATG CGTACTAAAT AAGCTAGCTT AAGCAGAATT GCTTAGGCTT TTTCAATCTC 8280 

CAAGTAGCAT TCATTGAAGA AATATCCTAA ATTTGTTACA ATTTGAAAAG AAACTTGGAG 8340 

AATTTCCAAG AAAAGAGCTA TTAATTAAAG GAAACATTAT GATTACACGT GAATTTGATA 8400 

CCATCGCTGC TATCTCTACT CCACTAGGTG AAGGGGCTAT TGGTATTGTC CGCCTGAGCG 8460 

gAACAGACAG TTTTGCTATT GCGCAAAAGA TTTTTAAAGG AAAAGACTTG AACAAGGTTG 8520 

CCAGCCACAC TCTCAACTAC GGTCACATTA TTGATCCTCT GACTGGTAAA GTCATGGACG 8580 

AGGTTATGGT TGGGGCTATG AAGTCTCCAA AGACCTTCAC TCGTGAGGAT ATTATCGAGA 8640 

TTAACACCCA CGGTGGGATT GCGGTGACCA ATGAAATTCT CCAGCTAGCT ATTCGTGAAG 8700 

GGGCTCGGTT GGCAGAACCT GGTGAATTTA CCAAACGTGC TTTTTTAAAC GGTCGCGTAG 8760 

ACTTGACACA GGCAGAGGCT GTGATGGATA TCATCCGTGC CAAGACTGAC AAGGCCATGA 8820 

ACATTGCGGT CAAACAATTA GACGGCTCCC TTTCTGACCT CATTAACAAT ACCCGTCAAG 8880 

AAATCCTCAA TACACTTGCC CAAGTTGAGG TCAATATCGA CTATCCTGAG TATGACGATG 8940 

TTGAGGAAGC CACTACTGCT GTTGTCCGAG AGAAGACAAT GGAGTTTGAG CAATTACTAA 9000 

CCAAACTCCT TAGGACAGCA CGTCGTGGTA AAATCCTTCG TGAAGGAATT TCAACGGCTA 9060 

TCATTGGACG TCCCAACGTT GGGAAATCAA GCCTTCTCAA CAACCTCTTG CGTGAGGACA 9120 

AGGCTATCGT AACAGATATC GCTGGGACAA CACGAGATGT CATCGAAGAG TACGTCAACA 9180 

TCAATGGTGT ACCTCTCAAA TTGATTGATA CAGCCGGTAT TCGTGAAACG GATGATATCG 9240 

TTGAACAAAT TGGAGTTGAG CGTTCGAAAA AAGCTCTTAA GGAAGCTGAC CTAGTTCTGC 9300 

TAG TACT AAA CGCTAGTGAA CCACTAACCG CCCAAGATCG CCAACTCCTA GAAATCAGTC 9360 

AGGAGACTAA TCGCATTATT CTTCTTAACA AAACTGACCT GCCTGAAACG ATTGAAACTT 9420 

CGGAACTACC TGAAGATGTC ATCCGCATTT CAGTTCTTAA AAATCAAAAC ATCGATAAAA 9480 

TCGAAGAGAG AATCAACAAC CTCTTCTTTG AAAATGCTGG TTTGGTTGAG CAAGATGCTA 9540 

CCTACTTGTC AAACGCCCGT CACATTTCCT TGATTGAGAA GGCCGTTGAA AGCCTACAAG 9600 
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CTGTTAACCA AGGTCTTGAA CTAGGGATGC CAGTTGACTT GCTTCAAGTT GACTTGACCC 9660 

GTACTTGGGA AATTCTAGGA GAAATCACTG GAGATGCTGC TCCAGATGAA CTCATCACCC 9720 

AACTCTTTAG CCAATTCTGT TTAGGAAAAT AAGAAAAATC CATGATCCTT CATTCGGTCA 9780 

TGGATTTTAG GTTCTATAAT ATTTGTAGTG GGTAAATCCA CTATAGATAT TATGGAGCCT 9840 

ATTTTATTGT AGAAAAAAAG TCCCATATGA CCTATAATGA AAAGCGACAA AACAACTCAT 9900 

TAGAAAGAAT CATATGGAAC AATTACATTT TATCACAAAA TTACTAGACA TTAAAGACCC 9960 

TAATATCCAG ATTTTAGACA TCATCAATAA GGATACACAC AAGGAAATCA TCGCCAAACT 10020 

GGACTACGAC GCCCCATCTT GCCCTGAGTG CGGAAACCAA TTGAAGAAAT ATGACTTTCA 10080 

AAAAACCTTC TAAAATTCCT TATCTTGAAA CGACTGGTAT GCCCACTAGA ATTCTCCTTA 10140 

GAAAGCGTCG ATTCAAGTGC TATCACTGTT CAAAAATGAT GGTCGCTGAA ACTTCTATCG 10200 

TCAAGAAGAA TCACCAAATC CCTCGTATCA TCAACCAAAA GATTGCTCAA AAGTTAATTG 10260 

AAAAGATTTC TATGACTGAT ATTGCCCATC AGCTTTCCAT CTCAACTTCA ACTGTTATTC 10320 

GTAAGCTCAA TGACTTTCAC TTTAAACATG ATTTTTCTTG TCTTCCTGAG ATTATGTCTT 10380 

GGGATGAGTA TGCTTTTACA AAAGGGAAGA T 10411 
(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2393 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

GTTTTGGGTT CTGGAAATTA TCAGATGGTT GGAAAAGCCG TCCACATCAA GATAGTGTTC 60 

GGAGATTTAA GTTTAAATTG AAGAAACTAA CACAGAGGAA ATGGAGTATA GACCTAACAA 120 

GACGTATTGA GCAACTGAAT TTGTCTATTC GAGGATGGAT AAACTATTGC TCATTGGGAA 180 

ATATGAAAAG TATAGTCGCC AGCATAGATG AGCGCTTGCG TACTCGCCTA CGAGTGATTA 240 

TCTGGAAGCA ATGGAAGAAG AAATCGAGAC GATTATGGGG ATTGCTTAAG TTAGGAGTTC 300 

CTAAATGGAT AGCAGATAAG GTATCTGGCT GGGGOGACCA TTATCAATTA GTAGCTCAGA 360 

AGTCGGTACT TAAACGTGCT ATATCAAAAC CAGTCCTGGA AAAACGTGGA CTGGTTTCGT 420 

GTTTGGATTA TTACCTTGAA CGACATGCGT TAAAAGTTAG TTGAACCGCC GTATGCCAAA 480 

CGGCACGTAC GGTGGTGTGA GAGGGGCTAG AGATTATCCC CTACTCGATT AACTCCCCTG 540 

AAATTTATTT TAATTATGCA AATTTCACGT ATTTTTGATG CTGAGACGAC GATCCTGGGA 600 
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ACTTTTCAGA TATTTTTTTG ACTATCTAAA TCTATCATTA GAAAAGCTTA GAGCGCCAAA 660 

GGATTTGAGC GTTTTTCTGA TTTTTAAGAC TTTTTCCAGT CTCTTTTTCG ATTGAAGATG 720 

TAATTATTCT ACTAACTAAC TAACTTCTTA GTACTAGCCA ACAACGATAA TCATAATTCC 780 

TCCTAAAATT AGGAATAATA AAGGCAATAG TTTTTGTTTT TTCATGTAAA AAACCTCACT 840 

TTTGTTTTCT GCTATTTTAT GCTAAAATAT TAAAAATCAA ATTTAATTCC AAAGTTTGTA 900 

ACTAAAGGGG GAGCGCTACA TGTCTAATTC ATTTGTCAAG TTGTTAGTCT CTCAATTATT 960 

TGCAAATTTA GCAGATATTT TCTTTAGAGT AACAATCATT GCTAACATAT ACATTATTTC 1020 

AAAATCAGTA ATTGCCACAT CACTAGTTCC TATCTTAATA GGAATATCCT CTTTTGTTGC 1080 

GAGTCTTTTA GTTCCGTTGG TTACTAAAAG GTTAGCGCTA AATAGGGTTT TATCTTTATC 1140 

TCAATTTGGA AAGACTATAT TATTGGCGAT ACTGGTAGGA ATGTTTACCG TAATGCAATC 1200 

CGTAGCGCCT TTGGTGACCT ATCTATTTGT TGTTGCAATT TCCATACTAG ATGGTTTTGC 1260 

AGCACCCGTT TCCTATGCTA TTGTGCCACG CTATGCGACC GATTTGGGTA AGGCTAATTC 1320 

AGCCTTATCA ATGACTGGTG AAGCTGTTCA ATTGATAGGT TGGGGATTAG GTGGACTCTT 1380 

GTTTGCAACA ATTGGTCTGT TACCTACCAC GTGTATCAAT TTAGTCTTGT ATATCATTTC 1440 

TAGCTTTCTG ATGTTATTTC TTCCTAACGC TGAAGTGGAG GTGTTAGAGT CAGAAACTAA 1500 

TCTTGAAATT TTGCTCAAAG GTTGGAAGTT AGTTGCTAGA AATCCTAGAT TAAGACTTTT 1560 

TGTATCAGCA AATTTATTGG AAATTTTTTC AAATACGATT TGGGTTTCTT CC ATT AT ACT 1620 

TGTTTTTGTA ACGGAGTTAT TAAATAAAAC GGAAAGTTAC TGGGGATATT CTAATACAGC 1680 

ATACTCTATT GGTATTATAA TTAGTGGCTT AATTGCTTTT AGGCTATCTG AAAAGTTCCT 1740 

TGCTGCTAAA TGGGAACCCC AATTATTCAC CCCAAATCTA AAAACCATCC AGAATCCTTG 1800 

CCTTAGCTTA GATCCTGGAT GGTTTCTTTT TTCACCCAAT GGGTGTTTTT TACTAGACAA 1860 

AAAAGAGTTT CCCCTTTATG GTATAAGTGT AGAAAAAAAC ACAAAAAGAA AGGAAACTCA 1920 

CATGAACAGT TTACCAAATC ATCACTTCCA AAACAAGTCT TTTTACCAAC TATCTTTCGA 1980 

TGGAGGTCAT TTAACCCAGT ATGGTGGTCT TATCTTTTTT CAGGAACTTT TTTCCCAGTT 2040 

GAAACTAAAA GAGCGGATTT CTAAGTATTT AGTAACGAAT GACCAACGCC GCTACTGTCG 2100 

TTATTCGGAT TCAGATATCC TTGTCCAGTT CCTCTTTCAA CTGTTAACAG GTTATGGAAC 2160 

GGACTATGCT TGTAAAGAAT TGTCAGCTGA TGCCTACTTT CCAAAATTGT TGGAAGGAGG 2220 

GCAGCTTGCT TCACAGCCAA CCTTATCCCG TTTTCTTTCC AGAACTGACG AGGAAACAGT 2280 

CCATAGTTTG CGATGCCTCA ACCTTGAATT GGTCGAATTC TTTTTACAGT TTCACCAGCT 2340 
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AAACCAACTC ATTGTAGATA ACGATTCTAC CCATTTCACA ACTTATGGCA AGC 2393 

(2) INFORMAT I ON FOR SEQ ID NO: 91: 

<i) SEQUENCE CHARACTERISTICS: 

. <A) LENGTH: 4762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 



TTTGTATCTT 


TTTAGGTCTC 


TTTCAATCCA 


AACCCTTTAA 


ACTATACGTC 




60 


CTGCAAGTCT 


TGTGGTAATT 


TTAGGTTTGA 


TTTTACTTTT 


CTTTTCACAA 






ACGCTTCTTA 


TTTGATGGTC 


GTCTTCCCTG 


TTTTCCTACT 


TTTATTGGTA 


apcaatatta 


180 


AGAGTCAACA 


GAGGGGGCGT 


AGTGCTAGAA 


GAAGCCGAAG 


AGAAACGCCA 


TTATGCCTAT 


240 


GGAGTCGTTT 


CTTCAAAGGA 


AATCTATATC 


TGCTAGTTTT 


TGGGTTTGTC 


TATCTTTTGT 


300 


CTGTTCCTTT 


TTTGATGAAG 


TTTGTCCTTT 


ATCCAGTACC 


TTATCAAGAA 


CGTAATCGTC 


3 60 


TTGCTGATTT 


GGTAAAAGAG 


GAGACAAATA 


CGGAAGATGC 


TATCTCATGC 


ATGGGATGAT 


420 


ACTGCGACTC 


TTTATCGTAA 


GAGTGAGCGC 


TTGTCCCATC 


GGCGATTTTG 


TCCCCGTTGC 


480 


ACTATACAGC 


AACTGAGGAA 


AATCGTAATA 


AGTTACTTAA 


TGACTTGAAA 


GAAAAACAAC 


540 


CTAAGGTGAT 


TGTGGTAAAT 


GATAAGGTGG 


TAGTCTGGTC 


TGAAGTGGAA 


ACACTCTTAA 


600 


AAGAAAATTA 


CCAACAAGTA 


AAGACTGATT 


ACTCAGAGTT 


TAAAGTCTAT 


AAAATTAAAT 


660 


AACCAAATCA 


ATATCTTGTG 


TATTTTTAAA 


AATTTTAGGA 


TTTTTAACAC 


AAGATATTGA 


720 


TTTTTCTTTT 


TAGAGTGGTA 


TAATACTTTT 


TAGAAAGAAC 


ATTTTAGAAA 


AGAGCATGCA 


780 


TATGATTGCA 


CTAGAAGAAA 


AAATTACAAT 


TTTGCCAACT 


CTCTTCGTCG 


AGAAACGAGA 


840 


TGGGAGACGT 


GTTGTATTTG 


ATGTGGACAA 


GATTGACAAG 


GCTCTCCACA 


AGGCGGCTGA 


900 


CAAGGTTATG 


GATGTGACAC 


CCCTGGTTGA 


AAAATGCCTC 


AATGATCTGA 


CTGAGCGAAT 


960 


TATTACAGAA 


ATTCATAGTC 


GCTTTCCACA 


GGGAATTAAG 


ATTTACGAAA 


TTCAAAATAT 


1020 


CGTAGAACAT 


GAACTCCTTG 


AAGCCAAAGA 


ATATGCGCTG 


GCTGAGGAGT 


ATATTACTTA 


1080 


TCGGACACAG 


AGGGATTTTG 


AGCGCTCAAA 


AGCGACGGAT 


ATCAACTTTA GTATTCATAA 


1140 


ACTTCTCAAC 


AAAGACCAGA 


CAGTTGTCAA 


TGAAAACGCT 


AATAAAGACA 


GTGATGTCTT 


1200 


TAACACTCAG 


CGTGATTTGA 


CAGCAGGGAT 


TGTTGGGAAA 


TCAATCGGAC 


TGCAAATGCT 


1260 


TCCTAAGCAC 


GTAGCCAATG 


CCCACCAAAA 


GGGGGATATC 


CACTATCACG 


ATTTGGACTA 


1320 


CAGTCCCTAT 


ACCCCTATGA 


CCAACTGCTG 


TTTGATTGAT 


TTTAAGGGTA TGTTGGAAAA 


1380 
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TGGTTTTAAG 


ATTGGAAATG 


CAGAGGTAGA 


GAGTCCCAAG 


TCTATCCAGA 


CTGCGACAGC 


1440 


ACAGATTTCT 


CAAATCATTG 


CCAACGTTGC 


TTCTAGCCAG 


TACGGTGGCT 


GTTCAGCTGA 


1500 


CCGTATCGAT 


GAAATTTTGG 


CGCCTTATGC 


AGAGAAGAAT 


TATCAAAAAC 


ATCTCAAAGA 


1560 


TGCAGAAGAG 


TGGGTATTGC 


CTGAAAAACA 


GGAAGATTAC 


GCTTGGAAGA 


AAGCGCAAAA 


1620 


GGACATCTAC 


GATGCCATGC 


AATCTCTTGA 


GTATGAAATC 


AATACTCTCT 


TCACTTCAAA 


1680 


TGGACAAACA 


CCTTTTACTT 


CGTTAGGTTT 


TGGTCTGGGA 


ACCAGTCGTT 


TTGAACGAGA 


1740 


AATTCAAAAA GCTATTTTAA ACATTCGCAT CAAGGGTCTT GGTTCAGAAC ACCGTACGGC 


1800 


TATCTTTCCT 


AAACTTATCT 


TTACGCTTAA 


AAGAGGCCTC 


AACTTAGAGG 


AAGGAACTCC 


1860 


CAACTATGAC 


ATCAAGCAGT 


TGGCTCTAGA 


GTGTGCAACC 


AAGCGGATGT 


ATCCAGACGT 


1920 


CTTGTCTTAT 


GATAAGATTG 


TTGATTTGAC 


AGGTTCTTTC 


AAGGTGCCTA 


TGGGCTGCCG 


1980 


TTCTTTCCTT 


CAAGGGTGGA 


AGGATGAAAA 


TGGTGTAGAA 


GTCAATTCAG 


GTCGCATGAA 


2040 


TCTGGGTGTT 


GTGACGGTTA 


ATCTGCCTCG 


TATTGCTCTT 


GAGTCTGAAG 


GTGATATGAA 


2100 


TAAGTTCTGG 


GAAATCTTCA 


ACGAGCGAAT 


GAATATCGCA 


GAAGATGCTC 


TTGTTTACCG 


2160 


TGTCGAACGC 


ACTAAAGAGG 


CGACACCAGC 


GAATGCTCCT 


ATTCTTTATC 


AGTACGGTGC 


2220 


TTTTGGCCAT 


CGTCTAGGTA 


AAGAAGAAAG 


TGTTGACCAG 


CTCTTTAAGA 


ATCGTCGTGC 


2280 


GACCGTTTCG 


CTGGGCTATA 


TCGGCTTGTA 


TGAAGTAGCG 


ACAGTTTTCT 


TTGGTAACAG 


2340 


CTGGGAAAGT 


AATCCAGATG 


CTAAGGAATT 


CACGCTAGAC 


ATCATTCACG 


ATATGAAACG 


2400 


CCGTGTAGAA 


GAGTGGTCAG 


ACCAATATGG 


CTACCATTTC 


TCTATCTACT 


CAACACCATC 


2460 


CGAAAGTCTG 


ACAGACCGTT 


TCTGCCGACT 


AGATATAGAC 


AAGTTTGGCT 


CTATTCCTGA 


2520 


TATCACAGAC 


AAGGAATACT 


ACACCAACTC 


TTTCCACTAC 


GATGTTCGTA 


AAAATCCAAC 


2580 


ACCGTTTGAA AAATTGGACT 


TTGAGAAAGT 


CTATCCGGAA 


GCAGGTGCGT 


CAGGTGGTTT 


2640 


CATCCATTAT 


TGTGAGTATC 


CAGTCCTTCA 


GCAAAATCCA 


AAGGCCTTGG 


AAGCTGTCTG 


2700 


GGATTATGCT 


TATGACCGTG 


TAGGCTATCT 


AGGCACCAAT 


ACTCCGATTG 


ACCGTTGCTA 


2760 


CAAGTGTGAC 


TTTGAAGGGG 


ATTTTGAACC 


AACTGAGAGA 


GGGTTTGCTT 


GTCCAAACTG 


2820 


TGGCAATAGC 


GACCCTAAAA 


CAGTAGATGT 


GGTGAAACGA 


ACTTGTGGCT 


ACCTAGGTAA 


2880 


TCCTCAAGCA AGACCGATGG 


TCAACGGGCG 


TCACAAGGAA 


ATCGCTGCGC 


GTGTCAAACA 


2940 


TATGAATGGT 


TCAACGATTA 


AAATAGCTGG 


GCATCAAGTA 


ACAAATTAGA AAGAAATGAA 


3000 


ATGGGAAAAT ATCAACTAGA CGATAAGGGG CGCGCACAAG TGACCCGTTA TCACGAGAAA 


3060 


CACTCTAAAG GTGGAGCTGG TAAGAAAGAA CGCTTGCTTA GCTTCAGAGA ACAATTTTTA 


3120 
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AACAAGAACA AGAAAAAATA AAAGTGAGAG CCAGCTCTCG CTTTTCTCAT AGTGGGAGGT 3180 

AAGGATGGAA TTACGCAGAC CAAGATTAGC GGATAAGAAA GCTGTTTTAG ATATGATGAC 3240 

AGAGTTTGAA AAATTTCAGT CGCCTCACGA CGGCGGTTTC TGGGATACAG AGAACTTTGT 3300 

GTATGAAGAC TGGTTAGAAA GCAATCAGGA ACAGGAAATG GGGATTAATC TGCCTGAAGG 3360 

ATGGGTTTCT GCAATTCAGT TAGTGGCTTT TTCTGAGAAA GGTCAAGCAG TTGGATTTCT 3420 

TAATCTCCGG TTGCGCCTCA GTAACTTTCT ACTAGAAGAA GGTGGCCACA TTGGCTACTC 3480 

CATTCGTCCA TCTGAAAGAG GCAAGGGTTA TGCAAAAGAG ACTCTCCGTC AGGGCTTGCA 3540 

AGTTGCTAAG GAAAAGAACA TCAAGAAAGC TCTGGTGACC TGTAGTGTGA ATAATCCTGC 3600 

TAGCAGAGCA GTCATTCTAG CAAATGGTGG AATATTTGAG GATGCTCGCA ATGGAGTCGA 3660 

GCGTTATTGG ATAGAGGTAG CGAATGAATA ATCCAAAACC ACAAGAATGG AAAAGCGAGG 3720 

AACTTAGTCA AGGTCGTATC ATTGACTACA AGGCCTTTAA CTTTGTGGAC GGCGAAGGCG 3780 

TGCGCAACTC TCTCTATGTA TCAGGCTGTA TGTTTCACTG CGAGGGATGT TATAATGTTG 3840 

CGACTTGGTC TTTTAATGCT GGCATTCCCT ATACAGCAGA ATTAGAAGAG CAGATTATGG 3900 

CAGACCTTGC CCAACCCTAT GTTCAAGGCT TGACTTTGCT GGGAGGGGAG CCTTTTCTCA 3960 

ATACTGGGAT TCTCTTGCCA CTTGTTAAGC GGATTCGGAA GGAATTGCCA GACAAGGACA 4020 

TCTGGTCCTG GACCGGCTAC ACTTGGGAAG AAATGATGTT GGAAACTCCA GATAAACTGG 4080 

AATTCTTGTC ACTGATTGAC ATTCTTGTCG ATGGAAGATA TGATCGAACT AAGAGAAATC 4140 

TTATGCTCCA GTTTCGAGGT TCATCTAACC AACGAATTAT CGATGTGCAA AAATCGCTCA 4200 

AAAGTGGGCA AGTAGTGATT TGGGACAAGC TCAATGACGG AAAAGAAAGC TATGAACAGG 4260 

TGAAGAGAGA ATGAAGAAAA AGGACTTAGT AGACCAACTA GTCTCAGAGA TCGAGACGGG 4320 

GAAAGTCAGG ACACTGGGAA TATACGGTCA TGGAGCTTCA GGTAAATCAA CCTTTGCACA 4380 

GGAATTGTAC CAAGCTTTAG ATTCTACTAC AGTAAATTTG CTAGAGACAG ATCCTTATAT 4440 

CACCTCAGGA CGCCATCTGG TAGTACCCAA GGACGCGCCG AATCAAAAGG TGACAGCCAG 4500 

TCTGCCAGTG GCGCATGAAC TGGAGAGTTT GCAGAGAGAT ATCCTTGCTT GCAGGCGGGT 4560 

ATGGATGTCT TGACAATTGA AGAACCTTGG AAGGCTAGTG AGGTCTTGTC TGGAGCCAAA 4620 

CCAATTTTGA TTGTCGAAGG GATGTCTGTT GGCTTTCTAC CCAAGGAACT CTTTGAAAAA 4680 

ACCATCTGTT TCTACACGGA TGAGGAGACC GAATTAAAGC GACGCCTTGC TAGAGATACG 4740 

ACTGTGAGAA ATCGCGATGC GG 4762 
(2) INFORMATION FOR SEQ ID NO: 92: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 3832 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

GATGCAGGTT TCGACCCACA TATTCCAGAA AATTACTTTA AAGATGATGA TGTTAATCAG 60 

GTACCTTGTC TTTGTTGGTC TTCATCTGCA GCCCTCTTTT TCAGTAATTG GGTAGACCAT 120 

GCGGTCTATC AGGAGACGCC TTTTGATTGG AGAAAGATAG AAGATGATGC ATCTGCATAT 180 

GGGTATTTAT AAGAGGAATT ATGACATATT TAGACGCTTT TAAATCAGGT ACCTTGGTTT 240 

TACCGAGTGC CCTGCTCTTG CATTTTAAGG AACTCTTTCC TTCTAGCGAC GATTTTCTGG 300 

TTTGGCAATT TTTCTATTTG CAAAATACGA CAGGCTTAGA AGAAATGTCG CCAAGCCAGA 360 

TTGCTGAAAG GATTGGCAAG GAAATTTCGG ATGTCAACCA GTCCATTTCT AATCTGACGG 420 

AAAGGGGACT GCTCCAGTAT CGTACTATCG AATTAAATGG CGAAATTGAA TTGCTCTTTG 480 

ATGCTAGTTT GGCCTTGGAA CGTTTGGATG ACCTGTTTGG AGCAGTTCAT TCAAGTTCAG 540 

ACCAGCTAAC ACCTCAAAAC CAGCTCAAGG ATTTGGTGGA AACCTTCCAG CAGGAGTTGG 600 

GACGATTGTT GACGCCTTTT GAGATTGAGG ATTTGACCAA GACACTAAAG GAAGATGGAA 660 

CCAGTGCTGA CTTGATTAAG GAGGCTCTTC GTGAAGCTGT TTTGAATGGA AAACCAAACT 720 

GGAAGTACAT TCAGGCGATT TTGAGAAACT GGCGCCATGA AGGAATCAAG AGTGTGGCTC 780 

AAATTGAGGC CAAGAGAGCA GAAAGAGAAG CAAGCAATCC TCAGTTGACA CAGGTATCTG 840 

CAGATTTCAT AAATGCCATG GATCTCTGGA AGGATTAATC CATGCAAGTA GGCTTGAAAT 900 

CCGAGTAAGA TTTGCAAGCT GTGTATAATT GTGATAGAAT AAATAGAAAA TAAATTGAAA 960 

AAAGAGGTAT GTGAAATGTC ACGTAAACCA TTTATCGCTG GTAACTGGAA AATGAACAAA 1020 

AATCCAGAAG AAGCTAAAGC ATTCGTTGAA GCAGTTGCAT CAAAACTTCC TTCATCAGAT 1080 

CTTGTTGAAG CAGGTATCGC TGCTCCAGCT CTTGATTTGA CAACTGTTCT TGCTGTTGCA 1140 

AAAGGCTCAA ACCTTAAAGT TGCTGCTCAA AACTGCTACT TTGAAAATGC AGGTGCTTTC 1200 

ACTGGTGAAA CTAGCCCACA AGTTTTGAAA GAAATCGGTA CTGACTACGT TGTTATCGGT 1260 

CACTCAGAAC GCCGTGACTA CTTCCATGAA ACTGATGAAG ATATCAACAA AAAAGCAAAA 1320 

GCAATCTTTG CGAACGGTAT GCTTCCAATC ATCTGTTGTG GTGAATCACT TGAAACTTAC 1380 

GAAGCTGGTA AAGCTGCTGA ATTCGTAGGT GCTCAAGTAT CTGCTGCATT GGCTGGATTG 1440 

ACTGCTGAAC AAGTTGCTGC CTCAGTTATC GCTTATGAGC CAATCTGGGC TATCGGTACT 1500 
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GGTAAATCAG CTTCACAAGA CGATGCACAA AAAATGTGTA AAGTTGTTCG TGACGTTGTA 1560 

GCTGCTGACT TTGGTCAAGA AGTCGCAGAC AAAGTTCGTG TTCAATACGG TGGTTCTGTT 1620 

AAACCTGAAA ATGTTGCTTC ATACATGGCT TGCCCAGACG TTGACGGTGC CCTTGTAGGT 1680 

GGTGCGTCAC TTGAAGCTGA AAGCTTCTTG GCTTTGCTTG ACTTTGTAAA ATAATCAGTA 1740 

AGTAGCAAAA GCTAGGTGGA ACAGCATTCA GATGTCTGTT ACATTTTTTA TAGGAGAGAA 1800 

AGATTGAAAA CAAAAATTGG ATTAGCAAGT ATCTGTTTAC TAGGCTTGGC AACTAGTCAT 1860 

GTCGCTGCAA ATGAAACTGA AGTAGCAAAA ACTTCGCAGG ATACAACGAC AGCTTCAAGT 1920 

AGTTCAGAGC AAAATCAGTC TTCTAATAAA ACGCAAACGA GCGCAGAAGT ACAGACTAAT I960 

GCTGCTGCCC ACTGGGATGG GGATTATTAT GTAAAGGATG ATGGTTCTAA AGCTCAAAGT 2040 

GAATGGATTT TTGACAACTA CTATAAGGCT TGGTTTTATA TTAATTCAGA TGGTCGTTAC 2100 

TCGCAGAATG AATGGCATGG AAATTACTAC CTGAAATCAG GTGGATATAT GGCCCAAAAC 2160 

GAGTGGATCT ATGACAGTAA TTACAAGAGT TGGTTTTATC TCAAGTCAGA TGGGGCTTAT 2220 

GCTCATCAAG AATGGCAATT GATTGGAAAT AAGTGGTACT ACTTCAAGAA GTGGGGTTAC 2280 

ATGGCTAAAA GCCAATGGCA AGGAAGTTAT TTCTTGAATG GTCAAGGAGC TATGATGCAA 2340 

AATGAATGGC TCTATGATCC AGCCTATTCT GCTTATTTTT ATCTAAAATC CGATGGAACT 2400 

TATGCTAACC AAGAGTGGCA AAAAGTGGGC GGCAAATGGT ACTATTTCAA GAAGTGGGGC 2460 

TATATGGCTC GGAATGAGTG ' GCAAGGCAAC TACTATTTGA CTGGAAGTGG TGCCATGGCG 2520 

ACTGACGAAG TGATTATGGA TGGTACTCGC TATATCTTTG CGGCCTCTGG TGAGCTCAAA 2580 

GAAAAAAAAG ATTTGAATGT CGGCTGGGTT CACAGAGATG GTAAGCGCTA TTTCTTTAAT 2640 

AATAGAGAAG AACAAGTGGG AACCGAACAT GCTAAGAAAG TCATTGATAT TAGTGAGCAC 2700 

AATGGTCGTA TCAATGATTG GAAAAAGGTT ATTGATGAGA ACGAAGTGGA TGGTGTCATT 2760 

GTTCGTCTAG GTTATAGCGG TAAAGAAGAC AAGGAATTGG CGCATAACAT TAAGGAGTTA 2820 

AACCGTCTGG GAATTCCTTA TGGTGTCTAT CTCTATACCT ATGCTGAAAA TGAGACCGAT 2880 

GCTGAGAGTG ACGCTAAACA GACCATTGAA CTTATAAAGA AATACAATAT GAACCTGTCT 2940 

TACCCTATCT ATTATGATGT TGAGAATTGG GAATATGTAA ATAAGAGCAA GAGAGCTCCA 3000 

AGTGATACAG GCACTTGGGT TAAAATCATC AACAAGTACA TGGACACGAT GAAGCAGGCG 3060 

GGTTATCAAA ATGTGTATGT CTATAGCTAT CGTAGTTTAT TACAGACGCG TTTAAAACAC 3120 

CCAGATATTT TAAAACATGT AAACTGGGTA GCGGCCTATA CGAATGCTTT AGAATGGGAA 3180 

AACCCTCATT ATTCAGGAAA AAAAGGTTGG CAATATACCT CTTCTGAATA CATGAAAGGA 3240 

ATCCAAGGGC GCGTAGATGT CAGCGTTTGG TATTAAGCGA TGATTTGAAA GAGGGATGTG 3300 
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ATAGTAGCAC CCTCTTTTTC TTTGTTTTAT GATAGTTCAT OCTCGAGTAA ATTCAAGTTC 3360 

TTGCTCGGAA ATGAAGCTTA TATAGTAGAT TGAATATAGA CAAATACCTT GTGATTGGTA 3420 

AAACATTTTA GAAATTCATT TACCTTTCCT AATCGACTTG GTTTCATCTT ATTTCAATCT 3480 

ATTATAGTAT TGGGGAATTT CTTCAAACCA CATCAGCTTG GTCAGTTCTA CCTGCGACCT 3540 

CAAAACTTGT GCTTTGGTCA AGCTGGGTTT AGTTTCCTAG TTTGCTGATG GATTTCCATT 3600 

GACTATAAGC ATCCAACCCT CTTTTTGTCT TCTAAAGAAT TCTTAAATTA TCAGTCTATT 3660 

GCAACTTTTC TCATATAAGT TCTTTGTCTT GCTATTGGTT TTCCTTAGTA GTATACTAAG 3720 

GTAGTAATCA TTAAGAAGTG GTTACAAAAA ATAATGAATG AGGTAAAGAA AATGGTAGAA 3780 

TTGAAAAAAG AAGCAGTAAA AGACGTAACA TCATTGACAA AAGCAGCGCC GG 3832 



(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10690 base pairs 

(B) TYPE: nucleic acid 

v (C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

CTTGTTTCCC TCACCTTCCT 60 

AGAAACCTTT GTAATTTCGA 120 

GAAGGTCCCT ACTTGGGGAA 180 

TGTTGCTGTC TTTTCTCCTT 240 

AGCCCAGGCA TCGATTTCAT 300 

TTCTTGCGGT ATCATGGGTG 360 

GATGATCTGG ATGGTCAATC 420 

TGCACTTGAT ACCCAACTGA 480 

CAGCAAAATC AATCTTGTTC 540 

TTATTGGACT TTCCATATCA 600 

TCAAATCATT CATTCCAACT 660 

CAAAATACGC TGACGGTAAT 720 

GCAATACTGT AATAGCTTGT 780 

ATAGATTGGG GTTGGTTTGA 840 



TGAAAAAATC CTCATGAACC 
TATAGGCATG GTCAGCTGAC 
CAATGCAGAC AGCCTGATTT 
GTGGTTCGTC TTCTAGCACA 
TAAACACCAA ATCCGCTAAA 
CTCCGATCAA AGGATTGATT 
CTCCTTTGTA ATTTTTTACT 
TCTAAATCAA AAATCTCTGG 
TTCATTTTTT GTATGAAAGA 
TTTCTTGCTT TATCCTGCAT 
TGCCATTGAC TGTTTCTCAT 
ACAATCTTTG AAATGCCCGT 
TCAATCATCG TTCCGACTTT 
TTTAATTGGT CGGCATCATT 



TGGCGCCAAT AGACAAGTGT 
ACTCGATTGA AGGGTTTAAC 
TGACTATCTA AAATGACATC 
TAGAGGTCAT AGGCTGATGC 
AGGTCTGGTT. CAACTCCAAA 
TGCTTGTATT TATTCCACAT 
TTCTTCTTTT ATGTGTTTAA 
AATAGAACTG TAGTGGATAA 
AGTATTCAGA TAGCCTGCTA 
ATCTCTTAGC ATATCTAACA 
AGTCGCAAAA ACAAAGGAAG 
TTCCAGTATA CTAGATAAGT 
CCCAGTAAAA CCCTGCTGAC 
GACAAAAGGA AAGATAACAG 
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TAAACTTCTG TAACGACATG TGCTTCAGCC TGAAATTCAT CCAAACACGC CAGTAAACGC 900 

CTAGTTCCTC TATAGCCAAA CAAGGGATGC CCTTCGTCAA AAAACTCTTT AGTCCCCACT 960 

AAACAATTGG CTTCTGTATT CGTTAATTCA GTAAAACGAT ACCAAACTTC CTTACCTAAG 1020 

TAAAAGGAGC AAATAGTATC AAGATAATCT TTCACAAATT CCTGACAACT TTGTAATAGT 1080 

ATATTTTGAT TGAGCTCTCT CAATAAGTAT TCCCCACGAA TCATGCCGAC GTGGTGAAAT 1140 

AGTTGAGGAT AAATTTTTTC AAGAATTTTT TCGCCACTAA GGGCAAGTTG ATTTCTCATC 1200 

ATTCACCTTC CAATTCATGT AAGAAGTCTT GTCCAGTTCT GGAAATCCTA ATAATTCAGA 1260 

CTTAACCTTC AAGACTAATG GCGATGCATT TTCTTCTGTA ATCTCTTGAA TATCCATCCA 1320 

AATATATCCA AGTGAATCAT TCGCACCATC AGACACAGCT TCCGAAATCG TAACTTGAGG 1380 

TGCACTCTCA TTCATTTCAA CAT CAT AC AA GGCTATGACA TGGTGAACCA TAAAATTTTT 1440 

TAACTCTTCC CTGACGAAAA CATCGTAGAT TCGAGGATTA GAGTAGCTTC TAACAGTAAA 1500 

TCCCGTCTCT TCCATAACTT CTCTAGTCAG CGTTTCCGTC AGTCCTTCAC CAAGTTGCTG 1560 

ACTGCCTCCA GGTAGATCAT ACCGATGTTG ATAAGGGCCT CTCGTTTTTT CAATGCAAAG 1620 

TAACTTTCCA TTTTCAAAGC AAACACAGTA GACCCCAAAG TGATTTTTGA TTTCCATCCA 1680 

ACTCCTCCTA CTTCAAAGAC CAGCCACCAT CTATTGTCAA GATTTGTCCT TGCATGGCGC 1740 

TCGCTTTTCC ACTTGCTAAA AAAAGACTAA GCTCTGCTAT TTCCTCTGGC TCAATCCAGC 1800 

GCTTGATTGG GGTTTCACTA GCCACCCAGT CAGCCAAACC ACCTGGTTCA AAATCCGCAG 1860 

CGGTCATAGC TGTCTTGACT GCTCCTGGAG CGATACCAAA GACCTGAATC CCAGCTTCAG 1920 

CATAGTCTAG AGCCAACTGC TTGGTGAAGC CAGCCAAGGC ATGCTTGGAT GAAGTATAGG 1980 

CGTGACCACC TCCACCTGCT AGGCTAGAAG CAATGGAACA CATATTGATG ATGATTCCCT 2040 

TTTTATTTTC CAGCATTTGT GTCAAATAAT ACCGAGTCAA CTCTACTGGA ATAATGTAGT 2100 

TGATTTCAAA AATCTCTTGA ATGTCCTGCG CCGTTTGTTC CAACAGTGGT TTGTAATCAT 2160 

CCAAAACTCC AGCAGTATTA CACAAAACAT CCACCTGAGG GCACCAGTCA AAAATAGGTT 2220 

CCAAGTCCAA GGTCAAATCT CTCTGTAAAA AGCGAAAATC ACCCTCTAAG AGTGGCTTTT 2280 

CACCTTGGTC AACTCCATAA ACTTGATAGC CCTTCTCTAA AAAGAGGCGA GCTTGAGCCA 2340 

ATCCGATCCC TGAACTCACT CCTGTAATGA GTACACGTTT AGTCATGCAC TTCTACCCAA 2400 

TCCGTTGCCA AAACATCACA AACTGTCGGG CTCCACATGG AAAAACCTTC TCCTTCGCCA 2460 

GAAACGTTGA TTAGGAAATA AGGTGTCATT TCAAGTGCAA GCCCATTTTG CTCGATGGTA 2520 

TCAAAGAGTT GGACATAGTT TTCCGCACCT CCCCAACCAG TTCGTACATA TTTTCTCTTA 2580 

GCCTTTAACC CAGGCAGGAT CTCTTCAAAT GTCATGTTTT TCTCCTTTAA TTCTACATTC 2640 
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TTCATTTAAT 


TATAGCAAAA 


AACCGCTTTA 


TACGGCTTTT 


TGAATGTGAG 


TTATTCAAAC 


2700 


CTGCTACTAC 


TTACGGCAAA 


TTATTCCCTG 


CAGCAAGATA 


AATTTCATAC 


CATTCTTTTC 


2760 


TTGTTAAGCT 


AAAGTTTGCC 


GCTCGGCTAA 


CTTCTCTCAA 


GTGCTTAGGA 


TTTGTTGTAC 


2820 


CTACGACTGC 


CTGCATTTTT 


GCTGGATAAC 


GCAATATCCA 


AGAAATGGCA ATAGTTGAAG 


2880 


AGGTTACTCC 


ATATTTAATA 


GCTAAACGAT 


CAAGTACTTG 


ATTTAAAGCT 


TGAAATTTCT 


2940 


CATTTCCAAC 


AAAATTCCCT 


TTAAAATACC 


CGAATTGTAA 


GACAGACCAT GCTTGAATGA 


3000 


CCACATCGTG 


TAATTGGCAA 


TATTCAAAAA 


TGCTGCCATC 


TCGCATAGCT 


GCTTGACTAT 


3060 


CTTCCATATT 


AACATGAAAA 


GCTGATTCAA 


ATCCTGGAGT 


AAAAGCCGCA 


CTCAATTGTA 


3120 


GCTGATTAAC 


AGCTAACGGC 


TGCTTGACAT 


CTTTTTTAAG 


CAACTCCATC 


ATCATAGGAT 


3180 


TTTGATTAGA 


AACTCCAAAA 


TCTCGAACTT 


TACCTTGTTT 


ATAAAGGAGA 


TTAAAGGCTT 


3240 


CTGCTACTTG 


GTCAGATTCC 


ATCAAAGCAT 


CTGGTCGATG 


AAGGAGCAAG 


CTATCTAGAT 


3300 


GATCAATCTT 


CAATCTTTGC 


AAAATACCGT 


CTACTGATTT 


TATAATATAG 


TCCTTAGAAA 


3360 


AATCAAAATA 


GGTAAATTCT 


TCAATGCGAA 


TGCCACATTT 


GGACTGAATC 


CACATCTTTT 


3420 


CTCTTAAATC 


TGGACGATTT 


TTTAGGACAA 


GACCTAACAG 


TTCTT C AC AA 


CGACCACGAC 


3480 


CATAAATATC 


AGCCAAGTCG 


AAGGCATTGA 


TTCCAACAGA 


AAGTGCTGTT 


TCTACAAGCT 


3540 


CTTCAACTTC 


TTTTACAGAT 


TTATCTTTTA 


TTCTCATCAT 


TCCGAGAACA 


ATTTCTGATA 


3600 


ATTCTTTGTC 


ATCTTGACCA 


AGAGTTATGT 


ATCTCATCAA 


ATTTTTCTCC 


TTTAATTTCT 


3660 


AACATTCTTC 


CCTTCATTAT 


AACAAAAAAC 


CGCTTTGCAA 


CGACTTTTTG 


ACTATACTTC 


3720 


ACTCCATTTT 


ATCTTCTTAA 


ACCCACGGAA 


CAAGACAAAG 


ATTCCAATAA 


AGAGGACAGC 


3780 


TAAAGGAATA 


ACTTTTGTAA 


GGAAAACATT 


TGAAATTCCC 


ATCCACTCAT 


AATAACGGAG 


. 3840 


CAGAGAACCC 


ACCACAAGAT 


GGGCAATAAT 


CATACTGACA 


AATGGACGAA 


AGACCGCTTC 


3900 


TTTCCAATTC 


CAAATACCGA 


TAACTAGCGA 


AATCGTAAAG 


ACAGACAAAC 


TATCCCAGGG 


3960 


AGCCGGAATA 


TAAAAGGCTC 


CTTCTTGTAT 


GAAGCTTGCC 


ATTCCTACAT 


ATCCTAAAAC 


4020 


AACTAGAAGA 


ACTATAGTCC 


CAACAACAAT 


GTAAGTGCCA 


ATTTTCATTT 


TAGGAGAATC 


4080 


TTGGACTAAA 


CTTCTTCGTA 


AAATTGTGGC 


CACAAGTCCA 


AATCCAATCA 


GAAAAATAAG 


4140 


AAGTTGCCCT 


AAAAATGTGA 


GCAAATTGAC 


TGTTAAGAGA 


GGACCTTTAG 


AAAAATCACT 


4200 


TAGTAGTTGA 


TAATAACGTA 


ATACCGCCAG 


GACAAGAATT 


GGCGTCAAAA 


GGGACTCTTT 


4260 


GATAGAACTG 


CGAGGTGCTC 


CCTTGAGAAT 


CTCTTTCATT 


ATTTTTTTAG 


GATTCTTACC 


4320 


TAGATAATCC 


TCTGCACTCA 


TGCCATCTCG 


TTCTGCTTCT 


GAGAAATCTA 


GCATCATCAA 


4360 
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ATAGATCTGC 


TCTCTGAGAT 


AGTCTTCATC 


ATAGAGAAAT 


CCAGCAAGAT 


TAAAACTTTC 


4440 


CCACAACTCC 


TCAAAATACT 


TTTGATTCTC 


CTCAGAAAAC 


TCATGTAGCA 


AAGCGCTTGT 


4500 


TTCTTCGTAA 


TACTTCATTT 


TCTTCATGGT 


TTAACCCCCA 


TTCTTAATCC 


CTTCTACTTT 


4560 


TTGACTCAAA 


TCGTCCCATT 


GTTGCCAAAA 


GACTGAGACA 


CGCTCTTCTC 


CTTCTTTCAT 


4620 


TAATGAAAAA 


TACTTCCGAT 


CTGGACCATC 


TGGCGACGGG 


CGCATGTCGC 


CTCTTATCCA 


4680 


TTGATTTTTT 


TCTAACTTTT 


GCAACAAAGG 


ATAAATAGTT 


CCTGGAACGA 


TAGTATCAAA 


4740 


TCCAGCCTCT 


CGCAAAGTCT 


GAACCAACTC 


ATAACCATAC 


CGCTCTTTTT 


GACCAATCAT 


4800 


ATCCAAGACA 


CAACCTTCAA 


GAACACCTTT 


TAATAGCTGA 


GTTTCTTTCA 


TCACTTCTCC 


4860 


CTTCTAATCT 


ATTTTGTAAT ACCTACTAGT GACTTCACCT ATAGTATATC ACTTCTACAC 


4920 


TAGTTTGTAA 


AGCATAATAG 


TTAATACTCT 


TCGAAAATCT 


CTTCAAACCA 


CGTCAGCGTC 


4980 


GCCCTACCGT 


ATGTATGGTT 


ACTGACTTCG 


TCAGTTTCAT 


CTACAACCTC 


AAAAACATGT 


5040 


TTTGAGCTGA 


CTTCGTCAGT 


TTCATCTACA 


ACCTCAAAAC 


AGTGTTTTGA 


GCTGACTTCG 


5100 


TCAGTTTCAT 


CTACAACCTC 


AAAACAGTGT 


TTTGAGCTGA 


CTTCGTCAGT 


TTCATCTACA 


5160 


ACCTCAAAAA 


CATGTTTTGA 


GCTGACTTCG 


TCAGTTTCGT 


CTACAACCTC 


AAAACAGTGT 


5220 


TTTGAGCAAC 


CTGCGGCTAG 


CTTCCTAGTT 


TGCTCTTTGA 


TTTTCATTGA 


GTATAAATAA 


5280 


AAAAACAGAA 


CTAGCCTGAA 


CTAGTCCTGT 


CTACTTTTAC 


CCAATCACAC 


TTCCATTTGG 


5340 


TACAGCTGGA 


TCAACTGTGA 


GAAGGGTTAA 


TTTGCCATCA 


TGTTCAGCTG 


AGAGAATCAT 


5400 


ACCCTGGCTG 


ACATATTTTT 


TCATC ATTTT ACGTGGTTTG 


AGGTTAGCAA 


CGATTTGAAC 


5460 


TTTCTTGCCG 


ACCAATTCTT 


GTTCATTTGG 


ATAGTATTTT 


GCAATTCCTG 


AAAGAATCTG 


5520 


ACGATCTTCT 


CCATCACCAG 


CATCCAAGCG 


GAATTGAAGC 


AACTTATCTG 


AACCTTCTAC 


5580 


TTTAGACACT 


TCTTTGACTT 


CTGCGACACG 


GATTTCAACC 


TTGTCAAAGT 


CTTCAAACTT 


5640 


GATTTCATCC 


TTGTTTAGTT 


TGAGCTCAAC 


TTCGTCCGGA 


TTCCATTCTT 


TTTCGACTGC 


5700 


TGGTTTATTG 


CCTTCCATTT 


GTTCCTTGAT 


ATAGGCGATT 


TCTTCTTCCA 


TATTTAGACG 


5760 


TGGAAAGATA 


GGTGTTCCTT 


TGGCAACTAC 


AGTCACATCT 


GCTGGGAAGT 


CAGCCAAACT 


5820 


CAAGTTTTCA 


AGACTAGAAA 


CTTCTTCCAA 


ACCAAGTTGA 


GTCAAAACTG 


CACGACTAGT 


5880 


TTCCATCATA 


AATGGTTCAA 


TCAAGTGAGC 


AACTACACGA 


ATGCTGGCTG 


CCAAGTGGCT 


5940 


CATGACACTT 


GCCAATTGGT 


CACGAAGAGC 


TTCATCCTTG 


GCCAAGACCC 


ATGGTGCGGT 


6000 


CTCATCGATG 


TATTTATTGG 


TACGAGAGAT 


CAGAGTCCAG 


ACTGCTTCAA 


GCGCACGTGG 


6060 


ATAGTCAACT 


GCTTCCATGT 


GTGTATGGAA 


GTCTGCGATT 


GATTGTwCTG 


CAACCTCAGC 


6120 


AAGAACATGA 


TCATATTCAG 


TCACACCTTC 


TACATAGGCA 


GGGATTTGTC 


CATCAAAGTA 


6180 
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CTTATTAATC ATGGAAACCG TACGGTTAAG GAGGTTCCCA AGGTCATTAG CCAATTCATA 6240 

GTTGATACGG CCGACATAGT CTTCAGGAGT AAAGGTTCCG TCTGAACCAA CTGGAAGGTT 6300 

ACGCATGAGG TAGTAACGAA GTGGATCTAG TCCATAACGC TCTACCAACA TTTCAGGGTA 6360 

AACGACATTC CCTTTTGACT TAGACATTTT TCCGTCTTTC ATGACAAACC AACCATGGGC 6420 

AATCAAACGA TCAGGTAATT TAACATCCAA CATCATAAGA AGGATTGGCC AGTAGATAGA 6480 

GTGGAAGCGA AGGATATCTT TTCCTACCAT ATGGAAGACT GTTCCATTCC AGAACTTGTC 6540 

AAAGTTACCA TGTTCGTCTT GAGCGTAGCC AAGAGCTGTC GCATAGTTAA GAAGGGCATC 6600 

AATCCAAACG TAGACAACGT GTTTTGGATT TGATGGGACA GGCACTCCCC ATGTAAAGGT 6660 

TGTACGAGAT ACCGCCAAAT CTTCCAAGCC TGGCTCGATG AAGTTGCGTA GCATTTCATT 6720 

AAGGCGACCA TCTGGCGTGA TAAATTCAGG ATGAGCTTTG AAAAATTCGA CCAAACGGTC 6780 

TTGGTATTTG CTAAGGCGAA GGAAGTATGA TTCTTCAGAA AcCCATTCAA CCTCATGACC 6840 

TGATGGAGCA ATACCACCAG TCACATTTCC AGCTTCATCA CGGAAAACTT CTGCCAGCTG 6900 

GCTTTCTGTA AAGAATTCTT CGTCTGATAC TGAATACCAA CCAGAGTATT CACCCAAGTA 6960 

GATATCATCT TGAGCAAGTA AGCGTTCAAA GACTTGTGCG ACAACTTTTT CATGGTAGTC 7020 

ATCAGTTGTA CGGATAAATT TATCGTATGA GATATCTAGT AATTGCCAGA GTTCTTTAAC 7080 

TCCAACCGCC ATTCCATCAA CATAGGCTTG AGGTGTAATA CCAGCTTCTT CCGCTTTCTG 7140 

CTGGATTTTC TGACCATGTT CATCAAGACC TGTCAGATAA AATACATCGT AGCCCATCAG 7200 

GCGTTTGTAA CGTGCTAGGA CATCACATGC GATAGTTGTG TAGGCAGAAC CGATATGAAG 7260 

TTTCCCAGAT GGATAGTAAA TCGGCGTTGT AATATAAAAA TTTTTTTCAG ACATAATTTT 7320 

TCCTTTCCAG GCAAATGAAA CCTGTTTTTC TAACACTTCA TTATATCACA TTTTTAATGA 7380 

ATTTCAATAG GGAAATCCAT ACAAAAACAA GATAGACGAG TGTCCATCTT GTTGATCTCA 7440 

TTCATAACGA AGGGCTTCAA TTGGATCAAG TTTCGATGCC TTGTTGGCTG GCAAGACTCC 7500 

AAAAATCATA CCAACACTAG CCGAAACTGC AAGACTAAAT AGGGCGACTG GGATTGATAC 7560 

TCCAACTTCT ATACCTTCTA TTAAACCTTG CAGTAACAAA CCTGCTAAGg CAGTTAAACC 7620 

ACTTGCAATT GTCAAGCCAA TTAAGCCACC TAACAAGGTC AAAATCATGG ATTCAATCAA 7680 

AAACTGAATT AAAATATTGG CACGTGTTGC ACCCAAAGCC TTACGAAGAC CAATCTCACG 7740 

AGTGCGCTCT GTCACCGAAA CCAGCATGAT GTTCATGACA CCAGTTCCTC CAACAAAGAG 7800 

AGAAATCCCT GCGATGGAAC TAATAATCGT CGTCATAAAA CTAAACGATT GTTGAATTTC 7860 

TGCAAATACA ACGGACTCAT CTGCCACCTG GTATTCTCCC TGTTGTAAGC CTGCAAGCTC 7920 
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TGTCATTTTT CGTGCCAGTT CTGGACCCAG AGTTGGGGTT AAACTGGTAT CATTCACTCG 7980 

AAAGACAATA TTAGCTATTT CATCTACATT AAAATTCGCA GCAAGGGAGA TATTGGTAGT 8040 

AATAGGCAAG CCACCAAACC CATATATTTT TGATCTTTTA GCCTCCGGAC TAGTATAAAC 8100 

CCCAATGACC CGGTAACTAA ATCCATTGAC TTCTACAACC TTGTTAATAG CCTCTTGAGG 8160 

AGATTCAAAT AAACTAATGG ACAATTCCTC ATCTAGCAAA ATGACACTTG CAAACTCTTT 8220 

GAAATCTTGC TCTCTCAGAC TACGACCTGC AATAATTTCA TTCTTAACAG CGTCCATGTA 8280 

AGTTCTGTTT CCACCTGTCA AATTAGCATT CTCAACCTTT TTATCTTGAT AGGTCAAGAT 8340 

GGCATTCGTT GAATTGGTTA CATAGTAACT ATCCACTCCC TTCAGTTTAG CTGCCTCTTG 8400 

GACCCAGGAT TCTTGCGGTT TTGGCGGTTC AACAGGAACT TCCTCTTCCT TTCCAGAAAC 8460 

CGTAAAAGCT GATTGTTTCT GAGTAAAAGA CCCGTCTTTA CTTTTTTTAG GAGAGAAAAA 8520 

GACGCTAATA TTTTTCTGAG ATTTAGTCAT ATCTTTATTG ACTTGACGAG ATAGGGAATC 8580 

ACCCAAAGCC ATAATCACAA CAACTGATGA AACACCGATA ATAATCCCAA TCATAGTAAG 8640 

CAAAGAACGC ATCTTGTGAG CCATGATAGA TGAAAAGGCA AATTTCAGAT TCTGCATCTT 8700 

AGTTTTCCTC CTTTCCTAAC TGAGCACTGT CAGACGAAAT GACCCCATCC CGAATGACAA 8760 

TCTGACGTTT GGCATAGGCA GCAATCTCAG GCTCATGCGT TACCATGATA ATGGTTTTTC 8820 

CTTCTTTATT CAAATCAACC AATAATTGCA TAATTTGGTT ACCTGTTTTG GTATCCAAGG 8880 

CTCCTGTCGG TTCATCCGCT AGGATAATAG AAGGATTGTT TACCAAGGCA CGCGCAATGG 8940 

CTACACGTTG CTTTTGACCA CCAGATAATT CTGAAGGTAA ATGGTGACTA CGTTCTGTCA 9000 

ATTCAACCTT GTCTAAATAT TCCTCAGCCA ACTTGCGACG TTTTGAAGAC GAAACTCCTG 9060 

CGTAAATCAA GGGCAATTCT ACATTTTGCA GAGCATTGAG CTTCGATAGA AGAAAGAACT 9120 

GCTGAAAGAC AAAACCGATT TGTTGGTTAC GGACCTTAGC TAGTTGTTTT TCACCAAGCC 9180 

CAGCCACTTC TTGACCTTCA AGATAATATT CTCCACTGGT TGGTGTATCC AACATGCCAA 9240 

TCGTATTCAT CAGAGTGGAC TTACCAGACC CAGATGGTCC CATGATGGCT ACAAATTCAC 9300 

CCTCATTCAC TTCTAGATTG ATATTTTTGA GAACCTGCAG TTCTTGGTCA CCATTACGGT 9360 

AACTTCTGAA GATATTTTTT AGACTAATTA GTTGCTTCAT CAGCCTTCAC CTCTTTTCCT 9420 

TCTTCCAAGG AAGATGTTGG ATTACTGATG ACCTTAGCAC CGTTCGTTAA ACCAGAAGTG 9480 

ATTTCTTGAT TTTCTGCGTC AGCATTTCCC AATGAAACCT CAACTTTTTT AGCCTTTTGT 9540 

TGTTCATCCA CAATCCAGAC ATAATTTTTA CTATCATCCA TTACTAGACT GCTAACAGGA 9600 

ACAAGAATAG CCTTAGTTTT GCTTTTAACC TCAATGTTGA CAGAAAAACC TTGTTTCAAA 9660 

TCACCAACCT CGCCTGTCAC ATCAATAGTA TAAGGGTATT TAGAACCTGT ATTATTCCCG 9720 
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GCTGCTGGAC 


TAGCTGCTTC 


ACCATTGTTT 


TTAGGATAGT 


CAGAAATATA 


GCTTAATTTC 


9780 


CCAGTCCATT 


TTTTATCAGG 


ATACACTTTA 


GAAGTAAAGC 


TTACTTCTTG 


ACCTACAGAA 


9840 


AGGTTGGCTA 


GATTGTACTC 


AGACAATTCT 


CCCTTGACTT 


GTAAATTTTC 


ATTGCTGACA 


9900 


ATATGAACCA 


TAACTTGACT 


CGCCCCTGTT 


GGAGATTTAG 


AAACATTGCT 


ATTGACTTCG 


9960 


ACCACAGTTC 


CCTCTAGGGT 


ACTGAGAACA 


GTTGTTGCAT 


CCAATTGACT 


TTGAGCCTTG 


10020 


CTTAATTGCG 


CCGCAGCATC 


TGCACGCGCA 


TCACGGGCAT 


CACCCAATTG 


AGCGTCAATA 


10080 


GAAGCAACAG 


AATTTCCAGC 


CACTGGAGTT 


GGGCTTTGCA 


CCGTTGCATC 


TTCTCCTCCT 


10140 


ACTGGCGCTG 


GTAACTGTGG 


AGCCGGAGCT 


GAAGCGGCTT 


CATTTCGTGC 


TTGATTGAGT 


10200 


TCATTGATAT 


GACGATCTGC 


CCTAGCTACT 


GCTCGACTAG 


CTGAATCATA 


GGCCGCCTGC 


10260 


GCTTCTGAAC 


TACTGTACTT 


GACTAAAGCC 


TGCCCTTCGC 


TGACCTTATC 


GCCCACAGAA 


10320 


ACAAGGATTT 


CATCTAAATC 


ACCCTTACTA 


GCATCAAAAT 


AAACATATTG 


TTCATTTTTT 


10380 


GCTGTTACTG 


TCCCTGACAA 


TAAAACAGAG 


GAGGCCACGC 


TTCCTTCCTT 


GGCAACAACA 


10440 


AGATGAGTAG 


GCTCATCTTT 


TAGAGCAGTC 


TGAGAAGGTT 


GTCTAAAGAG 


TAAAATCCCC 


10500 


CCAGCACCCA 


ATACAACTAC 


ACTCGCAGCA 


CCGATTGCTG 


CATACAGTTG 


CCACTTTTTA 


10560 


GCTTTACCAT 


TCTTTTTCTT 


CATAATGAAA 


CTCCTTTTCT 


TTTTTACAAT 


ACTTTGCTAT 


10620 


TATACCAAAT 


TTCCCTCCAG 


CAAACAATAC 


AGTTCAGGAT 


TAAACAATCG 


TTCGGAATTT 


10680 
10690 



(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 8195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

GAGAAAGCGC CCACGTTTCC CCGAAGGGAG AAAGGCGGAC AGGTATCCGG TAAGCGGCCA 60 

GGGTCGGAAC AGGAGAGCGC AACGAGGGAG CTTCCCAGGG GGAAACGCCT GGTATCTTTA 120 

TAGTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA TTTTTGTGAT GCTCGTCAGG 180 

GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCGGCCTTT TTACGGTTCC TGGCCTTTTG 240 

CTGGCCTTTT GCTCACATGT TCTTTCCTGC GTTATCCCCT GATTCTGTGG ATAACCGTAT 300 

TACCGCCTTT GAGTGAGCTG ATACCGCTCG CCGCAGCCGA ACGACCGAGC GCAGCGAGTC 360 
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AGTGAGCGAG 


GAAGCGGAAG 


AGCGCCCAAT 


ACGCAAACCG 


CCTCTCCCCG 


CGCGTTGGCC 


420 


GATTCATTAA 


TGCAGCTGGC 


ACGACAGGTT 


TCCCGACTGG 


AAAGCGGGCA 


GTGAGCGCAA 


480 


CGCAATTAAT 


GTGAGTTAGC 


TCACTCATTA 


GGCACCCCAG 


GCTTTACACT 


TTATGCTTCC 


540 


GGCTCGTATG 


TTGTGTGGAA 


TTGTGAGCGG 


ATAACAATTT 


CACACAGGAA 


ACAGCTATGA 


600 


CaTGATTACG 


AATTCGAGCT 


CGGTACCCGG 


AAAATCCAGA AAATGCTTGA 


AAAAAATCCT 


660 


AGAAGATGGT 


ATAATACTAA 


ATTGTAAGGG 


TTATCACATA 


TAACTCAAAA 


AAAGAAAGAA 


720 


CAAAAGGAGA 


GTCAAACTAT 


GGCTTCTAAA 


GATTTCCACG 


TAGTGGCAGA 


AACAGGTATT 


780 


CACGCACGTC 


CAGCAACATT 


GTTGGTACAA 


ACTGCTAGCA 


AATTTGCTTC 


AGATATCACT 


840 


CTTGAGTACA 


AAGGTAAATC 


AGTTAACCTT 


AAATCAATTA 


TGGGTGTTAT 


GAGTCTTGGT 


900 


GTTGGCCAAG 


GTGCTGACGT 


AACTATCTCA 


GCTGAAGGTG 


CAGATGCAGA 


TGACGCTATC 


960 


GCTGCAATCT 


CAGAAACAAT 


GGAAAAAGAA 


GGATTGGCAT 


AAGGGAAATG 


ACAGAAATGC 


1020 


TTAAAGGAAT 


CGCAGCATCT 


GACGGTGTTG 


CAGTTGCAAA AGCATATCTA 


CTCGTTCAGC 


1080 


CGGATTTGTC 


ATTTGAGACT 


ATTACAGTCG 


AAGATACAAA 


CGCAGAAGAA 


GCTCGCCTTG 


1140 


ATGCCGCTCT 


ACAGGCATCA 


CAAGACGAGC 


TTTCTGTTAT 


TCGCGAGAAA 


GCAGTAGGTA 


1200 


CGCTCGGTGA 


AGAAGCAGCT 


CAAGTTTTTG 


ATGCTCACTT 


AATGGTTCTT 


GCTGACCCAG 


1260 


AAATGATCAG 


CCAAATCAAG 


GAAACTATCC 


GTGCGAAGAA 


AGTGAATGCA 


GAAGCAGGTC 


1320 


TGAAAGAAGT 


TACAGATATG 


TTTATCACTA 


TCTTTGAAGG 


CATGGAAGAC 


AACCCATACA 


1380 


TGCAAGAACG 


CGCAGcGGAT 


WTCCGCGACG 


TGACAAAACG 


TGTATTGGCA 


AACCTTCTTG 


1440 


GTAAAAAATT 


GCCAAACCCA 


GCTTCTATCA 


ATGAAGAAGT 


GATTGTGATT 


GCGCATGACT 


1500 


TGACTCCTTC 


AGATACAGCT 


CAATTGGACA 


AAAACTTTGT 


AAAAGCTTTT 


GTAACCAACA 


1560 


TTGGTGGACG 


TACAAGCCAC 


TCAGCTATCA 


TGGCACGTAC 


ACTTGAAATT 


GCTGCTGTAT 


1620 


TAGGTACAAA 


TAACATCACT 


GAAATCGTTA 


AAGACGGTGA 


CATCCTTGCT 


GTTAACGGGA 


1680 


TCACTGGAGA 


AGTGATTATC 


AACCCAACAG 


ATGAACAAGC 


GGCAGAATTT 


AAAGCAGCTG 


1740 


GTGAAGCCTA 


TGCGAAACAA 


AAAGCTGAAT 


GGGCACTTTT 


GAAAGATGCT 


CAAACAGTGA 


1800 


CTGCTGACGG 


TAAACACTTC 


GAGTTGGCTG 


CTAATATCGG 


TACTCCAAAA 


GACGTTGAAG 


1860 


GTGTTAACAA 


CAACGGTGCA 


GAAGCTGTTG 


GACTTTACCG 


TACAGAGTTC 


TTGTACATGG 


1920 


ATTCTCAAGA 


CTTCCCAACT 


GAAGATGAGC 


AGTATGAAGC 


ATACAAGGCT 


GTTCTTGAAG 


1980 


GAATGAACGG 


TAAACCTGTT 


GTCGTTCGTA 


CAATGGATAT 


CGGTGGAGAT 


AAGGAACTTC 


2040 


CTTACTTCGA 


TATGCCTCAC 


GAAATGAACC CATTCCTTGG 


ATTCCGTGCT 


CTTCGTATCT 


2100 


CTATCTCTGA GACTGGAGAT 


GCTATGTTCC 


GCACACAAAT 


CCGTGCTCTT 


CTTCGTGCGT 


2160 
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CTGTTCACGG TCAATTGCGT ATCATGTTCC CAATGGTTGC GCTCTTGAAA GAATTCCGTG 2220 

CAGCGAAAGC AGTCTTTGAT GAAGAAAAAG CAAACCTTCT TGCTGAAGGT GTTGCAGTTG 2280 

CGGATAACAT CCAAGTTGGT ATCATGATCG AGATTCCTGC AGCGGCTATG CTTGCAGACC 2340 

AATTTGCTAA AGAAGTTGAC TTCTTCTCAA TTGGTACAAA CGACTTGATC CAATATACAA 2400 

TGGCAGCAGA CCGTATGAAC GAACAAGTTT CATACCTTTA CCAACCATAC AACCCATCAA 2460 

TCCTACGCTT GATTAACAAT GTGATCAAAG CAGCTCACGC TGAAGGTAAA TGGGCTGGTA 2520 

TGTGTGGTGA GATGGCTGGT GACCAACAAG CTGTTCCACT TCTTGTCGGA ATGGGCTTGG 2580 

ATGAGTTCTC TATGTCAGCA ACATCTGTAC TTCGTACACG CAGCTTGATG AAGAAACTCG 2640 

ACACAGCTAA GATGGAAGAG TACGCAAACC GTGCCCTTAC AGAATGCTCA ACAATGGAAG 2700 

AAGTTCTTGA ACTTCAAAAA GAATACGTTA ATTTTGATTA ATCGAAAAGT CCCTGCAACT 2760 

CAGTTACAGG GATTTTTTTG AT ATT TT AAA AAGAATTTTC AAGAAAATCT TTCTTATAGA 2820 

AAGTCCAACC TTGAAAAAGT AGTGGTCAGA ACAAAAAATA CTTAAATGGT TCATAAAATT 2 880 

CTTGACAAGT TGGATATTTA GGAGTAAACT ATTAACCAGT TAAGTAATAG AGAGGAGTTT 2940 

CTGCAATTTA GAAATGAATT GCAACTAGAA ATATCAAATA GAAAGAGAGT TTCGATGAAA 3000 

ATTAATAAGA AATACCTTGT TGGTTCTGCG GCACTTTGAT TTTAAGTGTT TGTTCTTACG 3060 

AGTTGGGACT GTATCAAGCT AGAACGGTTA AGGAAAATAA TCGTGTTTCC TATATAGATG 3120 

GAAAACAAGC GACGCAAAAA ACGGAGAATT TGACTCCTGA TGAGGTTAGC AAGCGTGAAG 3180 

GAATCAATGC TGAGCAAATC GTCATCAAGA TAACAGACCA AGGCTATGTC ACTTCACATG 3240 

GCGACCACTA TCATTATTAC AATGGTAAGG TTCCTTATGA CGCTATCATC AGTGAAGAAT 3300 

TACTCATGAA AGATCCAAAC TATAAGCTAA AAGATGAGGA TATTGTTAAT GAGGTCAAGG 3360 

GTGGATATGT TATCAAGGTA GATGGAAAAT ACTATGTTTA CCTTAAGGAT GCTGCCCACG 3420 

CGGATAACGT CCGTACAAAA GAGGAAATCA ATCGACAAAA ACAAGAGCAT AGTCAACATC 3480 

GTGAAGGTGG AACTCCAAGA AACGATGGTG CTGTTGCCTT GGCACGTTCG CAAGGACGCT 3540 

ATACTACAGA TGATGGTTAT ATCTTTAATG CTTCTGATAT CATAGAGGAT ACTGGTGATG 3600 

CTTATATCGT TCCTCATGGA GATCATTACC ATTACATTCC TAAGAATGAG TTATCAGCTA 3660 

GCGAGTTGGC TGCTGCAGAA GCCTTCCTAT CTGGTCGAGG AAATCTGTCA AATTCAAGAA 3720 

CCTATCGCCG ACAAAATAGC GATAACACTT CAAGAACAAA CTGGGTACCT TCTGTAAGCA 3780 

ATCCAGGAAC TACAAATACT AACACAAGCA ACAACAGCAA CACTAACAGT CAAGCAAGTC 3840 

AAAGTAATGA CATTGATAGT CTCTTGAAAC AGCTCTACAA ACTGCCTTTG AGTCAACGAC 3900 
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ATGTAGAATC 


TGATGGCCTT 


GTCTTTGATC 


CAGCACAAAT 


CACAAGTCGA 


ACAGCTAGAG 


3960 


GTGTTGCAGT 


GCCACACGGA 


GATCATTACC 


ACTTCATCCC 


TTACTCTCAA 


ATGTCTGAAT 


4020 


TGGAAGAACG 


AATCGCTCGT 


ATTATTCCCC 


TTCGTTATCG 


TTCAAACCAT 


TGGGTACCAG 


4080 


ATTCAAGGCC 


AGAACAACCA 


AGTCCACAAC 


CGACTCCGGA 


ACCTAGTCCA 


GGCCCGCAAC 


4140 


CTGCACCAAA 


TCTTAAAATA 


GACTCAAATT 


CTTCTTTGGT 


TAGTCAGCTG 


GTACGAAAAG 


4200 


TTGGGGAAGG 


ATATGTATTC 


GAAGAAAAGG 


GCATCTCTCG 


TTATGTCTTT 


GCGAAAGATT 


4260 


TACCATCTGA 


AACTGTTAAA 


AATCTTGAAA 


GCAAGTTATC 


AAAACAAGAG 


AGTGTTTCAC 


4320 


ACACTTTAAC 


TGCTAAAAAA 


GAAAATGTTG 


CTCCTCGTGA 


CCAAGAATTT 


TATGATAAAG 


4380 


CATATAATCT 


GTTAACTGAG 


GCTCATAAAG 


CCTTGTTTGA 


AAATAAGGGT 


CGTAATTCTG 


4440 


ATTTCCAAGC 


CTTAGACAAA 


TTATTAGAAC 


-GCTTGAATGA 


TGAATCGACT 


AATAAAGAAA 


4500 


AATTGGTAGA 


TGATTTATTG 


GCATTCCTAG 


CACCAATTAC 


CCATCCAGAG 


CGACTTGGCA 


4560 


AACCAAATTC 


TCAAATTGAG 


TATACTGAAG 


ACGAAGTTCG 


TATTGCTCAA 


TTAGCTGATA 


4620 


AGTATACAAC 


GTCAGATGGT 


TACATTTTTG 


ATGAACATGA 


TATAATCAGT 


GATGAAGGAG 


4680 


ATGCATATGT 


AACGCCTCAT 


ATGGGCCATA 


GTCACTGGAT 


TGGAAAAGAT 


AGCCTTTCTG 


4740 


ATAAGGAAAA 


AGTTGCAGCT 


CAAGCCTATA 


CTAAAGAAAA 


AGGTATCCTA 


CCTCCATCTC 


4800 


CAGACGCAGA 


TGTTAAAGCA AATCCAACTG 


GAGATAGTGC 


AGCAGCTATT 


TACAATCGTG 


4860 


TGAAAGGGGA 


AAAACGAATT 


CCACTCGTTC 


GACTTCCATA 


TATGGTTGAG 


CATACAGT TG 


4920 


AGGTTAAAAA 


CGGTAATTTG 


ATTATTCCTC 


ATAAGGATCA 


TTACCATAAT 


ATTAAATTTG 


4980 


CTTGGTTTGA 


TGATCACACA 


TACAAAGCTC 


CAAATGGCTA 


TACCTTGGAA 


GATTTGTTTG 


5040 


CGACGATTAA 


GTACTACGTA 


GAACACCCTG 


ACGAACGTCC 


ACATTCTAAT 


GATGGATGGG 


5100 


GCAATGCCAG 


TGAGCATGTG 


TTAGGCAAGA 


AAGACCACAG 


TGAAGATCCA 


AATAAGAACT 


5160 


TCAAAGCGGA 


TGAAGAGCCA 


GTAGAGGAAA 


CACCTGCTGA 


GCCAGAAGTC 






AGACTGAAAA 


AGTAGAAGCC 


CAACTCAAAG 


AAGCAGAAGT 


TTTGCTTGCG 


AAAGTAACGG 


5280 


ATTCTAGTCT 


GAAAGCCAAT 


GCAACAGAAA 


CTCTAGCTGG 


TTTACGAAAT 


AATTTGACTC 


5340 


TTCAAATTAT 


GGATAACAAT 


AGTATCATGG 


CAGAAGCAGA 


AAAATTACTT 


GCGTTGTTAA 


5400 


AAGGAAGTAA 


TCCTTCATCT 


GTAAGTAAGG 


AAAAAATAAA 


CTAATGAAAA 


ATGAAAGTCT 


5460 


CGATAAAGAG 


GCTTTCATTT 


TTATTATGTA 


TATATGTAAA 


ATTCTTGACA 


AGCAATATTA 


5520 


AAAAGAGTAA 


ACTATTAACT 


AGTTAATTAA 


CCGGTTTATT 


ACTTTATAGT 


GAATCAAATA 


5580 


TACTTAAGAA 


AAGAGGAAAG 


AATGAAAATT 


AATAAAAAAT 


ATCTAGCAGG 


TTCAGTGGCA 


5640 


GTCCTTGCCC 


TAAGTGTTTG 


TTCCTATGAA 


CTTGGTCGTC 


ACCAAGCTGG 


TCAGGTTAAG 


5700 
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AAAGAGTCTA ATCGAGTTkC TTATATAGAT GGTGATCAGG CTGGTCAAAA GGCAGAAAAC . 5760 

TTGACACCAG ATGAAGTCAG TAAGAGGGAG GGGATCAACG CCGAACAAAT CGTCATCAAG 5820 

ATTAOGGATC AAGGTTATGT GACCTCTCAT GGAGACCATT ATCATTACTA TAATGGCAAG 5880 

GTCCCTTATG ATGCCATCAT CAGTGAAGAG CTCCTCATGA AAGATCCGAA TTATCAGTTG 5940 

AAGGATTCAG ACATTGTCAA TGAAATCAAG GGTGGTTATG TTATCAAGGT AGATGGAAAA 6000 

TACTATGTTT ACCTTAAGGA TGCAGCTCAT GCGGATAATA TTCGGACAAA AGAAGAGATT 6060 

AAACGTCAGA AGCAGGAACA CAGTCATAAT CACGGGGGTG GTTCTAACGA TCAAGCAGTA 6120 

GTTGCAGCCA GAGCCCAAGG ACGCTATACA ACGGATGATG GTTATATCTT CAATGCATCT 6180 

GATATCATTG AGGACACGGG TGATGCTTAT ATCGTTCCTC ACGGCGACCA TTACCATTAC 6240 

ATTCCTAAGA ATGAGTTATC AGCTAGCGAG TTAGCTGCTG CAGAAGCCTA TTGGAATGGG 6300 

AAGCAGGGAT CTCGTCCTTC TTCAAGTTCT AGTTATAATG CAAATCCAGC TCAACCAAGA 6360 

TTGTCAGAGA ACCACAATCT GACTGTCACT CCAACTTATC ATCAAAATCA AGGGGAAAAC 6420 

ATTTCAAGCC TTTTACGTGA ATTGTATGCT AAACCCTTAT CAGAACGCCA TGTGGAATCT 6480 

GATGGCCTTA TTTTCGACCC AGCGCAAATC ACAAGTCGAA CCGCCAGAGG TGTAGCTGTC 6540 

CCTCATGGTA ACCATTACCA CTTTATCCCT TATGAACAAA TGTCTGAATT GGAAAAACGA 6600 

ATTGCTCGTA TTATTCCCCT TCGTTATCGT TCAAACCATT GGGTACCAGA TTCAAGACCA 6660 

GAACAACCAA GTCCACAATC GACTCCGGAA CCTAGTCCAA GTCCGCAACC TGCACCAAAT 6720 

CCTCAACCAG CTCCAAGCAA TCCAATTGAT GAGAAATTGG TCAAAGAAGC TGTTCGAAAA 6780 

GTAGGCGATG GTTATGTCTT TGAGGAGAAT GGAGTTTCTC GTTATATCCC AGCCAAGGAT 6840 

CTTTCAGCAG AAACAGCAGC AGGCATTGAT AGCAAACTGG CCAAGCAGGA AAGTTTATCT 6900 

CATAAGCTAG GAGCTAAGAA AACTGACCTC CCATCTAGTG ATCGAGAATT TTACAATAAG 6960 

GCTTATGACT TACTAGCAAG AATTCACCAA GATTTACTTG ATAATAAAGG TCGACAAGTT 7020 

GATTTTGAGG CTTTGGATAA CCTGTTGGAA CGACTCAAGG ATGTCyCAAG TGATAAAGTC 7080 

AAGTTAGTGG ATGATATTCT TGCCTTCTTA GCTCCGATTC GTCATCCAGA ACGTTTAGGA 7140 

AAACCAAATG CGCAAATTAC CTACACTGAT GATGAGATTC AAGTAGCCAA GTTGGCAGGC 7200 

AAGTACACAA CAGAAGACGG TTATATCTTT GATCCTCGTG ATATAACCAG TGATGAGGGG 7260 

GATGCCTATG TAACTCCACA TATGACCCAT AGCCACTGGA TTAAAAAAGA TAGTTTGTCT 7320 

GAAGCTGAGA GAGCGGCAGC CCAGGCTTAT GCTAAAGAGA AAGGTTTGAC CCCTCCTTCG 7380 

AC AG AC CATC AGGATTCAGG AAATACTGAG GCAAAAGGAG CAGAAGCTAT CTACAACCGC 7440 
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GTGAAAGCAG CTAAGAAGGT GCCACTTGAT CGTATGCCTT ACAATCTTCA ATATACTGTA 7500 

GAAGTCAAAA ACGGTAGTTT AATCATACCT CATTATGACC ATTACCATAA CATCAAATTT 7560 

GAGTGGTTTG ACGAAGGCCT TTATGAGGCA CCTAAGGGGT ATACTCTTGA GGATCTTTTG 7620 

GCGACTGTCA AGTACTATGT CGAACATCCA AACGAACGTC CGCATTCAGA TAATGGTTTT 7680 

GGTAACGCTA GCGACCATGT TCGTAAAAAT AAGGTAGACC AAGACAGTAA ACCTGATGAA 7740 

GATAAGGAAC ATGATGAAGT AAGTGAGCCA ACTCACCCTG AATCTGATGA AAAAGAGAAT 7800 

CACGCTGGTT TAAATCCTTC AGCAGATAAT CTTTATAAAC CAAGCACTGA TACGGAAGAG 7860 

ACAGAGGAAG AAGCTGAAGA TACCACAGAT GAGGCTGAAA TTCCTCAAGT AGAGAATTCT 7920 

GTTATTAACG CTAAGATAGC AGATGCGGAG GCCTTGCTAG AAAAAGTAAC AGATCCTAGT 7980 

ATTAGACAAA ATGCTATGGA GACATTGACT GGTCTAAAAA GTAGTCTTCT TCTCGGAACG 8040 

AAAGATAATA ACACTATTTC AGCAGAAGTA GATAGTCTCT TGGCTTTGTT AAAAGAAAGT 8100 

CAACCGGCTC CTATACAGTA GTAAAATGAA TGGAGCATAT TTTATGGAGA AGTAACCTTT 8160 

CGTGTTACTT CTCTTTTTTA GAAAAACGTA ACAGA 8195 



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2004 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

TTTACTAAAA GGAAAAAAGA ACTGATTTCT CAGTCCTTCA TTAATCTTAT TCCACACTAA 60 

ATAGGTATGG GTAAACAGGT TGTTGACCTT GGTGAATCTC GACTTCAACG TCTTCGAATT 120 

CTTCTACGAT TTCTTGAGCG ATTTCATTGG CAAGTTCTTC GCTTCCGTCT TCACCTACAT 180 

AGAAGGTTAC GATTTCACTG TCTTCATCCA ACATATGTTT CAAGGTTTCA GTCAATGT IT 240 

GGTGCATATC AGGGTTTGAC ACAAGAATTT TTCCATCCAC CATACCTAAA TTATCGTTTT 300 

CATGGATTTC TAAGCCATCG ATCGTTGTAT CACGCACGGC TGTTGTGACG CTTCCGCTAA 360 

CGACATCGCT AAGAGCAGCT GTCATACGCT CTTGGTTTTC TTCAATGGAC TTGCTTGGAT 420 

CAAAGGCAAG AAGACTTGTC ATACCTTGAG GAAGAGTGCG AGCCTCTACC ACTACCGCTG 480 

GTTGCTCCAA AACTTCTGCC GCAGATTGAG CTGCCATGAA GATGTTCTTG TTGTTTGGCA 540 

AGAAGATGAT GTTACGGGCA TTAACCTGTT CAACAGCCTT GATAAAGTCT TCTGTTGAAG 600 

GGTTCATGGT TTGACCGCCT TCGATAACAT AATCCACGCC TTGAGAACAG AAGATATCTG 660 
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CTAGACCTTT ACCAGCCACC ACAGCAATCA AAGCATACTC TTTTTCTTCA GCCGACTTGA 720 

TAACTTGAGT AGCTTCTTTC TCAACCTGTG CTTCGTGTTG GTTACGCATA TTGTCAACTT 780 

TTACCTTGAC CAAGCTACCA TATTTGAGAC CTTCTTGCAT AACAAGTCCT GGATCTTCTG 840 

TATGAACATG GACTTTGACA ATTTCATCAT CGTTAACAAC AAGGAGAGAA TCTCCAAGCT 900 

CATCCAAGTA GTTACGGAAT TCATCGTAGT CAAAATCTTT AGCATAGGTT GGACCTTGCT 960 

TAAGAGCTAC CATGATTTCA GTACAGTAAC CAAACGTGAT GTCCTCAGTC GCTACGTGAC 1020 

CAGCTACAGA CTTATGATGC TCTACATTGA TCATCTCACT CATGTTGGCA GGAGTCGCTA 1080 

CAAAGTCCTC AGATGCAATA TATTCGCCAG TAAGGGCTGA AAGGAAACCT TCGTAGATGA 1140 

AGACCAATCC TTGACCACCT GAGTCCACAA CGCCAACTTC TTTCAATACT GGAAGCATGT 1200 

CTGGTGTTTT AGCTAGAGCT GTTTTAGCAC CTTCCAAGGC TGCGCGCATG ACTTCAACAG 1260 

CGTCATCTGT TTGCTCAGCT TTTTTCTTAG CACCGATAGC AGCTCCACGA GAAACTGTTA 1320 

AAATCGTTCC TTCAACAGGT TTCATCACTG CCTTATAGGC AACTTCCACA CCTGATTGGA 1380 

AGGCCAGAGC CAAGTCTTGA CCTGTTAACT CGTCTTTATC CTTGATAGCT TGGGAAAATC 1440 

CACGGAAAAG CTGAGACGTA ATCACTCCTG AGTTCCCACG CGCACCCATC AAAAGCCCTT 1500 

TGGCAAGAAT GCTCGCTACT TCTCCAACTG TAGAAGCTGG CTTGTCTGCA ACTTCTTTAG 1560 

CACCATTTTC AATGGTCATT CCCATATTTG TCCCAGTATC TCCATCTGGA ACTGGAAAGA 1620 

CGTTTAATGA ATTGACATAT TCAGCTTGCT TATTCAAGCG AGTTGATGCA GCCTGCACCA 1680 

TTTCTTGAAA TAAGCTAGTA GTAATTTTTG ACACGGTTAT TCTCCTACAA CTTTGATATT 1740 

TTGAATGTAG ACATTTACAG TCTGAGCAGT AATTCCAAGC TGGTTTTCCA AGCTAAAGGC 1800 

AACACGCTCT TGAATGTTTT TTGACACTTC ACTAATCTTT GTTCCGTAGC TTAACACGGT 1860 

ATATACATCA ACTGCAATAC TGCCATCTTC GGCTGCCTTT ACGACGACAC CTTTAGAATA 1920 

ATTTTCCTTA CCTAGCAGGG CTTGGAAATT ATCTTTGAGG GCATTTTTAC TAGCCATACC 1980 

GACCACACCA GAAATCTCAG TTGC 2004 



(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11915 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear. 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
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CCGGGTTGGG CTGTTCGCCC ATTAAAGCGG CACCACAGCT GGGTTCAGAA CGTCGTGAGA 60 

CAGTTCGGTC CCTATCCGTC GCGGGCGTAG GAAATTTGAG AGGATCTGCT CCTAGTACGA 120 

GAGGACCAGA GTGGACTTAC CGCTGGTGTA CCAGTTGTCT TGCCAAAGGC ATCGCTGGGT 180 

AGCTATGTAG GGAAGGGATA AACGCTGAAA GCATCTAAGT GTGAAACCCA CCTCAAGATG 240 

AGATTTCCCA TGATTATATA TCAGTAAGAG CCCTGAGAGA TGATCAGGTA GATAGGTTAG 300 

AAGTGGAAGT GTGGCGACAC ATGTAGCGGA CTAATACTAA TAGCTCGAGG ACTTATCCAA 360 

AGTAACTGAG AATATGAAAG CGAACGGTTT TCTTAAATTG AATAGATATT CAATTTTGAG 420 

TAGGTATTAC TCAGAGTTAA GTGACGATAG CCTAGGAGAT ACACCTGTAC CCATGCCGAA 480 

CACAGAAGTT AAGCCCTAGA ACGCCGGAAG TAGTTGGGGG TTGCCCCCTG TGAGATAGGG 540 

AAGTCGCTTA GCTCTAGGGA GTTTAGCTCA GCTGGGAGAG CATCTGCCTT ACAAGCAGAG 600 

GGTCAGCGGT TCGATCCCGT TAACTCCCAT TTTAGCGGGT GTAGTTTAGT GGTAAAACTA 660 

CAGCCTTCCA AGCTGTTGTC GCGAGTTCGA TTCTCGTCAC CCGCTTTGAA CTTTGTTCTT 720 

TGTACCAAGT TTTTGACTTG GGCGCGTAGC TCAGGTGGTT AGAGCGCACG CCTGATAAGC 780 

GTGAGGTCGG TGGTTCGAGT CCACTCGTGC CCATAGTGTT TAGTCCATTA CTAGGGGATT 840 

GGAATATTAT CTGTTCACTA AGAGGACACG GGCTTGTTCC CGTATAAACT ATTTTGGAGG 900 

ATTACCCAAG TCCGGCTGAA GGGAACGGTC TTGAAAACCG TCAGGCGTGT AAAAGCGTGC 960 

GTGGGTTCGA ATCCCACATC CTCCTTTTAT ATTAACGCGG GATGGAGCAG CTCGGTAGCT 1020 

CGTCGGGCTC ATAACCCGAA GGTCGTAGGT TCAAATCCTG CTCCCGCAAT AAGGCTCGGT 1080 

AGCTCAGTTG GTAGAGCAAT GGATTGAAGC TCCATGTGTC GGCGGTTCGA TTCCGTCTCG 1140 

CGCCATTTAT ATATTTTGGA AGGGTAGCGA AGAGGCTAAA CGCGGCGGAC TGTAAATCCG 1200 

CTCCTTCGGG TTCGGGGGTT CGAATCCCTC CCCTTCCATT TTACGGGCAT AGTTTAAAGG 1260 

TAGAACTAAG GTCTCCAAAA CCTTCAGTGT GGGTTCAATT CCTACTGCCC GTGTTAATAG 1320 

AATTATGGCG GGTGTGGTGA AGTGGTTAAC ACACCAGATT GTGGCTCTGG CATGCGTGGG 1380 

TTCGATCCCC ATCACTCGCC TATTTTATAT TGGGGTATAG CCAAGCGGTA AGGCAAGGGA 1440 

CTTTGACTCC CTCATGCGTT GGTTCGAATC CAGCTACCCC AGTTACTATT TGCCGGCGTG 1500 

GCGGAATTGG CAGACGCGCT GGACTCAAAA TCCAGTGTCC GCAAGGACGT GCCGGTTCGA 1560 

CCCCGGCCGC CGGTATAGTA TAGTGTTAGG AACGTTGTTA TTCTTCGTTC CTTTTTTATA 1620 

TTATTTTTGG TATAATTATA GTTATTCAAA TTTTATTTAG ATTAAGAAAG TGTAGGGGAG 1680 

TATGTCTTGT TCTATCGATT TATTAAAACA TCGGTATTTG AAAAATATTA AAGAAAATCC 1740 

TGAATTGTTT GTCGGAATTG AGTTGGAGTA TCCTGTTGCA AGTTTAGAAG GGGATGCTAC 1800 
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AGATGTTGAA GTTATGAAGG ATCTATTTCA TTATTTAGTT TCTACTTTGG ATCTCACCGT I860 

AGCAAAGGTA GATGATTTTG GCAATCTGAT CCAGTTAGTA GATCCGATAA GTCAGGATGC 1920 

TATTTTATTT GAAGTTTCCT ATACAACGAT TGAGTTTGCA TTTGGTAAGG CTGAAACGAT 1980 

TCAAGAGGTC GAAAATCGTT TCAATAATTA TATGAATGTA ATTCAGAGAA AGTTAGCTGA 2040 

ATCAAATCAT GCTATTGTTG GCTGTGGTAT CCATCCCAAC TGGGATAAAA ATGAGAATTG 2100 

TCCAGTGGCT TATCCACGCT ATCAGATGTT GATGGATTAT TTGAATTTGA GTAGAAATAT 2160 

TATTAAATCA GATTTACATC ATTTCCCTGA ATATGGTACT TTTATCTGTG GGAGCCAGGT 2220 

TCAGCTGGAT ATTTCAAAAA CCAACTACTT ACGGGTGATT AATGCTTTTA CTCAAATTGA 2280 

AGCGGCTAAG GCTTATTTAT TTGCAAACTC TGAATTTTCG GGTGCGGATT GGGATACGAA 2340 

AATTTCAAGG GATATTTTCT GGGAAGAATC TATGCATGGT ATCTATCCAG AGAATGTTGG 2400 

GGTCAATGCT AGACTCCTTA ATGATGAAAC TGATTTTTTT GACTATCTAA ATCATTCTGC 2460 

GATTTTTACT GCGGAACGTG ATGGGCAGAC CTATTATTTT TATCCTATTC AGGCTGGGGA 2520 

CTATTTGGCT ACGTCCGAAA TCCAAGCATT TGCTCTGAAT GGGGATGAGG TTATTATTTA 2580 

CCCCCAAGAG AAGGATTTTG AAACTCATCG TAGTTACCAG TACCAAGATT TAACGACTCG 2640 

AGGAACAGTT GAGTTTCGTA GTGTGTGTAC ACAGCCACTT GATAGGACTT TTGCTTCTGC 2700 

AGCTTTTCAC TTGGGATTAT TGGTTAATTT AGACAAGTTA GAAGCTTACT TAGAAACAGC 2760 

ACCTTTCTTT AAAGTATTTG GTTATGATTA CAAGTCTTTA AGGAGACAAT TTTCTAAGAA 2820 

AAATCTTACA GATGAGGAAG AAACTACGAT TATTGAATTT TCCAAAGACT TACTCCTACT 2880 

AGCTGAGGAG GGACTAGTGG TGAGAAATAA GGAAGAAATG ACCTATTTAC AGCCTTTGAG 2940 

AGAAGAATTG AGCCTATAAT TTCTCTTATA AAGGGAGAAT TTTCTGAAAA ATCATGATAT 3000 

AATGGACGAG ACTATAGATA AAGGATAGAG AGTAATGACA TTAGTTTATC AATCAACGCG 3060 

TGATGCCAAC AATACAGTAA CTGCCAGCCA AGCAATTTTG CAAGGTTTGG CGACGGACGG 3120 

CGGTTTGTTT ACACCGGATA CTTATCCAAA GGTAGATTTG AACTTTGACA AATTGAAAGA 3180 

TGCTTCTTAC CAGGAAGTTG CTAAGCTAGT TTTGTCAGCA TTTTTAGATG ACTTTACAGT 3240 

TGAGGAGTTG GACTACTGTA TCAACAATGC CTACGATAGC AAATTTGATA CTCCAGCTAT 3300 

TGCACCATTA GTGAAATTAG ATGGGCAATA CAATTTGGAA CTTTTCCATG GTTCAACGAT 3360 

TGCCTTTAAG GATATGGCCT TGTCTATTTT GCCATACTTT ATGACGACTG CTGCTAAGAA 3420 

ACATGGTTTG GAGAACAAGA TTGTTATCTT GACAGCGACA TCTGGTGACA CGGGGAAAGC 3480 

TGCTATGGCG GGGTTTGCGA ATGTGCCTGG TACTGAGATT ATCGTCTTTT ATCCAAAGGA 3540 
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TGGTGTCAGC AAGATTCAAG AGTTACAAAT GACCACTCAG ACTGGCGACA ATACTCATGT 3600 

TATTGCTATT GATGGTAACT TTGACGATGC GCAAACAAAT GTGAAGCACA TGTTTAACGA 3660 

CGTGGCTCTT CGTGAAAAAT TGACTACCAA CAAGTTGCAA TTTTCATCAG CTAACTCTAT 3720 

GAACATTGGT CGTCTGGTGC CACAAATTGT TTATTATGTT TATGCTTACG CTCAATTGGT 3780 

TAAGACTGGT GAAATTGTAG CTGGTGAAAA GGTTAACTTC ACAGTACCAA CAGGAAACTT 3840 

TGGAAATATC TTGGCTGCCT TTTATGCCAA ACAAATTGGT TTGCCAGTTG GTAAATTAAT 3900 

CTGTGCTTCA AATGACAACA ATGTTTTGAC AGACTTCTTT AAAACACGTG TCTATGACAA 3960 

AAAACGTGAG TTTAAGGTAA CAACCAGCCC ATCTATGGAT ATCTTGGTAT CTTCAAACTT 4020 

GGAGCGCTTG ATTTTCCATC TTTTGGGAAA TAATGCTGAA AAGACAACTG AACTTATGAA 4080 

TGCCTTGAAC ACGCAAGGAC AATATAAGTT GACAGACTTT GATGCAGAGA TTTTGGACCT 4140 

CTTTGCAGCT GAATATGCGA CTGAGGAAGA AACGGCAGCA GAGATCAAGC GTGTTTGTGA 4200 

GTTAGATTCT TATATCGAGG ACCCTCATAC AGCTGTTGCT TCAGCAGTTT ATAAAAAATA 4260 

CCAATCGGCC ACTGGAGATG TAACTAAGAC AGTGATTGCT TCAACAGCTA GTCCATACAA 4320 

GTTCCCAGTA GTTGCAGTAG AAGCTGTAAC TGGAAAAGCA GGTTTAACAG ACTTTGAAGC 4380 

CTTGGCTCAA TTACATGAAA TCTCAGGCGT TGCAGTGCCA CCAGCAGTTG ATGGGCTTGA 4440 

AATAGCTCCA ATTCGTCACA AGACAACAGT GGCAGCTGCT GACATGCAAG CAGCGGTTGA 4500 

GGCTTATTTA GGACTTTAAG ACAGAGGGAG CAAACTCGGT TGGGAAACCA ACTGAGTTTC 4560 

TTTTCATCAG GAGGAGAGAT TGTTTAAGAA AAATAAAGAC ATTCTTAATA TTGCATTGCC 4620 

AGCTATGGGT GAAAACTTTT TGCAGATGCT AATGGGAATG GTGGACAGTT ATTTGGTTGC 4680 

TCATTTAGGA TTGATAGCTA TTTCAGGGGT TTCAGTAGCT GGTAATATTA TCACCATTTA 4740 

TCAGGCGATT TTCATCGCTC TGGGAGCTGC TATTTCCAGT GTTATTTCAA AAAGCATAGG 4800 

GCAGAAAGAC CAGTCGAAGT TGGCCTATCA TGTGACTGAG GCGTTGAAGA TTACCTTACT 4860 

ATTAAGTTTC CTTTTAGGAT TTTTGTCCAT CTTCGCTGGG AAAGAGATGA TAGGACTT r iT 4920 

GGGGACGGAG AGGGATGTAG CTGAGAGTGG TGGACTGTAT CTATCTTTGG TAGGCGGATC 4980 

GATTGTTCTC TTAGGTTTAA TGACTAGTCT AGGAGCCTTG ATTCGTGCAA CGCATAATCC 5040 

ACGTCTGCCT CTCTATGTTA GTTTTTTATC CAATGCCTTG AATATTCTTT TTTCAAGTCT 5100 

AGCTATTTTT GTTCTGGATA TGGGGATAGC TGGTGTTGCT TGGGGGACAA TTGTGTCTCG 5160 

TTTGGTTGGT CTTGTGATTT TGTGGTCACA ATTAAAACTG CCTTATGGGA AGCCAACTTT 5220 

TGGTTTAGAT AAGGAACTGT TGACCTTGGC TTTACCAGCA GCTGGAGAGC GACTTATGAT 5280 

GAGGGCTGGA GATGTAGTGA TCATTGCCTT GGTCGTTTCT TTTGGGACGG AGGCAGTTGC 5340 
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TGGGAATGCA ATCGGAGAAG TCTTGACCCA GTTTAACTAT ATGCCTGCCT TTGGCGTCGC 5400 

TACGGCAACG GTCATGCTGT TGGCCCGAGC AGTTGGAGAG GATGATTGGA AAAGAGTTGC 5460 

TAGTTTGAGT AAACAAACCT TTTGGCTTTC TCTGTTCCTC ATGTTGCCCC TGTCCTTTAG 5520 

TATATATGTC TTGGGTGTAC CATTAACTCA TCTCTATACG ACTGATTCTC TAGCGGTGGA '5580 

GGCTAGTGTT CTAGTGACAC TGTTTTCACT ACTTGGGACC CCTATGACGA CAGGAACAGT 5640 

CATCTATACG GCAGTCTGGC AGGGATTAGG AAATGCACGC CTCCCTTTTT ATGCGACAAG 5700 

TATAGGAATG TGGTGTATCC GCATTGGGAC AGGATATCTG ATGGGGATTG TGCTTGGTTG 5760 

GGGCTTGCCT GGTATTTGGG CAGGGTCTCT CTTGGATAAT GGTTTTCGCT GGTTATTTCT 5820 

ACGCTATCGT TACCAGCGCT ATATGAGCTT GAAAGGATAG GAAATGCAAA AAACAGCTTT 5880 

TATTTGGGAT TTAGACGGGA CTTTATTGGA CTCTTACGAA GCGATTTTAT CAGGGATTGA 5940 

GGAGACTTTT GCTCAGTTTT CTATTCCTTA TGATAAGGAG AAGGTGAGAG AGTTTATCTT 6000 

CAAGTATTCG GTGCAAGATT TGCTTGTGCG GGTGGCAGAA GATAGAAATC TGGATGTTGA 6060 

GGTGCTAAAT CAGGTGCGTG CCCAGAGTCT GGCTGAGAAG AATGCTCAGG TAGTTTTGAT 6120 

GCCAGGTGCG CGTGAGGTGC TAGCTTGGGC AGACGAATCA GGAATTCAGC AGTTTATATA 6180 

TACTCATAAG GGGAACAACG CTTTTACCAT TCTCAAGGAC TTGGGGGTGG AATCCTATTT 6240 

TACAGAGATT TTAACCAGTC AGAGTGGCTT TGTGCGGAAG CCAAGTCCAG AAGCGGCTAC 6300 

CTATCTGCTA GATAAGTATC AGTTGAATTC TGATAATACT TATTATATAG GGGATCGGAC 6360 

TCTGGATGTG GAATTTGCCC AGAATAGTGG GATTCAAAGC ATCAACTTTT TAGAGTCTAC 6420 

TTATGAAGGG AATCACAGGA TTCAAGCGTT AGCAGATATT TCCCGTATTT TTGAGACTAA 6480 

GTGATAAAAA GATTGTGTCA GTTTTGTGAC AGAGACCTAA CAAACTATTT CAAGTAACCT 6540 

AGTTTGTTAC AAGGAATAGA CAGTTCTGTT AAATAGGCCC GAGAGGGCTT TTTTTCTACA 6600 

TTTTTTGTGT TATGATAGAC AGGTACTCAT TTGAAAGGAA TTTGAAAGAA TGAAGAAAAG 6660 

AATGTTATTA GCGTCAACAG TAGCCTTGTC ATTTGCCCCA GTATTGGCAA CTCAAGCAGA 6720 

AGAAGTTCTT TGGACTGCAC GTAGTGTTGA GCAAATCCAA AACGATTTGA CTAAAACGGA 6780 

CAACAAAACA AGTTATACCG TACAGTATGG TGATACTTTG AGCACCATTG CAGAAGCCTT 6840 

GGGTGTAGAT GTCACAGTGC TTGCGAATCT GAACAAAATC ACTAATATGG ACTTGATTTT 6900 

CCCAGAAACT GTTTTGACAA CGACTGTCAA TGAAGCAGAA GAAGTAACAG AAGTTGAAAT 6960 

CCAAACACCT CAAGCAGACT CTAGTGAAGA AGTGACAACT GCGACAGCAG ATTTGACCAC 7020 

TAATCAAGTG ACCGTTGATG ATCAAACTGT TCAGGTTGCA GACCTTTCTC AACCAATTGC 7080 
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AGAAGTTACA AAGACAGTGA TTGCTTCTGA AGAAGTGGCA CCATCTACGG GCACTTCTGT 7140 

CCCAGAGGAG CAAACGACCG AAACAACTCG CCCAGTTGAA GAAGCAACTC CTCAGGAAAC 7200 

GACTCCAGCT GAGAAGCAGG AAACACAAGC AAGCCCTCAA GCTGCATCAG CAGTGGAAGT 7260 

AACTACAACA AGTTCAGAAG CAAAAGAAGT AGCATCATCA AATGGAGCTA CAGCAGCAGT 7320 

TTCTACTTAT CAACCAGAAG AGACGAAAAT AATTTCAACA ACTTACGAGG CTCCAGCTGC 7380 

GCCCGATTAT GCTGGACTTG CAGTAGCAAA ATCTGAAAAT GCAGGTCTTC AACCACAAAC 7440 

AGCTGCCTTT AAAGAAGAAA TTGCTAACTT GTTTGGCATT ACATCCTTTA GTGGTTATCG 7500 

TCCAGGAGAC AGTGGAGATC ACGGAAAAGG TTTGGCTATC GACTTTATGG TACCAGAACG 7560 

TTCAGAATTA GGGGATAAGA TTGCGGAATA TGCTATTCAA AATATGGCCA GCCGTGGCAT 7620 

TAGTTACATC ATCTGGAAAC AACGTTTCTA TGCTCCATTC GATAGCAAAT ATGGGCCAGC 7680 

TAACACTTGG AACCCAATGC CAGACCGTGG TAGTGTGACA GAAAATCACT ATGATCACGT 7740 

TCACGTTTCA ATGAATGGAT AAACCCGACT TGATAACATC ATTTTGACGA ATGAGATCTA 7800 

GCTTTCGTGA TGGAAAGCGA TTCTCGTTCG TTTTTTCTTT GTCATACTCT TCGAAAATCT 7860 

CTTCAAACCA CGTCAGTTTT ATCTGAAACT TCAAAGCTGT GCTTTGAGCA ACCTGCGACT 7920 

AGCTTCCTAG TTTGCTTTTT GATTTTCATT GAGTATCAAT TTGAATGGAA AATGGAAAGT 7980 

TATCATCTTG TAATGAGTTA AGCAACATTC TTGCAATCTA TTTTACTTTA TATCACAATT 8040 

AATTAGTCAA ATATTGATAA ATCAATAAAA AGAGAGGGGA AGAAATGCTA GAGATTCAAG 8100 

ATTTACTGTA TCAACTCCGC TTGTCTGAGC AAGCGAGTAC GCAATTGTTT GAAAAAAGGC 8160 

TTGGGATTAG TTTGACACGG TATCAGATTT TACTGTTTTT GCTGGAGCAT TCTCCTTGTA 8220 

ACCAAATGGC GGTTCAGGAG CGTTTGAAAA TTGATCAGGC TGCTTTGACA CGGCATTTCA 8280 

AAATTTTGGA AACGGAAGGT . TTGGTGGAGC GTCATCGTAA TCCTGAAAAT CAGCGGGAAG 8340 

TGTTGGTAGA GGCTGCGAAG TATGCCAAGG AGCAGTTAGT GGTGAATCCC CCTCTGCAAC 8400 

ATATCAGGGT TAAGGAAGAG ATAGAAAGTA TCTTAACAGA GTTTGAGAGA ACAGAACTCA 8460 

GCCGTTTATT AAATAAATTG GTTTTGGGTA TTGAAAATAT AGAAATTTAA GGAGAAATAG 8520 

ATGTCAATTA TTTTAACAAC GATCGTTGCT TTGGAGCATT TTTACATTTT TTATTTGGAA 8580 

AGTATTGCCA CGCAATCAGA TGCGACTAGT CGTGTATTTA ATATGGAAAA GGAAGAATTG 8640 

GCTCATCCGT CAGTAAGTTC ATTGTTCAAA AATCAAGGAA TTTATAAGGC TCTGCTAGGA 8700 

GTCTTTCTCT TGTATGTCAT TTATTTCTCA CAGAATTTAG AAATTGTGAC TATTTTTGTC 8760 

TTATTTGTGA TTGGTGCTGC GACTTACGGC TCTTTAACAG CGGATAAAAA AATTATTTTG 8820 

AAACAAGGTG GATCAGCTAT TTTGGCCTTG ATTAGTATTT TACTCTTTAA ATACACTTGA 8880 
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AGGTCGATTC TAATCTCGCT AATCCTTTTT AATCCAGAAT AAGGGAAATA TGTTATACTT 8940 

GTTTTTAAGA AAAAAGTCTC ATTGAATTGG TTTTGAGGAG TTAGAAATGA AAGTATTAGT 9000 

GACAGGTTTT GAGCCCTTTG GAGGGGAAAA GGGCAATCCA GCTTTGGAGG CCATTAAAGG 9060 

TTTACCAGCT GAAATCCATG GTGCTGAGGT CCGTTGGCTA GAGGTGCCGA CAGTTTTTCA 9120 

CAAATCTGCT CAAGTATTGG AAGAAGAGAT GAATCGTTAT CAACCTGACT TTGTCCTTTG 9180 

TATTGGGCAA GCTGGTGGAA GAACTAGTTT GACACCTGAA CGAGTGACCA TTAATCAAGA 9240 

CGATGCATGC ATTTCTGATA ACGAAGATAA TCAACCGATT GACCGTCCCA TTCGCCCAGA 9300 

TGGTGCTTCG GCCTACTTTA GTAGTTTGCC GATTAAAGCG ATGGTTCAAG CTATAAAAAA 9360 

AGAGGGCTTA CCGGCCTCTG TTTCCAATAC GGCAGGGACT TTTGTCTGCA GCCATTTGAT 9420 

GTATCAGGCT CTCTATTTGG TAGAAAAGAA ATCTCCATAT GTTAAGGCAG GTTTTATGCA 9480 

TATTCCTTAT ATGATGGAAC AGGTGGTGAA CAGACCGACT ACTCCAGCTA TGAGTTTAGT 9540 

GGATATTCGG CGAGGGATAG AAGCAGCAAT CGGCGCTATA ATAGAACATG GAGATCAGGA 9600 

ACTCAAGTTG GTAGGCGGAG AAACTCATTG ATAGAAAAAA GCTTG AGGGG AAAAACCTTC 9660 

AAGCTTTTGG ACGTTTTCGG GCCAATACTG CTCGGTAAAA CATAATTTTA GTGCATTGGA 9720 

TATAAGGTAG GAGTGAAAAA CTAGCAATGC CAAAGGTAAT CCAATTGAGG AAGTACCAAG 9780 

GAAGAAGCTG TAAATCTAGG ACAAAGTGCT GGAACTTGTA GCCCTTCATA AAGGAACGGC 9840 

TAGTTTTTAG GATTCGTCTT GGTGGGACCT GTCCTAGGTC TAGACTATAA CAGAGAAGAA 9900 

ATTCCACCTG TGAATAGGCA TAATACTGTG GAATATAGAG GATATTTCCT ACAATGATCA 9960 

AGATGAGACT TGCAAGAAAG TAGAGTCCAA AGACCATGAG GAAACGCTCG GTTTCAACTG 10020 

ATGAGAGATC TAGATTTGGA AACTCAGGAT GTAGGGTGAC GAATTTTTTG GCTAAAAAGC 10080 

TACTATAAAA GAGGAGGTAA ATCCCAAGTA AATTAGGGAT ACTCCATAAA AAGAGATAGA 10140 

AACGTTTGAG AAGTAGGGTC AAAAAGGTTT GAGAAAAGCG CTCCTCATCA AAGAGAGCTA 10200 

GGCTGTTTTT TACAGATGGC TCCGTTTTAG AATCTTTCAT GAGTGTCAGT GTTGCATAGA 10260 

CGGAACTGGT CAAAAGAATA GTCCCGATAA AGGAGACTAG TAGAGGAAAG AGGTAGGTTT 10320 

GAAGTATTTG GCCAAGTATG CTGAAAAATG GCTGTTCTAA AACAGTCCCG TGGATCCGAG 10380 

ATAAGGGATT AAGAAAACCA GATAAGATGA CCAGCATACT GGGAAGGATA TAGAGGAGAA 10440 

AGAGACGGGG GGTGTCAGCC TGAAAATGTT TTGACTCCTG ACGAATTGTT TTTAAATCAA 10500 

TTTTTGGATA GTTCATTCTC TTATTATACC ATAGTTCTTA TACATAGTTC GTGACAGTTC 10560 

CTACTTTTTT TGATAAAATC ATACAGTGTG TCCTTGGGCA CACTGTATGA ACTGGGACTG 10620 
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TCTTTCCCAG CTTCGGAGGT AAAAAATGTC AGATTCACCA ATCAAATATC GTTTGATTAA 10680 

GAAAGAAAAA CACACAGGAG CTCGTCTGGG AGAAATCATC ACTCCCCACG GTACCTTTCC 10740 

GACACCTATG TTTATGCCAG TTGGGACACA AGCCACTGTC AAAACTCAGT CACCTGAAGA 10800 

ATTGAAGGAG ATGGGTTCGG GAATTATCCT ATCAAACACC TATCATCTCT GGCTTCGCCC 10860 

TGGAGATGAA CTCATTGCAC GCGCTGGTGG TCTCCACAAG TTCATGAATT GGGACCAGCC 10920 

TATCTTGACA GATAGTGGTG GTTTTCAGGT TTATTCTTTA GCAGATAGCC GTAATATCAC 10980 

AGAAGAAGGA GTAACCTTTA AAAATCATCT AAATGGTTCT AAGATGTTCC TATCCCCAGA 11040 

AAAAGCCATC TCTATTCAGA ATAATCTGGG TTCAGACATC ATGATGTCCT TTGATGAATG 11100 

TCCTCAGTTT TATCAACCTT ATGACTACGT TAAGAAATCG ATCGAGCGTA CCAGCCGTTG 11160 

GGCTGAGCGT GGTTTGAAGG CTCACCGTCG TCCACATGAC CAAGGTTTGT TTGGAATTGT 11220 

GCAAGGTGCA GGATTTGAAG ACCTTCGCCG CCAATCAGCT CATGATCTTG TCAGCATGGA 11280 

TTTCTCAGGC TACTCTATCG GTGGTTTGGC AGTGGGAGAA ACCCATGAAG AGATGAATGC 11340 

GGTCTTGGAC TTTACAACTC AACTGCTGCC TGAAAATAAA CCTCGTTATC TGATGGGTGT 11400 

GGGAGCGCCA GATAGCTTGA TCGATGGGGT CATTCGTGGG GTGGATATGT TTGACTGTGT 114 60 

CTTACCGACT CGAATTGCTC GTAACGGGAC TTGTATGACC AGTCAAGGAC GTTTGGTTGT 11520 

GAAAAATGCC CAGTTTGCTG AGGACTTTAC GCCACTGGAT CCTGAGTGTG ATTGCTACAC 11580 

ATGTAATAAC TATACACGCG CTTACCTTCG TCACCTGCTC AAGGCTGATG AAACCTTTGG 11640 

TATCCGCTTG ACTAGCT AC C ACAATCTTTA CTTCTTGCTT AACCTGATGA AGCAAGTGCG 11700 

ACAAGCCATC ATGGATGACA ATCTCTTGGA ATTCCGTGAG TATTTTGTGG AAAAATATGG 11760 

CTATAATAAG TCAGGACGTA ATTTCTAAAA TGGAATTGAT ATAAAAAAAT CCTAAGTTTT 11820 

CTCTTAGGAT TTTTCTTCTT TTTTTGATAG AATAAAGTGT ACAATGAAAG GAAGAATAAA 11880 

CTCGTATGCG CATTAAATGG TTTTCCTCGA TTAGG 11915 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9069 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

GAGAGGGCAA CAGTTCTATC GCTTCAAATT TTTTCTTGGT TTGCAGATAT TCAAGAATCG 60 

GGAGTTTTTC TATAGTATTC GGCAGATTTA TTACAGCCAA GCATCTCAAA AATACGGACA 120 
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GCATCCTCCA TCTTTTTCTG GCCTTCCTTG ACTCTACCTT GCTTGCTATC AAGGAGACCT 180 

TCTGCCCACA GATAAACAAT TCGGAAATAG GTCTCATTTT CCTTGTAGAA ATGCTCTTCG 240 

ATAACACGTT TAAAATAATA GGCATTGGTA AATTCTTCAC ACTCAATACT AGCTAAAAAG 300 

CCATTCAATA GTATAGTATG AAAAAGGTTT CGATTGCCAG ACATTTCCAT TAGAAAATCA 360 

GATTTACGTA CCATTTCTCG TACATATCTA GTAAAAAGAG AAACAGATAA AAATGGAGAA 420 

CTGACTGAAA ATAAATTGAG TTCATAGATT CCCCAGATCT CGGTAGAAAA CAAATAATCA 480 

TGAAGGACTT TTCCTTCCTC TGCTGTTAAG TCTACCCTTT CATCTATGCT CTTCATATAA 540 

GACTTGATAA TAATGGCATT TAGAATATGT TTCTGTTTGT TGTGAGAATG GGCATGCTTT 600 

TATACTCCCT GCGATATAAG TCCTCAAGAG GTGCTATATT CTTTGGTTCC AAGACATCTG 660 

TAATTTCTTT TCTCAACTCA GAATCTGTAT CATACTGGAA ACCTCTTGCC AGAAAGAGGA 720 

TCTCCTCCAC ACTGGCAGAT ATATTTTCCA GAGCAAATAG AAACTTTTCC ACCGAAAGCT 780 

CACTCTGACC TGTTTCAAAA CGGGACAACA TAGACGGCGA AAATTGTCCT CCGGTTGCTT 840 

GTCTCAGTGA GATATTTCTT GACTCTCGTA ATTGTCTAAA GACTTTTCCA ATCTGCTCCA 900 

TAGACTTCCC CTTGATTCCG TATTTTCTTC ATTTTATCAT ATTTTTCAGA AAATTCATCA 960 

AAAACTTGCC AAATTGTCAG AATTATGAGA AAATAGAGGA TATTTATCAC GTGGAGGGAC 1020 

TGCTATGAGA GACGATATCA AAATCAATGA CCGTGCTTTG GCCTTGCAAG ACCAAATTAT ,1080 

CGAAAAACTA GAGAAAGTTT TTGATACAGA TGTGGAATTG GATGTTTACA ATCTAGGTCT 1140 

GATTTATGAA ATCAATCTGG ATGAAACGGG GCTCTGCAAG ATTGTCATGA CCTTCACCGA 1200 

TACTGCCTGT GATTGCGCCG AAAGCCTGCC TATTGAAATC GTGGCAGGTC TGAAACAAAT 1260 

CGAGGGTATC AAAGATATCA AGGTTGAAGT TACCTGGTCG CCTGCTTGGA AAATCACACG 1320 

AATCAGTCGC TATGGCCGTA TTGCCCTTGG ACTACCACCT CGTTAAGCAG ACCAATCACT 13 80 

TTTAAAGATG AAAATCAAAG GGCAAACTAG AAAACTAGCC GCAGGTTGCT CAAAACACTG 1440 

TTTTGAAGTT ATGGATAGAA CTGACGAAGT CAGCTCAAAA CACTGTTTTG AGGTTGTGGA 1500 

TAGAACTGAC GAAGTCAgCT CAAAACACTG TTTTGAGGTT GTGGATAGAA CTGACGAAGT 1560 

CAGCCCAAAA CACTGTTTTG AGGTTGTGGA TAGAACTGAC GAAGTCAGTA ACCATACCTA 1620 

CGGCAAGGCG ACGTTGACGT GATTTGAAGA GATTTTCGAG TATGAGTTTA TTTTTTACCT 1680 

GACTTGTCCA TATTCCAGAA GTCTGTCACG GCTCCGCGTG AAGCAGATGA TACGATGTGG 1740 

GCATATTTAC CGAGGACACC ACGGCTGTAA AGTGGTGGCA AGGTTGTTTC TGCCTTGCGT 1800 

TTTTCAAGTT CTTCTTCGGA TACGGCCATA GAAATTTCTT TGGTATCTTG GTCAACCGTA 1860 
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ACGATATCGC CGGTACGGAG ATAGGCAATT GGTCCACCAT CCTGAGCTTC AGGAGCGATA 1920 

TGTCCAACAA CCAGACCATA AGTACCACCA GAGAAACGTC CGTCCGTCAA GAGGGCCACC 1980 

TTATCTCCCT GACCTTTACC AACAATCATT GAAGAAAGTG ATAGCATCTC AGGCATACCA 2040 

GGACCACCTT TAGGTCCAAC AAAACGAACA ACGACTACAT CGCCATCAAC GATTTCATCT 2100 

GTCAGAACGG CCTGAATCGC ATCTTCTTCT GAGTCAAAGA CCTTAGCTGG CCCAACGTGA 2160 

CGACGCAGTT TAACACCTGA TACCTTGGCA ACTGCACCGT CAGGAGCAAG GTTCCCGTTC 2220 

AAGATGATAA GCGGACCATC CGCACGTTTT GGATTTTCAA GTGGCATGAT AACTTTTTGG 2280 

CCTGGAGTCA AGTCTGCAAA GTCAGCCAAG TTTTCAGCTA CAGTCTTACC AGTACATGTG 2340 

ATGCGATCTC CGTGAAGGAA ACCATTTGCC AACAAATACT TCATAACCGC AGGGACACCA 2400 

CCGACTTCGT AGAGGTCTTG GAAGACATAC TGACCAGATG GTTTCAAGTC GGCCAAGTGA 2460 

GGCACACGTT CTTGAATCGT ATTGAAGTCC TCAAGTGACA AGTCAACATT TGCGGCATGG 2520 

GCAATGGCGA GCAAGTGAAG AGTGGCGTTT GTAGAACCAC CGAGAGCCAT CGTTACAGTG 2580 

ATAGCATCTT CAAAGGCTTC ACGAGTCAAG ATATCTGATG GTTTGAGACC AAGTTCCAAC 2640 

ATCTTAACAA CAGCACGTCC TGCTGCTTCG ATATCTTCTT TCTTATCAGC TGATTCAGCT 2700 

GGGTGAGAGG ATGACCCTGG CAAACTCATC CCTAGAACTT CGATAGCAGT TGCCATGGTA 2760 

TTAGCAGTAT ACATACCACC ACAACCACCA GGGCCAGGGC AGGCATTACA TTCAAGACGT 2820 

TTCACGTCCT CAGCTGTCAT GTCACCGTGG TTCCATTTTC CGATACCTTC AAAGACAGAA ■ 2880 

ACCAAGTCGA TATCTTTACC ATCAAGATTT CCCGGTGCAA TAGTTCCACC ATAGGCGAAA 2940 

ATAGCTGGGA TATCCATATT AGCAATAGCA ATCATAGATC CAGGCATGTT CTTGTCACAG 3000 

CCACCGATAG CGACGAAGGC ATCCACGTTG TGACCACTCA TAGCCGCCTC GATGGAGTCC 3060 

GCGATGATGT CACGAGATGT TAGAGAGAAA CGCATACCAG GCGTTCCCAT AGCGATCCCG 3120 

TCCGCTACGG TAATGGTTCC AAACTGTACA GGCCAAGCGC CTGCAGATTT GACACCTTCT 3180 

TTAGCCAGTT TCCCGAAATC ATGCAAGTGA ATGTTACATG GTGTATTTTC CGCCCAAGTC 3240 

GAAATCACTC CCACAATCGA TGTTTCAAAG TCCTTATCTG TCATACCAGT CGCAOGAAGC 3300 

ATAGCACGGT TAGGTGATTT AACCATGCTG TCATAAATGC TACTGCGGTG ACGTTTATCT 3360 

AATTCAGTCA TCTTATCCCT CCCATTTCAG TTTTTACTAT TATAGCACAA TTTTCGCATG 3420 

AAGAACAGAA TAAAATTCTT GAATTTTCAG AAAATTCTAT ACACATGTGA AATATTTAAA 3480 

ATTAAAAACA ACAAAGCGGA TTAGTGCACT TTCTGATGAC CAGAATATGC TTTTTAATCC 3540 

GCTTTCTTTA AATAACGTAC TGTAATTTTT ACAGAAATTC TTTCAAATAA GTGTATTTAA 3600 

CATCTATCTT GCATTATAAA TTTCTAGAAC CTTCTCTTTT ATATTCGATT CACTCAAACC 3660 
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ATACTCATTA AGAAGATAAT CCATTTTCCC TACTTGACCG AATCTTTCTT GAACACCCAT 3720 

CCGATGAATT TTTGTTATTC CATCATCAGA GAATAATTCA CATAAAGCAC TGCCAATTCC 3780 

ACCTATCTGA TTGTGGTTTT CTACAGTAAA TATAGTTTTT CCACTTAACA TTGTTTTTAT 3840 

CTGTTCTGGT ATCGGTTTGA TTCTAAATAA ATCTATCACA CCTACTGAAT AACCTAATTT 3900 

AGACAGTTCA TCTGCAACTC GAATACTTGG AGCAACCATT ATGCCAGAAG CAACGATTAC 3960 

AAGATCTTCA CCATGCCTTA ACTCAATGTA GCCTTTAGAA AAATCTTCTC CACCTTGATA 4020 

CACAGGAACT GGAGCTTTTC TAATTGTTCG AATATATTTT AGTCCTTTTA AGTCTAATGT 4080 

CTGGTTCAAT ATTTCACGAA ATTGGATATC ATCAGTTGCT TCGAAAATGA TTGATTTAGG 4140 

AATTAAACGT AACAATCCAA TTTCTTCAAA TGGCATATGT GTTCCACCAT TCATCTCTGC 4200 

CGTTACTCCT GCATCTGATC CAATCACAGT GGCATCCAAT TGTGCGTATC CAAGAGAAAT 4260 

AAATAATTGA TCAAATACTC TTCGTGAAGC AAAAGGACCA AATGTATGAA GATAAGGTCT 4320 

AAACCCCTGA ATAGACAAGC CTGCTGCAAG GCCGACCATT TCTGCTTCCA TAATCCCAAC 4380 

ATTCACATAA CGGTCTCCAA AGTCCTTTTC AAGATTATTA GTAGCCATCG AACTTGACAA 4440 

ATCGGCTTCT AAGACTACTA TATCAGAATC ACTTTGATTA GCCTCTAAAA GGAAGTCTCT 4500 

ATATACATGC CGTAATTCTT TCGTACTTCT CATCATTCTG TTTCCTCCAA TTCCTGACTT 4560 

AATCTTTCTA CAACTGAAGT TAACATTTGT TTCTCCTCTA CAGTAGGGCG AAGATGATGA 4620 

TTGGATTTCA TTTCTTCCAG CTCTTGAACC CCTTGACCTT TAATAGTATC TAATACAATG -4680 

CACTTAGGTG ATGAATTATT TGACTGTTTT AATTGGACAA TCCCTTCATA AATTTCTCTA 4740 

ATATCTGAAC CCTTGACCCT AATGGATTCA AATCCAAATG CTGAAAATTT TTCTACGAAA 4800 

TCACCTGGAT TACAAATATC CTTTGTAAAA CCATCTAATT GTTTTTTGTT ATCATCAACA 4860 

AATACAATTA AGTTGGATAA CTGTTGATGA GAAGCAAACT GTATAGCCTC CCAACATTGT 4920 

CCCTCATTTA ACTCACCATC TCCAACAATA GCGTAAGTAT AAAAGGGACT CTTTCTTATT 4980 

CTCTGACCAT ATGCAAGTCC AGTTGCAACA CTAATTCCTT GTCCTAAAGA GCCCGTTGTC 5040 

ATATCTATGC CTGGCGTTAG ATTTCTATCA GGATGAGACG GTAATTTGGT TCCATTTGTA 5100 

TTTAAAGAAT ATAAGAATTC TTTGTCAAAG AAACCATTCA AATAGAGTGT ACTGTATAGA 5160 

GCTGGTCCTC CGTGACCTTT TGATAATATG AAATAATCTC TATCTCGTGC TGCAAATATT 5220 

TCTGGAGTCA TTGGCATTAT TTCACCATAA AGCACCGCTA AAACTTCTAC GATAGACAGA 5280 

CTTCCTCCGT AATGTCCGAA TCCAAGATGA TTCAATGTTC TAAGAGTATT TAATCGGATG 5340 

TTAGTCGCAA ATTTTCTTAA CCCATCTTCT CTATTTTTAC TTAAAATCAT CCCTTATTCC 5400 
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TCCGTTGCAG ATGGCTTTTT AATAAAGGAT ACTCCAAACA TAACTGCTAG AATAAGAACA 5460 

AGACCAATCA CAATGCCTGC TTGTGAGCCA AATTGATTTA ACATTCCTAA AATAATTCCT 5520 

GATAGACCAA AATCTGCATC TGAGAAAGTT GATCCTTGGA AACCAAGTCC TCCCAAAACT 5580 

GGCATTAAAA AGACTGGAAG AAAACTGATT AAAATACCTT GTAAAAATGC TCCAATAGTG 5640 

GCTCCACGAA CACCACCAGA TGCATTCCCA ATGACACCTG CAGTCGCTCC ACAGAAGAAA 5700 ' 

TGAGGCACAA CACCTGGTAA GATAACAACC GTTCCTGAAG CAATCATAAT TACCATACTT 5760 

ACTAAACCAC CAACAAAACT AGAGATAAAT CCAATTAGAA CTGCATTGGG TGCATAAGTA 5820 

TAAACAATCG GACAATCCAA AGCAGGTTTT GAATTAGGTA CAAGACGCTC TGAAATACCT 5880 

TTAAAGGCTG GAACAATTTC GCCCAAAATA AGGCGAACAC CTGCTAAAAT AACAAATACC 5940 

CCTGCTGCAA ATTGACCTGC TAATTGTAAA GCATAAACTA GACCACTTGT ACCACTACTG 6000 

ATTTCTTTTT CTATATATTC TGACCCTGCA AAGATAGCTA CAATAATGTA AATAACTGCC 6060 

ATGGATAAAG TAATACTAAC AGTACTATCA CGTAAAAAAG CTAAACTCTT TGGAAATTTA 6120 

ATGTCCTCTG TTGATTTTGA TTTGTCACCG ATAAGGCTAC CAGTAAAACC ACTCAACCAA 6180 

TATCCCAAAG AACTGAAATG ACCTAAAGCT ACCTTGTCAT TTCCAGTTAA TTGAACCATA 6240 

TATTTTTGCA CAAATGCTGG GGAAATACTC ATAATAATAC CGAGTGCTAA TCCTCCTAGT 6300 

AAGATGAGAG GCAAGCTAGT AAAGCCAGCA ACTGATAAAA TGACCGCAAT CATACATGCC 6360 

ATATATAGAG TGTGGTGCCC TGTTAAAAAA ATATATTTAA ATCGAGTAAA ACGAGCGATT 6420 

AAGATATTGA ACACCATGCC TGCAAACATA ATCATTGCAG TAGCTGAGCC ATATGTTGTT 6480 

AAAGCTACAG CTACAATTGC TTCATTATTC GGCACAACGC CAGATAAATG AAAAGCATGC 6540 

TCAAACATGG TACCAAATGG ATTTAAAGAA TTTTGTACAA TTCCTGCACC ACCAGATACA 6600 

ACTAAGAAAC CAACAAAGGT CTTAATTCCA CCTTTAATAA TATCAGGTAA TTTCTTCTTC 6660 

TGAAGAACTA ATCCTAAGAT TGCAATTAAA GCTACTAAAA TAGCTGGTGT ACTAACAATA 6720 

TCCAATATGA ACTTCATCAT GACGCTAGCC TCCTATATAA GTCCTTTTTC TTCACAAAGT 6780 

TTAGTAATTA ATTCTCGTAG TTCATCCATA TCAATAATAC TATTTAAGAT ACGAACATCT 6840 

CCAAGATGAC TAGCTGAATC AGCTAGATCA CGACCAACAA TCCAAATATC AGCTGCATTT 6900 

GGATCTGCTC CACCTAAATC ATAATGTTCA ACTTCTACAT CCGAAACATT CAAATCACTC 6960 

AATACAGATT CAATATTCAT CTGTACCATA AAACTTGAAC CTAATCCTGA ACCACAAGCT 7020 

GTACCAATTT TTAACATTAT CTAATCCTCC TGTTTAATTA TCATTTTAAT GTCATCATAG 7080 

TTTTTTGATG ATATTAAAGT TTGAACATGA TTTTTATCTC TTAAAATTGT TGTTAAATGT 7140 

GACAAAGCCT TTAAATGACT CTCATTATCA ATGGCTGCAA TACAAATCAA CAATCTTACC 7200 
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TCTTGTTCTG GATTATCCAA TAAATAAATC GGTTCTTCCA AAACTAACAT TGACATTCCT 7260 

ATTTCATTCA CACCTTCATC TGGCCGAGCG TGAGGAATTG CTACTCCCTT CCCTAAATTA 7320 

ATAAAAGGTC CAAACTCTTC TACTTTTTGA ATCATTGCCT CAGGGTAGTT CTCAGTTATC 7380 

TTATCTTGAT CCAAAAGCGG TTTAGCTGCT AAACGAATCG CCTCCTTCCA TCCTAATTTT 7440 

TGCGAACTAA CCTGATAGGT TTCTTTGGTA ATAAGTTGTT CTAGCACTGG TACAATTTCC 7500 

TTTCTATCAT TTTTTTGGTA AAGATAATTC TTTAACGCCA ATCTTAATTC CAATTCTTGT 7560 

GTAATAATTC CATATCTTTT GACAATATTC AGGATTTGTT CAATCTCAAA ATCTCCATAC 7620 

TCTAAATTCG GAAAATCTTT TAACACTAGT TCTACTAGTT GTATTGCTTG CTCTTCAGTC 7680 

ATCATAACCG AAACTAGATA ATTTGGCTTT TCTGTCTCCA CCTTTATGGT AGAAAAAACC 7740 

ATATCATAGT CACTACTAGC TTTCACCTGT AAATCATCAA TCTTTGAGGT TCCTATAAAC 7800 

TCAATTTGAG GAAATAATGC TAATAGATTC TCTTTTAACA TCAATGAAGA ACTAACACCA 7860 

TTAGGACAAA TGATTGCTGC TTTATACCAT TTTTGAGGCA AAGTATCTGC TTTCTTTAAA 7920 

TAACCTCCGA AATGGATAAC AAAATATGCT GTTTCACTAT CAGGTATGGG ATTGTCAATA 7980 

GCGTCCATCA AGGGCATCAA AGAATCTTTG ACTAATTCAA ATAAATCAGG ATAATGTTCT 8040 

TTAACATGCA ATACATATTC ATTTGAACTA GGTAGGCCGA ACTTTAATCT ATAGTAAGCC 8100 

GGTATAAGGT GGCGGCGAAG ATTTTCTCTC AATCCTTCCC TTTGTTTAAA ATGTAACAAA 8160 

GAAATATCTT CCATTCTACT TATAATAGCC TCTGTTAATT GATTAAAGTA AACCGGAGCA 8220 

ACATCTACTT CACCTTCAAA GCAACTTGAT AATAAAACGG TGATATAGCG ATAATCATCC 8280 

TCAGAAAACA CCGTATCTAT AATTCCCAAA TCAACCACTG TATCCAATAA AATAGTGGTT 8340 

ATATCTTGAA TAACAGGAGA TACTAATGTC TCTGAAAGAC ATACTCTTTC AACATCCCTT 8400 

TGATACCTAC ACAGAATGAA TACTAAACCG AAAAGGTAAA CTTTTAATTG ATTAACAATA 8460 

GGTACTAGCT GTAGCTTCTC ATAATAATCT TTAACTACCT GATCAATCAA ATCATAAGTT 8520 

AATGAATACC CCCAACTGGA TAAAACATAA TCCAAACCCC AAATCCCTAT GGAGGATTCC 8580 

AGCAACTCAC TAACCATTTG AAAAGCTAAG CGGTGCTTAT TCCACTCTGA ACCGTGTAAA 8640 

GTATAACCTT TTGCTCTACT GTACCCTAGC TCCAAATCAT TATCTAACAT AATCTTTCTT 8700 

AATGATTGAA TATCAGATAA GGTTGTATTC TTACTTACTT TCAAAAAGTC TTGGTAATGA 8760 

CTATTCGATA TAAAATCTAA TCGGCAAAAA GTGTAAAGAT AGATTAAAGC TAAGCGAGTC 8820 

GACTTTGGTA AAACCAATTC ATCCGACTTA ATAATATCTG TCAAAGACTG CTTCGTACGA 8880 

TTTGATAAAC TATAGCGACC TTGCTTTTTA TCCAGCACTA TCCCTTTATT AGCTAGATAA 8940 
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GGCACTAAAT AATCTATTCC TTCTTTGACT TCCTTTATAG GTAAGCTCAC CTTAACAGAT 9000 
AATTCATATA ACGATAGCTC ACAATGATCC ATCAAAGTCA TCAAAAT AAC TAGTGCTCTA 9060 
TAATCAAAC 9069 
(2) INFORMATION FOR SEQ ID NO: 98: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 8654 -base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 



CGAGACAACA 


AGATGAAGAA 


AAATTTGCCC 


TATCGTTTGT 


GGCGCTTGCA AGTGTAGCAC 


60 


TTCTTGCAGC 


CTGTGGAGAA 


GTGAAGTCTG 


GAGCAGTCAA 


CACTGCTGGT 


AACTCAGTAG 


120 


AGGAAAAGAC 


AATTAAAATC 


GGGTTTAACT 


TTGAAGAATC 


AGGTTCTTTA 


GCTGCATACG 


180 


* »n A/""'fTV" , h 
vaAAt. Av»t* 1XSJ\ 


ACAAAAAGGT 


GCCCAATTGG 


CTGTTGATGA 


AATCAATGCC 


GCAGTGGTAT 


240 


CGATGGAAAA 


CAAATCGAAG 


TAGTCGATAA 


AGATAATAAG 


TCTGAAACAG 


CTGAGGCTGC 


300 


TTCAGTTACA 


ACTAACCTTG 


TAACCCAATC 


TAAAGTATCA 


GCAGTCGTAG 


GACCTGCGAC 


360 


ATCTGGTGCG 


ACTGCAGCTG 


CGGTAGCGAA 


CGCTACAAAA 


GCAGGTGTTC 


CATTGATCTC 


420 


ACCAAGTGCG 


ACTCAAGATG 


GATTGACTAA 


AGGTCAAGAT 


TACCTCTTTA 


TTGGAACTTT 


480 


CCAAGATAGC 


TTCCAAGGAA 


AAATTATCTC 


AAACTATGTT 


TCTGAAAAAT 


TAAATGCTAA 


540 


GAAAGTTGTT 


CTTTACACTG 


ACAATGCCAG 


TGACTATGCT 


AAAGGGATTG 


CAAAATCTTT 


600 


CCGCGAGTCA 


TACAAGGGTG 


AAATCGTTGC 


AGATGAAACT 


TTCGTAGCAG 


GTGACACAGA 


660 


CTTCCAAGCA 


GCCCTTACAA 


AAATGAAAGG 


GAAAGACTTT 


GATGCTATCG 


TTGTTCCTGG 


720 


TTACTATAAT 


GAGGCTGGTA 


AAATTGTAAA 


CCAAGCGCGT 


GGCATGGGAA 


TTGACAAACC 


780 


AATCGTTGGT 


GGTGATGGAT 


TCAACGGTGA 


GGAGTTTGTA 


CAACAAGCAA 


CTGCTGAAAA 


840 


AGCATCAAAC 


ATCTACTTTA 


TCTCAGGCTT 


CTCAACTACT 


GTAGAAGTTT 


CAGCTAAAGC 


900 


TAAAGCCTTC 


CTTGACGCTT 


ACCGTGCTAA 


GTACAATGAA 


GAGCCTTCAA 


CATTTGCAGC 


960 


CTTGGCTTAT 


GATTCAGTTC 


ACCTTGTAGC 


AAACGCAGCA 


AAAGGTGCTA AAAATTCAGG 


1020 


TGAAATCAAG 


AATAACCTTG 


CTAAAACAAA 


AGATTTTGAA 


GGTGTAACTG 


GTCAAACAAG 


1080 


CTTCGATGCA 


GACCACAACA 


CAGTCAAAAC 


TGCTTACATG 


ATGACCATGA 


ACAATGGTAA 


1140 


AGTTGAAGCA 


GCAGAAGTTG 


TAAAACCATA 


ATAGAAAAAT 


GTTGAAATAG 


GGAATGAGCC 


1200 


TTTGACTCAC 


TCCCTGTTTC 


GATATTTAAT 


ACTCTTCGAA 


AATCTCTTCA 


AACTGCGTCA 


1260 
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ACGTCGCCTT 


GGATTATATA 


TGTGACTGAC 


TTCGTCAGTC 


TTATCTACAA 


CCTCAAAGCA 


1320 


GTGCTTTGAG 


CAACCTGCGG 


CTAGTTTCCT 


AGTTTGCTCT 


TTGATTTTCA 


TTGAGTATAA 


1380 


GAACCTATCA 


AAAAGTGAGG 


GAAAACCCTC 


GGAATTATAA 


ATAGAAAGAG 


TGAATCTTAT 


1440 


GCTCCAACAA 


CTCGTAAATG 


GTTTGATTCT 


AGGTAGTGTT 


TACGCGCTGT 


TAGCCCTAGG 


1500 


ATATACCATG 


GTTTACGGAA 


TTATCAAGCT 


CATCAACTTC 


GCCCATGGTG 


ATATTTATAT 


1560 


GATGGGAGCC 


TTTATCGGTT 


ATTTCTTGAT 


CAATTCTTTC 


CAAATGAATT 


TCTTTGTAGC 


1620 


GCTTATTGTA 


GCTATGCTAG 


CGACAGCTAT 


TCTTGGTGTC 


GTGATTGAGT 


TTCTTGCTTA 


1680 


CCGACCTTTG 


CGCCACTCTA 


CTCGTATTGC 


TGTTTTGATT 


ACGGCTATTG 


GGGTTTCTTT 


1740 


CCTATTGGAG 


TATGGAATGG 


TCTATCTGGT 


TGGTGCCAAT 


ACCCGTGCCT 


TCCCTCAAGC 


1800 


GATTCAAACA 


GTTCGATATG 


ATTTGGGACC 


AATTAGCTTA 


ACAAATGTGC 


AGTTAATGAT 


1860 


TTTGGCCATT 


TCCTTGATTT 


TGATGATTTT 


GTTACAAGTC 


ATTGTCCAAA 


AGACTAAGAT 


1920 


GGGGAAAGCC 


ATGCGTGCAG 


TATCAGTAGA 


TAGCGACGCG 


GCGCAATTGA 


TGGGGATCAA 


1980 


TGTAAACCGT 


ACGATTAGCT 


TTACCTTCGC 


TTTGGGTTCT 


GCTCTTGCGG 


GTGCGGCTGG 


2040 


TGTTCTGATT 


GCTCTTTATT 


ATAACTCTCT 


TGAGCCTTTG 


ATGGGGGTTA 


CTCCAGGTCT 


2100 


TAAATCTTTC 


GTTGCCGCAG 


TACTTGGTGG 


TATCGGAATT 


ATTCCTGGTG 


CGGCTCTTGG 


2160 


TGGCTTTGTG 


ATTGGTCTAT 


TGGAAACCTT 


TGCGACTGCC 


TTTGGGATGT 


CAGATTTCCG 


2220 


TGATGCCATT 


GTTTATGGAA 


TCTTGTTGTT 


GATCTTGATT 


GTCCGCCCAG 


CTGGTATCCT 


2280 


TGGTAAGAAT 


GTGAAAGAGA 


AGGTGTAAAC 


GATGAAGGAA 


AATTTAAAAG 


TTAATATTCT 


2340 


ATGGTTACTC 


CTTTTGTTAG 


CTGGCTATAG 


CTTGATTAGT 


GTACTGGTTT 


CAGTCGGAGT 


2400 


ACTTAATCTA 


TTCTATGTAC 


AGATTTTACA 


ACAAATTGGA 


ATTAATATTA 


TTTTGGCTGT 


2460 


TGGTCTCAAC 


TTAATCGTTG 


GTTTTTCAGG 


ACAATTTTCA 


CTTGGTCATG 


CTGGTTTCAT 


2520 


GGCGATTGGT 


GCCTATGCAG 


CAGCTATTAT 


TGGTTCTAAA 


TCACCAACCT 


ACGGTGCCTT 


2580 


CTTTGGAGCT 


ATGCTTGTAG 


GGGCTTTGCT 


TTCAGGAGCA 


GTTGCCTTAC 


TTGTCGGCAT 


2640 


TCCAACCTTG 


CGCTTGAAGG 


GGGACTATCT 


TGCGGTAGCA 


ACTCTGGGTG 


TTTCTGAAAT 


2700 


TATCCGTATC 


TTTATCATCA 


ATGGTGGAAG 


CCTTACAAAT 


GGTGCGGCAG 


GTATCTTAGG 


2760 


GATTCCTAAC 


TTTACAACTT 


GGCAAATGGT 


TTACTTCTTT 


GTCGTGATTA 


CAACCATTGC 


2820 


AACCTTGAAC 


TTCTTGCGTA 


GCCCAATTGG 


TCGTTCAACC 


CTCTCTGTTC 


GTGAAGATGA 


2880 


AATCGCTGCT 


GAGTCAGTTG 


GGGTTAATAC 


GACTAAAATT 


AAAATCATCG 


CTTTTGTCTT 


2940 


TGGTGCCATT 


ACTGCAAGTA 


TTGCTGGGTC 


ACTTCAGGCA 


GGATTTATCG 


GGTCTGTTGT 


3000 
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ACCGAAAGAT TACACCTTCA TCAACTCAAT CAACGTTTTG ATTATTGTTG TATTTGGTGG 3060 

ACTCGGTTCC ATTACAGGTG CGATTGTTTC GGCTATTGTT CTGGGAATTT TGAATATGCT 3120 

TCTCCAAGAT GTTGCTAGTG TGCGTATGAT TATTTACGCT TTGGCCTTGG TATTGGTAAT 3180 

GATTTTCAGA CCAGGTGGAC TCCTTGGAAC ATGGGAACTG AGCCTATCAC GTTTCTTTAA 3240 

AAAATCTAAG AAGGAGGAAC AAAACTAATG GCATTACTTG AAGTAAAACA' GTTAACCAAA 3300 

CATTTTGGTG GTCTAACAGC TGTTGGAGAT GTGACTCTTG AATTGAACGA AGGGGAACTG 3360 

GTTGGATTAA TCGGTCCAAA CGGAGCTGGG AAAACCACCC TTTTCAACCT TTTGACCGGT 3420 

GTTTATGAAC CAAGCGAGGG AACAGTAACC CTAGATGGTC ACCTTTTGAA TGGGAAATCA 3480 

CCTTATAAGA TTGCCTCTTT GGGACTTGGA CGTACTTTCC AAAATATCCG TCTCTTTAAA 3540 

GATTTAACAG TTTTAGATAA TGTTTTGATT GCTTTTGGAA ACCATCACAA ACAGCATGTT 3600 

TTTACTAGTT TCTTACGCTT ACCAGCTTTT TACAAGAGTG AAAAAGAATT AAAGGCTAAA 3660 

GCTTTGGAAT TGTTGAAAAT CTTTGATTTA GATGGTGATG CAGAGACTCT TGCTAAAAAT 3720 

CTTTCCTACG GACAACAACG TCGTTTGGAA ATTGTTCGTG CCCTTGCTAC GGAACCTAAA 3780 

ATTCTCTTCT TAGATGAACC AGCAGCAGGT ATGAACCCAC AGGAAACAGC CGAATTGACT 3840 

GAGTTAATTC GTCGTATCAA AGATGAGTTT AAGATTACAA TCATGTTGAT TGAACACGAT 3900 

ATGAATCTGG TCATGGAAGT AACAGAACGT ATCTACGTAC TTGAATATGG CCGTTTAATC 3960 

GCTCAAGGAA CTCCAGACGA AATTAAGACC AATAAACGCG TTATCGAAGC TTATCTAGGA 4020 

GGTGAAGCCT AATGTCTATG TTAAAAGTTG AAAATCTTTC TGTGCATTAC GGTATGATCC 4080 

AAGCAGTTCG TGATGTAAGC TTTGAAGTTA ATGAAGGAGA AGTTGTTTCC CTTATCGGTG 4140 

CCAACGGTGC AGGTAAGACA ACTATTCTTC GCACCTTGTC AGGTTTGGTT CGACCAAGTT 4200 

CAGGAAAGAT TGAATTTTTA GGTCAAGAAA TCCAAAAAAT GCCAGCTCAG AAAATCGTGG 4260 

CAAGTGGTCT TTCACAAGTT CCAGAAGGAC GCCACGTCTT TCCTGGCTTG ACTGTTATGG 4320 

AAAATCTTGA AATGGGAGCT TTCTTAAAGA AAAATCGTGA AGAAAATCAA GCTAACTJ'GA 4380 

AGAAGGTTTT CTCACGCTTT CCTCGTCTTG AAGAACGGAA GAACCAAGAT GCAGCCACTC 4440 

TTTCAGGGGG GGAACAACAA ATGCTTGCCA TGGGACGCGC CCTCATGTCA ACACCAAAAC 4500 

TTCTTCTTTT AGATGAACCA TCAATGGGAC TTGCCCCAAT CTTTATCCAA GAAATTTTTG 4560 

ATATCATTCA AGATATTCAG AAGCAAGGAA CAACGGTCCT CTTGATTGAA CAAAATGCCA 4620 

ATAAAGCACT TGCAATCTCT GACCGAGGAT ATGTACTGGA AACAGGGAGA ATCGTCCTAT 4680 

CAGGAACAGG AAAAGAACTC GCTTCATCAG AAGAAGTCAG AAAAGCATAT CTAGGTGGCT 4740 

AAAACAATCC AGTGGATTGT TTTAGTCGGC AGATGGAGAT TACGAAGTAA TCATCAATAT 4800 
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AGTCCGGGGG ACCTTTTTAG TCGGTAGATT GAGATTGCAA ACAAATCTGC ATCTACATTG 4860 

AAAGCTTAAT TTCTAATAAT TGAAAAAATC GAATGAAAAA TTTCTTACCT TCATTCACAG 4920 

AGCTCGATTT CAGAGCTCTT TTTGCTAGCT TATTCATACT TTTCTGAATT TCGAAAAAGA 4980 

/ AATGTAAGCG TTTGATAGAT TTACAAAAAG ATTGTATAAT AGGGATAAGA ATAGAAAAGG 5040 

AGAAGTCTCA TGGCAGTTAA AGATTTTATG ACCCGCAAGG TAGTTTATAT TAGTCCAGAT 5100 

ATAACAGTAT CTCATGCAGC AGATTTGATG AGAGAGCAAG GTTTGCACCG TCTGCCTGTT 5160 

ATCGAAAATG ATCAATTAGT TGGTTTGGTG ACTGAGGGAA CCATTGCACA AGCAAGTCCA 5220 

TCTAAAGCAA CAAGTCTTTC TATCTATGAG ATGAATTATC TTCTGAATAA GACAAAAGTA 5280 

AAAGATGTCA TGATTCGCGA TGTTGTCACT GTCTCAGGCT ATGCTAGTCT AGAAGATGCA 5340 

ACTTATCTGA TGTTGAAAAA TAAGATTAGT ATTCTCCCTG TCGTAGATAA CCATCAAGTA 5400 

TACGGAGTTA TTACTGACCG TGACGTTTTC CAAGCCTTTC TTGAAATTGC AGGTTATGGC 5460 

GAAGAAGGGA TTCGTGTACG CTTTGTTACA GAAGATGAAG TTGGTGTTCT TGGAAAAATT 5520 

GTTTCTTTGA TTGTAGAAGA AAATTTGAAT ATCTCCCATA CAGTCAATAT TCCGCGTAAG 5580 

GATGGTAAGG TGATTATCGA AGTGCAAATC GATGGATCAA TTGATTTACC AGCCTTGAAA 5640 

GAAAAATTTG AAGCAAATGG TATTCAAGTG GAAGAAATCG CTCGCACTTC AGCAAAAGTC 5700 

TTGTAAGAAG GGAAGCCCAA AGGCTTCTTT TTTCATGAAA AGGGGATTAG AGCAAAAGAT 5760 

GGAAAGAAAT GATAAAATAT GCTATAATGA AATAATGTAA AAAAGGAGTA TTTATGGACA 5820 

TTTCAGTAAT TCGTCAGAAA ATTGACGCAA ATCGTGAAAA ATTAGCTTCT TTCAGGGGGT 5880 

CTCTTTGACC TCGAAGGGCT AGAGGAAGAG ATTGCCATCT TGGAAAACAA GATGACAGAA 5940 

CCTGATTTTT GGAACGATAA TATTGCGGCC CAAAAAACGT CGCAAGAATT AAATGAATTA 6000 

AAAAACACTT ACAATACCTT CCATAAGATG GAAGAGTTGC AGGATGAAGT CGAAATTTTA 6060 

TTGGATTTTT TGGCTGAAGA CGAGTCAGTG CATGATGAAC TGGTAGCGCA GTTAGCCGAA 6120 

CTTGATAAGA TAATGACCAG CTACGAGATG ACTCTACTCT TGTCAGAACC TTATGACCAC 6180 

AACAATGCCA TCTTGGAAAT CCATCCAGGT TCTGGTGGTA CTGAGGCGCA GGACTGGGGT 6240 

GATATGTTGC TTCGTATGTA TACTCGTTAT GGTAATGCTA AAGGCTTTAA AGTGGAAGTG . 6300 

TTGGATTACC AAGCAGGTGA TGAGGCTGGT ATTAAGTCGG TAACTTTATC ATTTGAAGGG 6360 

CCTAATGCCT ATGGTCTCCT CAAGTCAGAA ATGGGTGTTC ACCGCTTAGT GCGAATCTCA 6420 

CCATTTGACT CTGCCAAACG TCGCCATACC TCTTTCACAT CTGTAGAAGT GATGCCAGAA 6480 

TTGGATGATA CTATTGAAGT GGAAATCCGT GAAGATGATA TCAAGATGGA TACCTTCCGT 6540 
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TCAGGTGGTG 


CCGGTGGACA 


AAACGTCAAT 


AAGGTTTCAA 


CAGGTGTACG 


TTTAACCCAC 


6600 


ATTCCAACTG 


GAATTGTTGT 


CCAATCAACA 


GTAGATCGTA 


CCCAGTATGG 


AAATAGAGAT 


6660 


CGTGCCATGA 


AGATGTTGCA 


GGCTAAGCTC 


TATCAAATGG 


AGCAAGATAA 


GAAGGCTGCG 


6720 


GAGGTAGATT 


CTCTCAAAGG 


TGAGAAAAAG 


GAGATCACTT 


GGGGAAGCCA 


AATCCGTTCT 


6780 


TATGTCTTCA 


CGCCTTATAC 


TATGGTAAAA 


GATCACCGAA 


CTAGCTTTGA 


GGTTGCTCAG 


6B40 


GTAGATAAGG 


TTATGGATGG 


GGACCTAGAT 


GGTTTTATCG 


ATGCTTATCT 


CAAGTGGCGA 


6900 


ATTAGCTAAG 


ATAGAAAGGA 


ACTCACATGT 


CAATTATTGA 


AATGAGAGAT 


GTCGTTAAAA 


6960 


AATACGACAA CGGAACAACT 


GCTCTACGCG 


GTGTTTCGGT 


TAGCGTTCAA 


CCGGGGGAAT 


7020 


TTGCTTACAT 


CGTAGGACCT 


TCAGGAGCAG 


GGAAGTCAAC 


TTTTATTCGT 


TCTCTGTATC 


7080 


GTGAAGTAAA 


AATCGATAAA 


GGAAGCCTAT 


CAGTTGCTGG 


TTTTAATCTG GTTAAGATCA 


7140 


AAAAGAAAGA 


TGTCCCGCTT 


CTACGTCGTA 


GTGTTGGGGT 


TGTCTTCCAG 


GATTATAAAT 


7200 


TGTTACCAAA 


GAAAACTGTC 


TATGAAAATA 


TTGCTTACGC 


TATGGAAGTA 


ATCGGGGAAA 


7260 


ATCGCCGTAA 


TATCAAAAGA 


CGAGTGATGG 


AAGTTTTGGA 


CTTGGTTGGA 


TTGAAGCATA 


7320 


AGGTTCGTTC 


TTTCCCAAAT 


GAACTCTCAG 


GTGGGGAGCA 


ACAGCGGATT 


GCGATTGCGC 


7380 


GTGCAATTGT 


AAATAATCCC 


AAAGTATTGA 


TAGCTGATGA 


GCCAACAGGA 


AATCTGGATC 


7440 


CGGATAATTC 


ATGGGAAATT ATGAATCTCT TGGAACGGAT TAACyTACAA GGAACAACTA 


7500 


TTTTGATGGC 


GACTCATAAT 


AGCCAGATTG 


TAAATACCTT 


GCGCCACCGT 


GTCATTGCCA 


7560 


TTGAAAATGG 


CCGTGTCGTT 


CGTGACGAAT 


CAAAAGGAGA 


GTATGGATAC 


GATGATTAGT 


7620 


AGATTTTTTC 


GCCATTTATT 


TGAAGCCTTA 


AAAAGTTTGA 


AACGAAATGG 


TTGGATGACA 


7680 


GTAGCTG CTG 


TCAGTTCAGT 


CATGATTACT 


TTGACCTTGG 


TGGCAATATT 


TGCATCTGTT 


7740 


ATTTTCAATA 


CAGCGAAACT 


AGCTACAGAT 


ATTGAAAATA 


ATGTCCGTGT 


AGTAGTTTAT 


7800 


ATCCGAAAGG 


ATGTGGAAGA 


TAATAGTCAG 


ACAATTGAAA 


AAGAAGGTCA 


AACTGTTACA 


IDCft 

/fcJoU 


AATAATGACT 


ACCACAAGGT 


ATATGATTCT 


TTGAAGAACA 


TGTCTACGGT 


TAAAAGTGTT 


7920 


ACCTTTTCAA 


GTAAAGAAGA 


ACAATATGAA 


AAATTAACCG 


AGATAATGGG 


AGATAACTGG 


7980 


AAAATCTTTG 


AAGGAGATGC 


CAATCCTCTC 


TATGATGCCT 


ATATTGTAGA 


GGCAAACACT 


8040 


CCAAATGATG 


TAAAAACTAT 


AGCCGAAGAT 


GCTAAAAAAA 


TTGAAGGTGT 


CTCTGAGGTT 


8100 


CAAGATGGCG 


GTGCCAATAC 


AGAAAGACTC 


TTCAAGTTAG 


CTTCATTTAT 


CCGTGTTTGG 


8160 


GGACTAGGGA 


TTGCTGCTTT 


GTTAATTTTT 


ATCGCAGTTT 


TCTTGATTTC 


AAATACCATT 


8220 


CGTATTACCA 


TTATTTCCCG 


CAGTCGCGAA 


ATTCAAATCA 


TGCGCTTGGT 


CGGAGCTAAA 


8280 


AACAGTTATA 


TCCGTGGACC 


GTTCTTGTTA 


GAAGGAGCCT 


TTATCGGTTT 


ATTGGGAGCT 


8340 
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ATCGCACCAT CTGTTTTGGT CTTTATTGTT TATCAAATTG TTTACCAATC TGTCAACAAA 8400 

TCGTTGGTAG GGCAAAATCT ATCCATGATT AGTCCAGATT TATTTAGTCC GTTGATGATT 8460 

GCCCTACTAT TTGTGATTGG GGTTTTCATT GGTTCATTGG GATCAGGAAT ATCCATGCGC 8520 

CGATTCTTGA AGATTTAGGT AAAATAGCTG CTTTTATGAG GAGATTGTAA AATCTCCTTT 8580 

TTTGCTACAA GAGTTTTTGA AAAGAGATGC GCAGAAGAAA AGAGCTTCCA AAGAAGTCCC 8640 

CCAGAGAAGA CTTC ~ 8654 



(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 



TGTCGCGTCA, 


AAATCATTAC 


TATGGCTATG 


TATAGCCCTT 


ACTATGACTT 


GGCTAAACAC 


60 


GTTCGCTTTC 


AAATTTCTAG 


GCTCAGGCTG 


AAACAGTCTC 


CCAGGCTGTT 


CACTCCCGAA 


120 


TGCTAAAATC 


GTTCTTGATC 


GCTTTCACAT 


TGTACAACAT 


CTTAGCCGTG 


CTATGAGTCG 


180 


TGTGCATGTC 


CAAATCATGA 


ATCAGTTTCA 


TCGAAAATCC 


CATGAATACA 


AGGCTATCAA 


240 


GCGCTACTGG 


AAACTCATTC 


AACAGGATAG 


CCGTAAACTG 


AGTGATAAGC 


GATTTTATCG 


300 


CCCTACTTTT 


CGCATGCACT 


TAACAAATAA 


AGAAATTCTT 


GACAAGATTT 


TAAGCTATTC 


360 


AGAAGACTTG 


AAACACCACT 


ATCAGATCTA 


TCAACTCTTA 


CTTTTTCACT 


TTCAGAACAA 


420 


AGACCCTGAG 


AAATTTTTCG 


GACTCATTGA 


GGACAATCTG 


AAGCAGGTTC 


ATCCTCTTTT 


480 


TCAGACTGTC 


TTTAAAACCT 


TTCTCAAAGA 


TAAAGAAAAG 


ATTATCAACG 


CCCTTCAACT 


540 


ACACTATTCT 


AATGCCAAAC 


TGGAAGCGAC 


CAATAATCTC 


ATCAAACTTA 


TCAAGCGCAA 


600 


TGCCTTTGGT 


TTTCGAAACT 


TTGAAAACTT 


CAAAAAACGG 


ATTTTTATCG 


CTTTGAACAT 


660 


CAAAAAAGAA 


AGGACGAAAT 


TTGTCCTTTC 


TCGAGCTTAG 


CTGACTTCAA 


CCCACTACAG 


720 


TTGACAAAGA 


GCCTAATTTC 


CATAAAAATT 


GACATGGAAA 


TTATAAAACC 


ATTACTAGTT 


780 


TAGTCCTTTT 


TGATAACGTG 


CCAATTCGGC 


TTGGTTCGCC 


CAAACATAGT 


GACCTGGACG 


840 


GATTTCTACC 


ATAGATGGCT 


TATCAGTCTC 


ATAGTCGTGT 


TGACTTGGAT 


CGTAAACCTT 


900 


CAAGACCTTC 


TTACGTTCCA 


AGATTGGATC 


TGGGATTGGT 


ACCGCTGAAA 


GCAAGGCTTG 


960 


AGTATATGGG 


TGAATTGGAT 


TGTTAAACAA 


TTCTTCTGTT 


TCTGCAACCT 


CTACAATAAC 


1020 
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ACCCTTGTAA ATAACTGCGA TACGATCTGA AATAAAGCGA ACAACCGACA AGTCATGGGC 1080 

GATGAAGAGA TAGGTCAGGC CGAGCTCTTT TTGGAATTTT TTGAGCAAGT TCAAGACTTG 1140 

GGCACGTACA GAAACGTCCA AGGCTGAAAT TGGCTCATCT GCAATAACAA AGTCTGGTTG 1200 

CATGACCAAG GCACGGGCAA TACCGATACG TTGACGTTGA CCGCCTGAGA ATTCATGAGG 1260 

GTAACGAGTC AAGTGCTCAG CAAGAAGACC TACTTCACGG ATAATATTTT GAACTTTCTC 1320 

TTTACGTTCT TCTTCATCCT TAAATAAACG GTGATTGTAA AGACCTTCAG AAATAATATA 1380 

ATCAACAGTC GCACGTTCAT TCAAACTTGC GGCAGGGTCT TGGAAAATCA TCTGGATTCG 1440 

ACGAATCAAT TCCGCAGCTT GTTCACGCGA TTTCTTACCA TTAATCTTTT GACCATCAAA 1500 

AATGATATCT CCATTACTTG TATCATTTAG ACCGATGATA GCACGACCAA TAGTTGTTTT 1560 

CCCACTACCG GACTCACCTA CAAGCGAGAA AGTTTCTCCC TTGTTGATAA AGAAGTTAGC 1620 

ATTTTTAACC GCGACAAACT TCTTACTTCC TTCACCGAAG GAAATTTCTA AATCTTTGAT 1680 

TTCTACTAAT TTTTCAGACA TTTCCTTCCT CCTAGTCAGC CAGATGGGCA AATCCCATTT 1740 

TTTCACGGAT CTTATCATGG AGATTTGCAA TCACAGCTGG TTTTTCTACT TTCGGAGCAT 1800 

CCTCATGAAG AAGCCAAGTT TTAGCCCAAT GTGTCTCTGA TACTGAGAAT TGAGGAGCTT 1860 

TTTGTTCGAA GTCAATCTGC ATTGCGTAGT CAGAACGCAA GGCAAAAGCA TCCCCTTTCA 1920 

GGTCAGTATA AAGTGACGGA GGTGTTCCTG GGATTGAGTA AAGATCCCCT TTATCATCAG 1980 

CAAGCTGAGG CAAGCTAGAC AAGAGACTCC ATGTATATGG ATGGCGAGGG TCATAGAAGA 2040 

CTTCCTCAAC CGTTCCATAC TCAACGATTT CTCCTGCATA CATAACCGCT ACCTTATCCG 2100 

CAATACTTGC CACCACACCA AGGTCGTGGG TAATAAAGAT TGTTGTGAAA TGATACTCGT 2160 

TTTGTAAAGA TTTTAGCAAA TCAATAATCT GAGCTTGAAT AGTTACATCC AAGGCAGTTG 2220 

TTGGCTCATC ACAGATCAAG ACATCAGGTC GGCAGGCAAG GGCAATAGCA ATAACGATAC 2280 

GTTGACGCAT TCCTCCAGAA TATTGGAATG GGTATTCATT AAAACGTCTA TCTGCGTCTG 2340 

GAATGCCAAC CTTATTCATG TAGTCAATGG CCAATTCTTT CGCTTCTTTA GCTGTTTTTC 2400 

CTTGGTGTTT TACAATAACT TCTGTAATCT GACTACCAAT TGTTTTAATG GGGTCCAAAC 2460 

TAGTCATTGG GTCCTGGAAG ATAGTCGCAA TCTTAGCACC ACGAATTTGT TCCCAATCCT 2520 

TGTGAGAAGA TAAAGCTGTC AAGTCCTGAC CACGGTAGTC AATACTACCT TGGGCAATAC 2580 

GACCATTTTC TTCGAGCATA CCTGTGAAGG TCTTTGTCAA AACAGATTTA CCTGATCCTG 2640 

ACTCACCTAC CAAGGCTAAT ACTTCTCCTT CGACTAGTTC AAGGGAAACG CCGCGAATGG 2700 

CTGTCAATAC TTTGTCACGA ACGTCAAATT CCACGACAAT ATCGCGAGCA GTCAAAATTA 2760 

CATTTTTTTC TTTTGTCATT TCTACTCCTA TCTATGTGTA CGTGGATCAC TAGCATCCGC 2820 
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TAAGTTTTGA CCAACTACGA AAAGGGACAA GGATACCAAG ACAAGGGTTG TCAATGGAAT 2880 

CCAGAACAAG TAAGCATTGG TTGTTACGTT TTGTGAATAA TCCGAAATCA AACGACCCAA 2940 

ACTTGGCACT GTAATCGGTA ATCCAAGACC GAAGAAAGAC AAGAAGGCTT CGTATGAGAT 3000 

AAAGCTTGGA AGCATTTGAG TCATGGTTGT CACAATAACA GATACCAATT GAGGCATGAT 3060 

ATTTTTGGCA ACAATCTTCA AGGTTGGTGT TCCCAAAGTA CGTGACGCCA AGTTGTATTC 3120 

CAAGTCACGA TAGCGCAAGA TTTGCACACG GATCATGAAG GCAATACCAA TCCATGTTGT 3180 

TACGCTCATG GCAAAAATCA GATTCCAGAA TCCAGCTCCG ATTGAGTAAG TCAAGACAAT 3240 

AACAATCAAA AGAGGTGGGA TGTTTGAGAT GACGTTGTAA ACTTCCATCA TGACACGGTC 3300 

AACTGATTTT GAAATACCCC AAATACCACC GACAAAAACA CCGATAACCA AGTTAATCAC 3360 

TGTCGCAATC ACAGAAATGA GGATGGAGTT ACGAGCTCCG AACCAGACAC CGTCAAAGAG 3420 

CGATTTACCG TTACTGTCAG TACCGAACCA ATGCTCCGCA TTTGGCTTGA TATAACGAAC 3480 

ACTAAAGTCG TTTACCTTGC TGACATCATT GAAATCAAAC TTAGAAAACA TTGGGTAGAT 3540 

GAAACTTATC AAAATGATGG CTACCAAGAT TCCCAACATG ACTACAGTTG ATTTTTTCTT 3600 

CATAAATTGT TTAAACACTG ATTTCCAGTA AGAATATGCT GGCGCATCAA TAGTTTCAGA 3660 

GGCAAAATCG TCACGTTTTA CAAACTGAAA TTTTTCTTTA TCGATTGTAG ACATTATTTG 3720 

CCTCCTTTCT CAGTCAATTT AATACGTGGG TCAATAATAG TCATCCAAAT ATCTCCCAAA 3780 

AGACGTGAGA AGATAGAAAT ACATGTAAAG ATGAAGACAA GACCAACGAC CATAGAGTTA 3840 

TTAGATGCTT TTACAGAGTC AATCAACATT TTACCCATAC CTGGGAAGGC GAAGACTGTT 3900 

TCAGTAAGGG TTGCACCACC GATAACCCCA ATAATGGCAG CAGGAATTCC TGAAACCAGC 3960 

GGAACCATGG CATTTTTAAA GATGTGTTTG TTTGAAATTT CTTTTTCAGA CAAACCTTTT 4020 

GCACGAGCGA AACGAACAAA GTCTTGAGAT TGCAAGTCAA TCATGTAACG ACGAATCCAA 4080 

ATGGCTGTAC CAGGAGCACC CAACAAACCA AGGATGACTG CTGGTAAAAC GTAAGAACGC 4140 

CAATCTCCAG CTCCCAAGAT AGGGAATGAA TCTGGAAGGG CAATAGATGA TCCAATCAAT 4200 

CGAACGATGT AAACCAAGGC AATCGTTGGA AGAGCAAGCA AGAAGGTCAA AGCCCCTGTT 4260 

GAGAGGCTAT CAATCCAAGT GTTCTTGAAA CGAGCCATGG CTGAACCAAG TGGCACGGCA 4320 

AGAGCATAGG CAAGAACCAA ACCAATCAAA CCAGTAATAG CAGAGCTGAC AATCATAGAT 4380 

GGATATTGGT AATTACTTTC AGTCGCTGTA TAAGGATCAT CTTTCCCATA GCTAGCTACT 4440 

TCACGAGAGT CAGCCTGACT AGGTGACTTG TAGGTTCTTG AGTAAATATT TACAGAAGAC 4500 

GTTTTCTTAC CTGTTGGGAA CTGAACTTGG GCAGTTTTGG TTTGTCCTTG ACCTTGAGTA 4560 
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ATAACCTGAA GAACTGGTGT ATTAGCATAG GTTGGGTAAG AGTCACCTAA ATTCAAGTTC 4620 

ACAAAGTTTT GATGAACAAA TGGGAACTGA CTGTTAAAGT ACAAGAGATA TTTATGTTTA 4680 

GTTCCTGAAC CGACCAATGA CCATCCGATA GCTGGATCAT TTTCAAAACG AAGGTAGCGT 4740 

TTCAAGTCTG GATTTTCAGG GTCTTGGATT TTATTTGTAT GGTCAATGTC AATCAAGTTA 4800 

GCATAGAAGT GAAAAACACG TTCAAAAATT GGAATTTCAC GAGTAGCATA GAATTGACCA 4860 

CTTTCAGTAA ATTCTCCCAA AGTCCAACCA TGACCTAATT GATTGATGTA CTTTTCATAA 4920 

ATAGCTTTAT TGGTCGCATT TGCTTCTACT GTTACAGAAG AATCCATGCT ACTTGCCTTT 4980 

TCTTGCAACT CTTTAGTATC GTAATACTCA ATGTAGCCCA TACGCTCAAA CACAGTATTT 5040 

TCATAGTTAT CACGTTTATC AGCCGTTGTC GCAATTTTAT TATAGTTAGG ATCCTGCTTG 5100 

AAAATCAATT TTCGAGGAAC CAAGGTATAG ATAATCGTGT AGGTCAAAGT CGTTACTAAG 5160 

AAAATCGAAA CCAATGACCG CAAAACACGC ATAAAAATAT ATTTTTTCAT ATTATTTCCT 5220 

TTAAAAATCC CAAAAGAACC TTCTCCTCAT GGAGAGAAAG TTCTATTAGA AATTATTTAC 5280 

TTCACATGAC TTGCCAATTC TTTTTGAGCT TTCTCATTTG ATTCAGCTTT TTCTTTCAAC 5340 

CATTTTTCAC GAGCTTTTTC ATACTCTTCC TTAGTCACCA CTTTATCTTG TGATTTCAAA 5400 

TATTTGAAGT AAACATCTGA CCCCTTAGAG CCTGTTTGCG CAGAAGCTCC AGTAAATGGA 5460 

ACAATTCGTG AAAGCACTGG TGCTGCACCA GAAGAAGCCA TAGCAGGAAT AAAGAGTGAA 5520 

CTATCTGTCA ACCATGCTTG AGCCGCTGCA TATTTTTCAT AACGGACATT CAAGTCGCTT 5580 

GTCTCTCTGG CAGCTTCATC AACTAATTTA TCGTATTCTT TCAAACCAAC TTGAACTACT 5640 

GAAGGGCTAT TTGGATTATC AAATCCTAAA TATGTTTTTG TAGTTTCACT GCTAGTTGTT 5700 

TTTAAAATAT CCAGGTAAGT AGATGGGTCT TGATAGTCTG GCCCCCATGA AACTCCTCCT 5760 

GATACATCCC AATCCTCAGA TGAAGCATTG GCAGCATAGT AAGTAATATT AAGGAATTCA 5820 

TCACTTGTCA TTTGTTGAAT ATCAACAACG ACATTTTCAA CACCAAGAAC TGTTTCTACA 5880 

GATTGTTTAA AGGACTGAAT ACGAGATATG TAGTTTTTTG ATGCTTGGTC TACTGGAACG 5940 

TCCAGATGAA TAGGAAACTG AACGCCGTCT GCTTCTAAAG CTTTCTTAGC TTTCGCAAAC 6000 

TCTGCCTTGG CCTTGTCAGC ATTGAATAAA CCATCCTGCC CATCAGCTAA ATTCACACCT 6060 

TTCCACTCAT CACCATAAGC AGGAAGTTGA GCAGCGACTA AATCACCAAA GGTCTTCTCA 6120 

CCAGCTGAAA CAAAGTCTGG TTTTACAAAT AAATTACGAA CTGCTAAAGC TGCTCCATCT 6180 

TTACCATTGA TTTGAGCTGA GTAAGCTGAG CGATCAAGAG CAAAATTCAA GGCTTGACGG 6240 

AAATCTTTGT TAAGCAATGC CTTCTTAGTA GCTACTTTCT CTGAATCTGT AGTTTTAGAA 6300 

GTATAGTTGT AACTTTGGCG ATCAATATTC ACACCCAGAC CAGCAATCCC AGAGCCTGAT 6360 
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TGTGTGTAAT AGATATTGTC CTTGTATTCT TCTGCAACCT TAGAATAGTT GGAGCTGGTA 6420 

GGGTAAAGAC GGGCATAACT ATAAGCTCCA CTAGTGAAGT TACGCTCTAG CGACTCCTGA 6480 

TCTGATCCAT CATAGTAAGC TAGATTGATA GTATCTAGGT GGACATTTTC TTTATCCCAA 6S40 

TATTGCTCAT TTTTTACAAA CTCTACAGAA GATTTTGCAG TCAACCCTTT CAACAAGAAT 6600 

GGACCATTAT AAAGCAAGGA TGTCGGATCT GTTGGTTTAG CAAAATCGCT TCCTTTTGAT 6660 

GTTTCGAATT CTTCATTCAG AGGCCAGAAA ATAGAATAGG TCAACTTAGA GTTCCAGAAC 6720 

GGTTCAGGCT GGTTCAAAGT GTATTGTAAC GTATAATCAT CAACCGCCTT GACACCAACT 6780 

GTTGAAAAAT CTGTTGAAGT TCCTGATAGA TAATCTGCCA AGCCTTTAAC CGAATTTTCA 6840 

GCTAAATACA TAGCTTCTGA TTTTTTATCT GCTGCGTGTT TTAAACCGTT CACGAAATCT 6900 

TTAGCCGTCA CCTCTGCATA TTCTTCTCCA TCAGAGGTAA ACCATTTAAC CCCTTTACGA 6960 

ATCTTATAAG TGTAGGTCAA ACCATCCTTA GAGACTTCCC AATCCTCTGC AACTGCAGGA 7020 

GCAAGATTAC CGTAATTATC GTTAGTGAAT AAACCATCAA TCCCATTTGA AGTCACTACT 7080 

GTTGTACTAT TTTTACTTGA AATCAGGTAG TCCAAGGTTT CTGGGTCTGC TGTATAAACA 7140 

TAGCCATAAG CTTTAGGGGC TGATGAATCA GATGATTTTG AAGAACTGCA TGCTGCAAGT 7200 

ACACCTGCTG CTAATAAAAC AAGACCTGCT GTAGCAAATA CACGATTTTT TTTCATTTTC 7260 

TACTCCTCTG TTTATGTGAA TTATAGATTG ACAACCATTA TATCACATTA TCCATTAAAA 7320 

ATCAAACAAA TTTTCAGAAT ATTTAGGCTT GTTGGCACAA ATTTTTCATT TTTTTTGAAT 7380 

ATATGATTCA AATTGTCGTT CGAAGTGTCA AAGACTACAG TGAAAATAGG AAATTTGACG 7440 

CAGAAACTTT GGAGTTTAGG AAGACATACA GTAAAATGAA ATACGGACGG AACAATGTGA 7500 

TTTTGGAATT CAAATTAAAT TATAACAATA TTGTAGAAGT ATCATTCTAG TATTCAAGAT 7560 

TCAGTTTACT ATGTCTTTTC ACACCAACCT TATCCCGAAT TCAATTACTT TTGTGATTTA 7620 

CATATATAGA TTAAGACTAT CTTTTATACT TTAAAATTTC TCGCTACCTT ATCCACTATA 7680 

TGCTCCTCGC TATCACGTTT CTATTCATAG CCTACGATTT CACTATTGCT TTCTCTGACA 7740 

ATTCTTATTT CCTGCGTCAG ACTTAAAACG ATCTATCCCC AGACCATTTT AATCCGCTAC 7800 

CTCACGATAG TCAGGCTTGG GGAGCGCTAT TGTATTCACC GGTAGTGGAG CCCTACAGAG 7860 

GACTTACACC TCAGATGCAC GACATGCCCA TCGTATAAAA AATCTCCTAC CCAAGGTAGA 7920 

AGATTTCAAA CTTATAAAAC TTAATCCGTC ATGTCCGATA CCAACATTCG ATGCTCCAAT 7980 

GGAATACTGC ACATAACTAG CAAGAAAATA AAGCCTGACT GAATCCAGAA GAGAGCCAAG 8040 

TCAAAAATTC CGTGCACAGC AACCACTGTA AGGAAAGATA GATAAAGGCC GATAATCGGA 8100 
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CGTTTCCCCG 
ACTAATAAAA 
GCATGTTCAT 
TCACCCCAAA 
TCTTCCATAG 
CTCAAACCAA 
GTAAAGAGAT 
AAGTTCAAAC 
AACTTGGTCG 
TAATAATTAG 
GGTGAAAGAA 
GCAGCTGACA 
GATAAAATCG 
ACAAGACCCA 
AGCAGCATCG 
AAACTGATAA 
AGGATAATTC 
CCTATTGATT 
TATCCTATAT 
AAAACCCTAT 
TTACCCGATA 
TATTTTGACG 
CTAGATCAAA 
CAGGTCGTTG 
GACCTTCTTT 
AGTAAATCTG 
CATCATCATC 
TATACTCGGC 
AAGCATAGTC 
CATGGGTATC 



ACTCCTGACT 
TAGTCCCCAC 
GATAAGGAGC 
AAGGATTTTG 
AAGAGTCTAA 
TCGCGAAGAC 
AGATAATTGC 
CAAAGAGATT 
TTGTAAACAG 
GATTAAAGAA 
AAGCATAGTT 
AGACACTACC 
ACTGATAGTG 
TCCAATTTTG 
GATGCTCCCC 
TATATAAACA 
CCAATAAAAT 
TCAAAATGAA 
TGTACCACTT 
TTTATCCACC 
AGTCTGATTT 
TTTGATTACA 
GGCTTCTGAA 
CACTTCTTTT 
AGTTTGTTCT 
AGGAATACCT 
TCCAAGCGCT 
ACTAGAGTAC 
AATCTCCTCA 
TAGCGTCGTA 



CATATCCATC 
AATTCCGTAA 
ATGTATCCGA 
CTTAAACAAG 
AGTACCCATT 
CCCAATACTA 
TCCAGCGATA 
AACAAAGCCT 
ATAGAAAGCA 
GGTCACTTCT 
AAATTTCTTC 
AAACAAGACA 
CAAAAAGAAA 
TGCAAGAATG 
CATTTTCTGA 
GAGTACAACT 
CAGCTCTTTA 
TCCTTTCTCT 
TTTTAGCAAT 
ATATCAAGGC 
TGCAAATCAA 
ATGCTGTGAG 
CGGTTACGGA 
GCTATTTCCT 
AATAATTTCA 
GGGGCAAAAG 
GAATAGTAGG 
TTACGTTTGA 
TCGGTCAGGA 
AATTGCTTCA 



756 
ATCAAGCGAA 

CTCAGAATCG 

GGATAAGAGT 

GCCATCCCAG 

CGAACTCCCA 

AGCCAAAAGG 

ATAGCAGGAA 

GCAATCACAC 

ATCATAATAC 

GCCCGGTTCT 

ACAATTTGGA 

AACTGCAAAA 

ATAGTAAATA 

GATATAACAG 

AGAATACTTT 

ACAAAAAGAT 

CTAGACAACC 

CCAAATCAGC 

TTGAAAACAA 

TTCAAAATGA 

TTTCTACTAC 

CTGTTGGTGT 

AAGAAAATAG 

CGTTGCTATA 

AGTCATTCTT 

CTTGAATTAG 

TTGAATTGAT 

CATTGGCTCC 

TATCCTTGAC 

TCGGGCTCAT 



CAGGAGCAAC 
TATCAATATA 
TCATATAGGT 
CATCCCAGAT 
AATCACTAGA 
CCTTCCAGTT 
AGGCAGTTCG 
AGAATACTTT 
AGAAACAACA 
GATGCCACAC 
AATGTTCTAA 
TCAATCGAAA 
GAAACATTCC 
TACTATAGCT 
TCATGTCTCC 
AAAAGGGTAA 
CCTTCAGCTT 
TGATTCAGAT 
AGGAAACGTT 
TACTTCAACT 
TGCTGTTACG 
CTCTATCTCA 
ATTGAGAAGG 
GTAATGACGA 
GCCTGCTAAT 
ACGAGCGAGA 
TTGGTAGATA 
AACCTTATAG 
ATCTACTACT 
CTTTAACCAC 



AGAAGACAAA 
AAGACTGTGG 
CAATGGCCCT 
AGAAATGCGT 
AAAGAGGAAA 
TTTAATAGTC 
ATTTTGAGTA 
CAACCAATTC 
AATAATTCCA 
CTGCATATTG 
ACTGGCAAAA 
GAATTTATGG 
TACTGAAGCC 
AAGAAAAAGA 
TGTAAAAATC 
AAAGATACTC 
TTCAATAAAG 
AATAGTAAGC 
TTCCAAAATA 
CCATTCTCAA 
GACTTATCTT 
GTAGTCCCTT 
GCCTTCACAA 
TTAATATTTC 
AGACCCACAT 
AAATACTTGA 
TCTAAGTTGT 
AGTTCATTTG 
CCAATCCCAT 
TTAGCCAAAC 



8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
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GCTCTGTTCT GGAACTGTAA AGAGTATAAA GTGTCACCAT TGGAAGAGCA AAATCATAAA 9960 

CATAGTAATC ATGGTCTGCT ATTTTAAACT GAATCGAATA GTGTTCATGA ATCTCAGGTA 10020 

AAAGCTCTGT CCCATACTCA GCAGCGATAT CTCGAACTTT GTCCAATAAA TCCCAAATAT 10080 

CTGGTTCCAC AAAGAAATCA TTAGTATCCA ATTTCTTCAC TGCATAAGCA AAGGCATCTA 10140 

GACGAATCAA ATCACACCCA TTACTTGCCA AGTGCTGAAT GGTCTTACGG ATAAATTCCA 10200 

TAGTTACTTC TTTGGTCACA TCAAGATCAA TCTGCTCCTC ACCAAAGGTA TTCCACAAAT 10260 

GTTCCACTGA ACCATCTTCA AACACAATCT CTTGCTTTGG TGCACGATCC TTACGCTTGT 10320 

AAATTAAATC TACATCAGAC TGTGTCGGAC GGTTTTCTGG CCAAAACTTA TCCCAGTTTA 10380 

AAAAGAGAGC TTTAAATTCA CTGGCTTCAT GTTTTTCTTG ATAGTCCTTA TAATACTTGG 10440 

ATTGACGAGA AATATGATTA ATCATAAAAT CAAACATAAG ATAATATTTC TCACCTAAAC 10500 

GCTTCACATC CTCCCAATCA CCAAAAGCTG AGTCCACTTC GTCGTAGTCA ACTGGCGCAA 10560 

ATCCACGATC AACTGTTGAT GGGAAAAATG GTAAAAGGTG AACTCCTCCA ATAGCATCTC 10620 

CAAAATGCTC TTCCAAATTA TCATATAAGT CTTTAAGATT ATTTCCAAGG CTATCAGAAT 10680 

AGGTAATCAA CATGGTTTTA TTTTGAATTG GCATCATTAC TCTCCTTTTT CTAATTGAAG 10740 

CCAAGTCTCA TATGATCTGG CTTCATAAAT AAAATTCATT TTAAATCTCT ATTTATCATC 10800 

AAACTCGTAC TAATATAGAC TGTGATAAAC AAAGTACTAC TTTCTTGTTT TCTGCATAGA 10860 

ATTATCAACA AGCTAAACTC TTCCTCTGTG TCAAAGACTA TAGATTCCAT GAGCTCTTCT 10920 

TATACTCTTC GAAAATCTCT TCAAACCACG TCAGCTTCAC CTTGCCGTAG GTATGGTTAC 10980 

TGACTTCGTC AGTTTCATCC ACAACCTCAA AACAGTGTTT TGAGCAACCT GCGGCTAGCT 11040 

TCCTAGTTTG CTCTTTGATT TTCATTGAGT ATTACTTCAC TGCCCCGTTG CTCATTCCTG 11100 

AAATGATATG GCGTTGGAAG AAGAGATAGA CAATGGTGAT ACTGATAATG CCGACCACGT 11160 

AAGAGGCAAA GCTTGGTCCG TAGTCGTTGA AATATTGGCC TGCGTAGTTG TATTGGAACA 11220 

AAGGCAGAGT CCACATTTTG GAATCCCGGT TCAAGACAAG GAGTGGCAAC ATGAAGTCAT 11280 

TCCAGAACCA AAGGGCATTG ATGATCATGG TTGTCGCATG CATCGGTTTC ATCATTGGGA 11340 

AGATGATGCG GAAATAGGTT GTAAATTGAT TAGCCCCATC GATCTCTGCT GCTTCATCCA 11400 

GACTTTCTGG AATCGAGATT TTGATATAGC CAACATAGAG AAAGAGGGTC TGTGGAATCG 11460 

CATAGGTCAA GTAGAGCAAG ATCAAACCAA AGGTATTAGC CAAACCGAGT TTACTCATCA 11520 

TAACCGTAAT CGGAATCATG ATGACTTGGA AAGGTACGAA GATTCCGAGG ATTAAGAGGG 11580 

TATACATGAT GGTAAAGGCT TTTCTTTTAC TCATATTGCG AGCGATGGAG TAGGCTGCCA 11640 
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TAGGGATAAA GATCATTACT GCAAGTAAAG ACAAGACAGT GATGACGACA GAGTTCCAAT 11700 

AATAGCCTCC AATCCCATCA GCTAAGAGAC GGCTAAAGTT GTCCCATGTG AAGTTGGTTG 11760 

GAAAGCCAAA ' GAAATTATCT ACAATATCCT TAGTGGGTTT GAAGGAACTA AAGAGGGTAG 11820 

CAAGGAGCGG CACTAAAATC AGAACCGATC CTAGAATCAA TAGAATGTAT TTGCCAATCA 11880 

GGGCTTTTCT TTCATCTTGT TTCATCATGC TTCTCCTCTT AAATTTCAAA TTTCTTAGAT 11940 

ACTCTCAATT GGATGATCGA AATCACTACA ATTAAGAAGA ACAAGATTAC GGCAATGGCA 12000 

TTGGCATAAC CGAATTGGTT GTTTTTAAAG GCATAGTTAT AAACCAAGAG CCCAAGTGAG 12060 

GTTGTGGCAT TGTTTGGACC ACCACCGGTC ATGGCAAAGA CTTGGTCAAA GGCAGTCAGC 12120 

CCACCTTTTA GGGCTAGGAT AAAGACCATA GAGACACTTG GTAGCAAGTA AGGCAATTCA 12180 

ATGTTCCAGA AAACTTGCTT GCTAGTCGCA CCATCAATCC TTGCTGCCTC TGTAATCTCA 12240 

GTTGGAATAG ATTGCAAACC AGCTAGGAAG ATGATGATGG GCATAGCCAC CCCTTGCCAA 12300 

AGAAGGACAA AGACAGCCGC AAAGATTGCT CCCCACTTAG TCCCTAAAAG ACTGGTTTGG 12360 

AAAAATTCAA TATGAAGGGC ATTTCCAATC GCTGGAAGAC CGTAGTTGAA GACTTGCTTG 12420 

AAGATCAAAG CCACTGTCAA ACCAGATAAA ACAGCTGGGA AGAAGAACCA AGCACGGAAG 12480 

AAGGTTTGGC CTTTGATTTT AGAATTCAAG ACACGCGCAA TGAAGATCCC GAGTGCAATC .12540 

TCACCAACCA CCATGGCAAT CGCAATGATT GCGGTAAAGC CAATCGCATT CATGAATTTT 12600 

GGATCCATGA AGAGGAGCTT AAAGTTGTTT AAGCCAACAA ATTTGTAGTT ATAAGTCAAT 12660 

CCTGTCCAGT TGGTAAAACT GTAAAAGGCT CCTTGAAACA TCGGCACATA GAAGAAAATT 12720 

GCTTGTAACA AGAGGGGGAT GACCACAAAA GCCCATGCCC AATATTTTTG TAATACTTTT 12780 

TTCATAGTCT CTCTACTCCT AATCCACATC CGCTTTCATC GGGTTAAAGA AGGCATTCAA 12840 

ATCATTGACC ATGCCTTGTT TATCACCGGT CAAGACATAG TTCATGGTCA AGGTATGGAA 12900 

GTCTGCTTCA CTGGTCCAGT ATTGTTGCAA CCAGACCAAG TGACGATCCG TAAAGGCATA 12960 

TTCGGTCATA CCAGCAAGCG GTGAATCTTC TCCTGCTTGT TTGACCCCTT CGATCGCTGT 13020 

TGGAGATCCG TCCACATCGT AGTATTTTTG CATGACTTCT GGACGGGTCA TATATTCCAC 13080 

AAAGGCATTG GCTTCTTTTG GATGTTTGGT GGTGGCTGAG ATAGACCATG CCAAGTCTCC 13140 

CGCACCAACG GTTAAGCTTT GTCCTTTTTC TTTTCCTGGA ATCATGAAGG TCCCAATCTT 13200 

AAAGTTCGGT TTTTGTTCAT TAATCGCTGT GATCGCCCAA GACCCATTTG GTGTCATGAG 13260 

GACATCCCCA CGTGCGAAGG CTCCGATAAC ATCGGTATAG CCAGCACCTT CCCAGTTCTT 13320 

TTGCTTAGAT CCATTGATGC GAAGGATGTC CATGACCTTG ATATCATCTT TCATAATCGG 13380 

ATCCGACAAT TTAATGGCAT, TTGGTTGAGA ATAACGAAGG TATTGATTTG CTTCTTTTCC 13440 
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TCCACCTGTT GCTGTCGCAA AGGCTAATTG ATTGTAACCA TTGAGTGTCC AAGCATCTGC 13 500 

ACCTGCAATT CCAAATGGTG TTTGTCCTTT AGCAACGATA TCTTTGACTA ACTGTTCAAA 13560 

TTCATCCCAG GTTTCAGGAA CCTTCAAGCC CAGTTCTTCG AATTTATCTT TGTTGTAGTA 13620 

AATTCCATAA GCATTAGCTG TAAAAGGAAC GTTGTAAACT TTTTCGTTTA CAGCATATTT 13680 

TTCAGCGTAG CCATTTTTCA CGCGTTTCAG GTAGTCTTTG TTGCTCAAAT CTTCAAAAAC 13740 

ACCTGCTTTT GCCCATTCTT GCAGTTCGAT GGACTGTGGG TAAATATTGA CCACATCAGG 13800 

CACATCTCCT GCGAGAACGC GTGTCTTCAA TACTTCACCA GCATTTGGTA CATTGACGAC 13860 

TTTGACCTTG ATCTTAGGGT TTTCCTTCTC AAAATCACGA GTGATTTCTT CCAAGGTTTT 13920 

GGTCATTTCT TTTTTCTGGT TGAAATACTC GATGGTCACT GTGCCATCCG CAGATTTACC 13980 

ATAGTTGGAG CAAGCGCCGA GCCCAAACAA AGCTAAACCT GTAGTTGCAA GAAGTCCGAT 14040 

TTTTTTATAC CATTCCATTA GAAAGCCTCC TTTATAAATT TATACACCCT TATTGAACTG 14100 

CACCCCAAAA GTTAGACAGA ATAAATCTAA CTTTTGGGGT CAGTACATAT CATAGTTTTC 14160 

TAAAAATATA CTGTCTACTC AAAAAATCTC CTTGGGATAA GATAACAGTT AAGCCCGCAT 14220 

ACATTAGTTC TGCACCTGAG TAAACTTCGC CATTTTCCTG TAATTTATAT AGTCCCTCTT 14280 

CATCCAAATC TTTTAATTTT AAAGTTGTTT CCATGGTCTC TACAACAGAT AAAACGCGAA 14340 

CGTAGGTTAC AATCGTTTGA TTTCCGTAAT TAAATTGTAC AGCTGCTTCA TTGGATACAG 14400 

TATCAGGATT AATTAGTCTA TACTGCTGTC CTAACTGAAC TACTGGTCGT AATTCTTTAT 14460 

ACAAGTTCAC CTGATTAGCA ATCGTAGCTT TCTCTTCATC TGATAAATTT GTCAAATCAA 14520 

GTTCATAGCC CAAATTTCCC ATCATTGCTA CAAGGCCACG TGTTTCTAAT GGTGTCATTC 14580 

GTCCCATCTG ATGATTCGGT ACTGCTGACA CATGAGCCCC CATAGAAATG GTTGGATAGA 14640 

GATAGGATGA ACCGTATTGA ATTGGTAAAC GTGCAATGGC ATCAGTATTA TCACTAGCCC 14700 

AGACTTGTGG GAAATAGCGC ATCATACCAA GATCATTTCG TCCACCACCA CCAGAGCAGG 14760 

ACTCAAAGAG AATATGGCTG TGCTTCTCTG TCAGATAAGA AACGAGTTCA TAAAGCCCCA 14820 

GCATGTACTG ATGAGATTGC ATCTGTGTCT CTAGATAAGT TAATCCATTC CCTAGCTTAG 14880 

TGATATTGCG GTTCATATCC CATTTAATGT AATCAATATC ATGATAAAAT AGGAGTTGAT 14940 

CTAAGACACT TTTCAAGTAT TCTACTACCT GAGGATTGGC AAGATTAAGT ACTAATTGAT 15000 

TCCGAGAATA AGTATGCTCA TAGCCAGGAA CCTGAATAGC CCAGTCAGGA TGTTGACGAT 15060 

ACAAATCACT ATCTACAGAA ATCATTTCGG GTTCTAACCA AAGTCCAAAC TGCAAACCTC 15120 

TTTCATGGAT AGCTGAAATC AGACTTTCTA GACTTCCACC CAGTTTTTCC TCATTAACAA 15180 
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CCCAATCACC TAAAGCACGA TTATCATCAA AACGATTGCC AAACCAACCA TCATCTAATA 15240 

CAAAAAGTTC AATGCCAACT TTCTTAGCTT CATCTGCTAA CTCTAACAGT TTTTCTCTCT 15300 

GAAAGTCAAA GTAAGTAGCT TCCCAGTTAT TGATTAGAAT TGGACGTTCT TTTTTAGAAA 15360 

ATTCACTTAG CATAATGTGC TTCAGTACAA AATTCTGACT TTCATGACTA ATACCAGTTA 15420 

ATCCCTGATC TGAATGAGTC ACTAAAGCTA CCGGTGTTTC AAAGTATTCC TCAGGAGCTA 15480 

ACTTCCAAGA AAAGTTTTCT GGATTAATGC CAATAGCCAC CCGAACTTCA TTCAATTGAT 15540 

TTTTTTGAAC AAAAGCTTCA AAGTTGCCAC TATACATTAG TTGAATAGCA AACACATTCC 15600 

CAGCATCCTC TGTGACTCCT TGTTCGCATA GTAGAAGAGC TGGTGTTTGA GCATGACCAG 15660 

AAGCACCTCG GTTTGAACTA ATCGAAAAGA TTCCTTGTTC TACCTGTTGA CGTCTAACAG 15720 

TCTTTTCACG AGCATAAGCA CCCTGCAGAG TTACTATTTC GTAATCTGCA GCTGGAAAAT 15780 

CAGCCATAAA AGAAAAATCT TTATGGATGA CAACTTCCTG ATTACTATTA TTATCTAATT 15840 

TACTGTAGCT AGCAATAGTC GCATCATTAT TAAAAGTAGT ATAATACAAA GTCAGACTAA 15900 

GTTGAGCCTT AGAATCTTCT AACATTAAGA CAAGAGTCTC TGTATCGTCC ATGCTATGTG 15960 

GAGAAGGTAA GCCCTGTGGA CCATTCTGAC CTTTTAAAAT CTTTGCTTCT ACAAATCGAA 16020 

AGTCTGTTAC TTCAGTTACA CTATGCTGAA CCTGTATGGT TGGTTTCCTA AAATCTCCTA 16080 

AGCCATGTTG TCCAAAAATC TGTCGCTGAG TATCTAAACT AAAGGTTCGA TTAGTAGCCG 16140 

TTGGATTTCC TGAAAAGGCA TGGTCTCGTT CATAAACACT ATTGGAACCT TTATAGTTCT 16200 

TAATAGTCTT TCCTAAATGT TTCAAAAGTA AGTAGCCATT TCGATTTTCA ATAATCAAAC 16260 

TTAGATTTTT ACTCTCAACA TAAAATAGAT TATTCTCTAT CCTAACTCCC ATTTACTTCA 16320 

CCTCATCACT TTATTGATTA TATTTTATCA CCTGAAATCG CTTTCCAAAA TAGAAAAATG 16380 

TCTCAAGAAT ATGGTAAAAT GTTAGGTAGG AGGTAGCACA TGTTAGTTTT TTCAGAATAC 16440 

CAGACTGGAA CAATCGACCT TGCCCTAAGC TTTTATGGAT ATGAGGAATG CACACCTAAT 16500 

TACTCTTTTG GTCCAGCCAT TCGTGATACA TACGTTCTAC ATTACATTAC TAAAGGACAA 16560 

GGAAAATTTC ATTACAAGGG TAAAATTGTT GATTTAAAAG AAGGAGATTT CTTTCTATTA 16620 

AAACCAGAGG AACTAACCTT TTATCAAGCA GATAGTAAAG AACCTTGGGC CTACTACTGG 16680 

TTAGGAATCA CTGGAGGGAA AGCCCCTGAT TATTTTGCTC TTTCCCAAAT TTCTGATCAA 16740 

TCCTATCTCA TCCAATCTGA AACTTGTCAT ACCCAGACTA CTGCAAAACT CATCTCAGAC 16800 

ATTGTCCGCT TCGCTCAGAT TACAAAATCA AGTGAATTAG CTCAACTCCA TATCATGGGA 16860 

CAACTTCATG AACTGATGTT TCATCTGGGA ACTATTGCTC CCAATCAGAA AAAAAAGAAT 16920 

ATTTCATCAA CCCACCAACT CTATCTTGAA TGCAAACGAT TAATTGATAG CCACTATCCT 16980 
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CAATCACTTA CAATTCAAGA TTTAGCAAAA GAACTATCCG TTCACAGAAG CTACTTATCA 17040 

AGCGTATTCA AAGAATTTAA TACCTTATCA CCCAAAGAAT ACCTACTCTA CGTTCGAATG 17100 

CACCGAGCTA GACAACTTCT CGAAAATACC CAAGAGTCCA TCAAGGTAAT TGCATACTCG 17160 

GTAGGTTTTT CAGATCCACT CCATTTTTCG AAAGCTTATA AACAATACTT TAATCAGACT 17220 

CCAAGTCATA CAAGAAAAGA ATACTCTCAA TACCAACTAG TAAGAAAGGC AACATTATGA 17280 

AATCCTACCA AGCTGTCTAC CAAATCCTAT CTAAAGAAAC CGACTATATC AGCGGAGAAA 17340 

AAATCGCAGA AAAACTATCC CTAAGCCGAA CAGCAATTTG GAAAGCCATC AAGCGACTAG 17400 

AACAAGAAGG CATTGAAATT GATAGTATCA AAAATAGAGG ATATAAACTG ATGAATGGTG 17460 

ACCTTATTCT TCCAGAGATT CTAGAAGAAA ATCTTCCAAT TAAAGTCAGC TTTAAACCCG 17520 

AAACAAAATC AACACAACTA GATGCAAAAG AAGCAATTGA TTTAGGCCAT GAAGCAAATA 17580 

CCCTCTATCT AGCTTCCTAT CAAACAGCAG GCCGAGGCCG TTTTCAACGT TCCTTCTACT 17640 

CACCACAAGG TGGTATTTAT ATGACACTCC ATCTTAAACC AAATCTCCCC TATGACAAAT 17700 

TACCATCCTA CACACTACTT GTAGCTGGAG CTGTCTACAA AGCCATTAAG AACCTAACTT 17760 

TAATAGATGT CGACATAAAA TGGGTCAATG ATATCTATCT AAACAATCAT AAAATTGGAG 17820 

GAATCCTTAC TGAAGCAATG ACCTCTGTAG AAACTGGCTT AGTCACAGAT ATCATTATTG 17880 

GAGTAGGTAT CAATTTCACT ATTAAAGACT TCCCTCAGGA ATTAAAAGAA AAAGCTGCCA 17940 

GCTTATTTAA AGCTACAGCT CCTATAACAA GGAATGAATT GATCATAGAA ATCTGGCGTG 18000 

CTTTCTTCGA AACACCAGCA GAAGAGCTAT TATACCTATA CAAAAAACAG TCATTCATTC 18060 

TAGGAAAAGA AGTCACTTTC ACACTAGAGC AAAAAGACTA CAAGGGACTT GCTAAAGACA 18120 

TCTCAGAAAA TGGAAAACTT TTAGTTCAAT GTGATAACGG AAAAGAAATC TGGCTAAATA 18180 

GTGGCGAAAT TTCTCTCAAT AGTTGGAAGT AAAATAACAC AATTATAATA TAAACGATAT 18240 

AAAAATAACT TCAGATTAGT AATTCAATTA AGTTTTACGG ATCTGAAGTT TTATTGGCTC 18300 

TAAAAATAAA AAAGAGAGTT ACAGACTCTC ATTAAAACGG AGAATAAGGG ATTCGAACCC 18360 

TTGCGCCAGT TACCCGACCT AACGATTTAG CAAACCGTCC TCTTCAGCCT CTTGAGTAAT 18420 

TCTCCAATTA ATGGGCACGA GTGGACTCGA ACCACCGACC TCACGCTTAT CAGGCGTGCG 18480 

CTCTAACCAC CTGAGCTACG CGCCCAAGTT AAAAAACTTG GTAATTTGAA CAAAGTTCAA 18540 

AGCGGGTGAC GAGAATCGAA CTCGCGACAA CAGCTTGGAA GGCTGTAGTT TTACCACTAA 18600 

ACTACACCCG CATAAATACT ATCAATAAAA TGGCGCGAGA CGGAATCGAA CCGCCGACAC 18660 

ATGGAGCTTC AATCCATTGC TCTACCAACT GAGCTACCGA GCCTTATTGC GGGAGCAGGA 18720 
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TTTGAACCTA CGACCTTCGG GTTATGAGCC CGACGAGCTA CCGAGCTGCT CCATCCCGCG 18780 

TTAATAATAT AAAAGGAGGA TGTGGGATTC GAACCCACGC ACGCTTTTAC ACGCCTGACG 18B40 

GTTTTCAAGA CCGTTCCCTT CAGCCGGACT TGGGTAATCC TCCAATATTC AAATGGACCT 18900 

TGTAGGACTT GAACCTACGA CCACTCGGTT ATGAGCCGAG AGCTCTAACC AGCTGAGCTA 18960 

AAGGTCCGAC AAGATCATTA TAGCGGCGAA GGGGATCGAA CCCCCGACCT CCCGGGTATG 19020 

AACCGGACGC TCTAGCCAGC TGAGCTACAC CGCCATGAAT CGGGAAGACA GGATTCGAAC 19080 

CTGCGACACC TTGGTCCCAA ACCAAGTACT CTACCAAGCT GAGCTACTTC CCGAGTTAAA 19140 

TAGAAAAATG CACCCTAGAG GAGTCGAACC TCTAACCGCC TGATTCGTAG TCAGGTACTC 19200 

TATCCAGTTG AGCTAAGGGT GCTCCATATT ATGCCGAGGA CCGGAATCGA ACCGGTACGA 19260 
TCGTTACCAA TCGCAGGATT TTAAGTCCTG TGCGTCTGCC AGTTCCGCCA CCCCGGCCTC " 19320 

TCTAAGCGAA CGACGGGATT CGAACCCGCG ACCCCCACCT TGGCAAGGTG CTGTTCTACC 19380 

ACTGAACTAC GTTCGCACTG TTTTCTTCTA TCTAAAAATG CCGGCTACAT GACTTGAACA 19440 

CGCGACCCTC TGATTACAAA TCAGATGCTC TACCAACTGA GCTAAGCCGG CTCATTTGTT 19500 

ATATCTTAAT GCGGGTTAAG GGACTTGAAC CCCCACGCCG TTAAGCGCCA GATCCTAAAT 19560 

CTGGTGCGTC TGCCAATTCC GCCAAACCCG CATATATGAC CCGTACTGGG CTCGAACCAG 19620 

TGACCCATTG ATTAAAAGTC AATTGCTCTA CCAACTGAGC TAACGAGTCT AAAATAACTT 19680 

GCGTTACCTT AAACGGTCCG ACGGAATCGA CCCGGTAC 19718 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

CCGTGGAAAA GTCTGGATAG TGAATGGTCT TCACACAATG • ACCTGAAAGA AGCCTGAGAA 60 

TAATTATGGA GAGTAGCATT CTGAGAGGTG TTAGCAGAAC CATATGACAG AGCTGTTTGA 120 

AGAGGGAATA TTGAGGAGAA AAATCCTGAG CCTACCAGTT GGAGTTGGAA AGAGCTGACT 180 

GTTAGATCAT GGTTTATTAT CCACAACCTG TGGATAACTT TGTGAATAAG AGAAGTTGCT 240 

AAAGAAGGAG ATATATAACG ATGAAGAAAA TCAAACCGCA TGGACCGTTA CCAAGTCAGA 300 

CTCAGCTAGC TTATCTGGGA GATGAACTAG CAGCTTTTAT CCACTTCGGT CCTAATACCT 360 

TTTATGACCA AGAATGGGGG ACTGGACAGG AGGATCCTGA GCGCTTTAAC CCGAGTCAGT 420 
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TGGATGCGCG TGAGTGGGTT CGTGTGCTCA AGGAAACGGG CTTCAAAAAG TTGATTTTGG 480 

TGGTCAAGCA CCACGATGGC TTTGTCCTTT ATCCGACAGC TCACACAGAT TATTCGGTTA 540 

AGGTCAGTCC TTGGAGGAGA GGAAAGGGCG ACTTGCTCCT TGAAGTATCC CAAGCTGCCA 600 

CAGAGTTTGA TATGGATATG GGGGTCTACC TGTCACCGTG GGATGCCCAT AGTCCCCTCT 660 

ATCATGTGGA CCGAGAAGCG GACTACAATG CCTATTATCT GGCTCAGTTG AAGGAAATCT 720 

TATCAAATCC TAACTATGGG AATGCTGGTA AGTTCGCTGA GGTTTGGATG GATGGTGCCA 780 

GAGGAGAGGG CGCGCAAAAG GTTAATTATG AATTTGAAAA ATGGTTTGAA ACCATTCGTG 840 

ACCTGCAGGG CGATTGCTTG ATTTTTTCAA CAGAAGGCAC CAGTATCCGC TGGATTGGCA 900 

ATGAACGAGG GTATGCAGGT GATCCACTGT GGCAAAAGGT GAATCCTGAT AAACTAGGAA 960 

CAGAAGCAGA GCTGAACTAT CTTCAGCACG GGGATCCCTC GGGCACGATT TTTTCAATCG 1020 

GAGAGGCAGA TGTTTCCATC CGTCCAGGCT GGTTCTACCA TGAGGATCAG GATCCTAAGT 1080 

CTCTCGAGGA GTTGGTCGAA ATCTACTTTC ACTCAGTAGG GCGAGGAACT CCACTCTTGC 1140 

TTAATATTCC GCCGAATCAA GCTGGGCTCT TTGATGCAAA GGATATTGAA CGACTTTATG 1200 

AATTTGCGAC CTATCGCAAT GAGCTCTATA AAGAAGATTT GGCTCTGGGA GCTGAGGTAT 1260 

CTGGTCCAGC TCTTTCCGCA GACTTTGCTT GTCGCCATTT GACAGACGGC CTTGAGACCA 1320 

GCTCTTGGGC AAGCGATGCA GACTTGCCCA TCCAGTTAGA ACTCGACTTA GGTTCTCCTA 1380 

AAACTTTTGA TGTAATTGAG TTAAGAGAAG ATTTGAAGCT AGGGCAACGA ATCGCTGCTT 1440 

TTCATGTGCA AGTAGAGGTG GATGGTGTCT GGCAGGAGTT TGGTTCGGGT CATACTGTTG 1500 

GTTACAAACG TCTCTTACGA GGAGCAGTTG TTGAGGCACA GAAGATACGT GTAGTCATTA 1560 

CAGAATCACA GGCTTTGCCT TTGTTGACCA AGATTTCCCT TTATAAAACT CCTGGATTAT 1620 

CAAAAAAAGA AGTTGTTCAG GAACTAGCAT TTGCAGAAAA AAGCCTAGCT GTGGCAAAGG 1680 

GAGAAAATGC CTATTTTACA GTTAAGCGCA GAGAATGTAG TGGTCCTTTA GAAGCTAAGA 1740 

TTTCGATTCA ACCGGGGACA GGTGTCCATG GTGTCGCCTA TCAGGATGAG ATTCAAGTCC 1800 

TTGCGTTTCA AACTGGTGAG ACTGAAAAAA GTCTGACGCT ACCAACCTTG TATTTCGCAG I860 

GAGATAAAAC CTTGGATTTC TATCTGAACC TAACGGTGGA TGGTCAGCTT GTGGATCAAC 1920 

TTCAAGTCCA AGTTTCATAA AAGAAGAACC TTTGCGCGAT GCAAAGGTTC TTTTGGTTAT 1980 

TAGTGACTTG GTAACCAGCT GAGGGTGAAA GTTAGTTGTT CAGCTTTTAA GAGGTCTTGG 2040 

TGTTGAATAG TTGATACGAG TGTTTTGTCC AGTCGGCATT CTTTGACAAA GTTAAAATGG 2100 

TTGTGGTTTT GTTTAGTATG GATATCCAGC CATTTATCTT CTTTAGCGAG GTAGACTCGT 2160 
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AGATGGTCAA AGAGAGGGAT TCCGAGGTCA TAGCTTGGTT TTCCTGGACA GGTTGGATAA 2220 

AATCCGAGAG CTGACCAGAT GTACCAAGCA GAGAGACTAC CATTGTCTTC ATCTCCAGGA 2280 

TAGGCTTCCC AACTTGGGTG AAAAGCTTTC TGACGGAGCG TCTTGATAAG AAGGGCAGTG 2340 

TAGTCAGGGT AATCGCTGTA ACGGAAGAGA TAAGGAATGT GGAAACTAGG CTGGTTGGAA 2400 

ATGGCTATTT GTCCAAAAGG AGCAGTAGCC ATCTCGCTCA TTTCGTGAAT TTCGTAACCA 2460 

TAGCCTGTTG TTTCAAAGAG GGGAGCATCT TGACAGGCTT TCAAAAGATA GTTGCTAAAG 2520 

GTTTCTTTTC CACCCATCAG TTGGATTAAG CCAGGGATGT CGTGGAGAAC GCCTAAAGTA 2580 

GCTTGAATGG CAGAGCATTC AGCGTAGTCT CGCCCCCAAC TATAAGGAGA GAAGTCAGGG 2640 

TGAAAGTTTC CTTGATTGTC TCGTGCTCGC ATGTAACCTG TCTCAGCGTC AAATAGCTGG 2700 

CGGTAATTTT GTGAAGCAGC CTTGTAGGTT TCAGCGATTT CTATGTTCTC TAGTTTTTTG 2760 

GCACAGCTGG CGATACAAAA GTCACTATAG GCATAGTCTA GAGTATGGCT AACACTTTCG 2820 

TGGTGGTCGG TAGAGAGGTA ACCTAGTTCT TGGTATTGGG CTAGTCCGTG GCGGCCATTG 2880 

ATGCCGAGAG GGTCGGCTTT GCTGGCTGTT TCGAGCATGG CTTGGAAGAG TTCTCCTTCT 2 940 

AGGTCGGGGG TCATGTCCTT GCAGGCGCTA TCTGCGATAA TACCGTCTAA AAGTGTACCT 3000 

GGCATCATAC CCCGTTCATC TGGAGCCAGC CATTTTGGAA GGAAACCAGT ATCGCGGTAG 3060 

CTATTGAGGA AACCTTCTAA AAAGCGTTGA TAGTGCTCCG GTATGATAAG GGCAAAGAGG 3120 

GGGAAGGTGG TGCGGAAGGT ATCCCAGAAA CCATTGTTGC TAAAGAGGAC ACCAGGCTTG 3180 

ACAGTACCAG TAGCCAGATC CATGTGGATG GCTTGCCCTG ATTCATTAAT CTCATAAAAA 3240 

GTCTGTGGGA AGAGGAAGAG TCTGTAGAGG CAGTGGTCAA AGAAGGTTCG GTCAGCCTCT 3300 

CCTGTCTCTA TAATGTCAAA ACGATGGAGG AGATTTTCCC AATCCACTTG GGCACTTGAT 3360 

TTACAGCTAT CAAAATCTTC TTGAGGTAGA TTGATTAGAG CTTGAGAAGG AGAGATGAAA 3420 

GAAGTGGCTA GTTGCATCTC GGTTTGACTA CTTGCTAAGT CAATTCGCCA GTCTCCAGCT 3480 

TCTTGGCTGA TAGCAAGAAT ATCCGTGTTC ATTTGCAGGG CAGTGAACAT CGTTAGCGAA 3540 

TTTTTGTTAG TTTCAGTTTT ACCTTCTTGT CGCAGGGCAA GAGTCCGCTT ATCTACTTGC 3600 

TCTACTGTCA GTTCATCTGC TGCGTGAAGA TAGAGGGAGA GGGCTTTGCC TTGCTTTTGA 3660 

TTCAAACGAA TAGAAGCACC ATAGCAAGTC GGTGTGAGCT GGGTTTCAAT CTGATAACGC 3720 

AGAGAAAAGA GCTTCAAATA GTGAGGCTGG AAGCAAGCTT TATCTATATC ATAAGAAGAC 3780 

TGGCGGTGAA AGAGGCTGTC TCCCCCCAGT TGACTGGTGA CAGGTGTCAG AAGGAGCCAA 3840 

GAGTAGTCCC CAATCCAAGG ACTGGGCTGG TGAGTTAATC GAATCCCCTG AAAGATAGGC 3900 

AGATGTGGAT CAAAAAACCA AGATCCATCC TGGTCACTGG TCTGGGGCAC AAAGTAATTC 3960 
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ATCCCAAAAG GCACGCCTGT GTATGGCAGG GTATTTCCCC GAGAAAAGGC ATGCTTGTTG 4020 

GTAGTTCCAA AACGGGTATC GATGGTATCA AGTAGTGGTT TCATAGTCTT TCCTTTAGCT 4080 

GTTTTTCTAC ATTATATCAG TAATAGAGGG CCTTTAG 4117 



(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2727 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE SS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

CTGGTTCAAT TATTATTCAC TCTAAGTAGT CATATGTTCT TTATTTATGT GAGTTTTTAC 60 

CTTTTAAAGG ATCTTGTTAG ATGGGAGAAG GTTTTAAAAG TGACAGATGA TAATACAAGA 120 

AAAGTTCGTT TATTAGTAGC CTTTTTTAGC ATTGTCATAG GCTACATCCT GAGTTCTTTC 180 

TTTATTAGCC TGTATCATTT GTGGCAAGAA GCGCTTAGAG GATTATTATG AAATCAAGAG 240 

TAAAGGAAAC GAGTATGGAT AAAATTGTGG TTCAAGGTGG CGATAATCGT CTGGTAGGAA 300 

GCGTGACGAT CGAGGGAGCA AAAAATGCAG TCTTACCCTT GTTGGCAGCG ACTATTCTAG 360 

CAAGTGAAGG AAAGACCGTC TTGCAGAATG TTCCGATTTT GTCGGATGTC TTTATTATGA 420 

ATCAGGTAGT TGGTGGTTTG AATGCCAAGG TTGACTTTGA TGAGGAAGCT CATCTTGTCA 480 

AGGTGGATGC TACTGGCGAC ATCACTGAGG AAGCCCCTTA CAAGTATGTC AGCAAGATGC 540 

GCGCCTCCAT CGTTGTATTA GGGCCAATCC TTGCCCGTGT GGGTCATGCC AAGGTATCCA 600 

TGCCAGGTGG TTGTACGATT GGTAGCCGTC CTATTGATCT TCATTTGAAA GGTCTGGAAG 660 

CTATGGGGGT TAAGATTAGT CAGACAGCTG GTTACATCGA AGCCAAGGCA GAACGCTTGC 720 

ATGGTGCTCA TATCTATATG GACTTTCCAA GTGTTGGTGC AACGCAGAAC TTGATGATGG 780 

CAGCGACTCT GGCTGATGGG GTGACAGTGA TTGAGAATGC TGCGCGTGAG CCTGAGATTG 840 

TTGACTTAGC CATTCTCCTT AATGAAATGG GAGCCAAGGT CAAAGGTGCT GGTACAGAGA 900 

CTATAACCAT TACTGGTGTT GAGAAACTTC ATGGTACGAC TCACAATGTA GTCCAAGACC 960 

GTATCGAAGC AGGAACCTTT ATGGTAGCTG CTGCCATGAC TGGTGGTGAT GTCTTGATTC 1020 

GAGACGCTGT CTGGGAGCAC AACCGTCCCT TGATTGCCAA GTTACTTGAA ATGGGTGTTG 1080 

AAGTAATTGA AGAAGACGAA GGAATTCGTG TTCGTTCTCA ACTAGAAAAT CTAAAAGCTG 1140 

TTCATGTGAA AACCTTGCCC CACCCAGGAT TTCCAACAGA TATGCAGGCT CAATTTACAG 1200 
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CCTTGATGAC 


AGTTGCAAAA 


GGCGAATCAA 


CCATGGTGGA 


GACAGTTTTC 


GAAAATCGTT 


1260 


TCCAACACCT 


AGAAGAGATG 


CGCCGCATGG 


GCTTGCATTC 


TGAGATTATC CGTGATACAG 


1320 


CTCGTATTGT 


TGGTGGACAG 


CCTTTGCAGG 


GAGCAGAAGT 


TCTTTCAACT 


GACCTTCGTG 


1380 


CCAGTGCGGC 


CTTGATTTTG 


ACAGGTTTGG 


TAGCACAGGG 


AGAAACTGTG 


GTCGGTAAAT 


1440 


TGGTTCACTT 


GGATAGAGGT 


TACTACGGTT 


TCCATGAGAA 


GTTGGCGCAG 


CTAGGTGCTA 


1500 


AGATTCAGCG 


GATTGAGGCA 


AGTGATGAAG 


ATGAATAAGA 


AATCAAGCTA 


CGTAGTCAAG 


1560 


CGTTTACTTT 


TAGTCATCAT 


AGTACTGATT 


TTAGGTACTC 


TGGCTCTAGG 


AATCGGTTTA 


1620 


ATGGTAGGTT 


ATGGAATCTT 


GGGCAAGGGT 


CAAGATCCAT 


GGGCTATCCT 


GTCTCCAGCA 


1680 


AAATGGCAGG 


AATTGATTCA 


TAAATTTACA 


GGAAATTAGG 


CTGGAGAACC 


AGCCTTTTTC 


1740 


TAAAGATAAG 


GAGAAATATG 


AACAAAAAAA 


CAAGACAGAC 


ACTAATCGGA 


CTGCTAGTGT 


1800 


TATTGCTTTT 


GTCTACAGGG 


AGCTATTATA 


TCAAGCAGAT 


GCCGTCGGCA 


CCTAATAGTC 


1860 


CCAAAACCAA 


TCTTAGTCAG 


AAAAAACAAG 


CGTCTGAAGC 


TCCTAGTCAA 


GCATTGGCAG 


1920 


AGAGTGTCTT 


AACAGACGCA 


GTCAAGAGTC 


AAATAAAGGG 


GAGTCTGGAG 


TGGAATGGCT 


1980 


CAGGTGCTTT 


TATCGTCAAT 


GGTAATAAAA 


CAAATCTAGA 


TGCCAAGGTT 


TCAAGTAAGC 


2040 


CCTACGCTGA 


CAATAAAACA 


AAGACAGTGG 


GCAAGGAAAC 


TGTTCCAACC 


GTAGCTAATG 


2100 


CCCTCTTGTC 


TAAGGCCACT 


CGTCAGTACA 


AGAATCGTAA 


AGAAACTGGG 


AATGGTTCAA 


2160 


CTTCTTGGAC 


TCCTCCAGGT 


TGGCATCAGG 


TCAAGAATCT 


AAAGGGCTCT 


TATACCCATG 


2220 


CAGTCGATAG 


AGGTCATTTG 


TTAGGCTATG 


CCTTAATCGG 


TGGTTTGGAT 


GGTTTTGATG 


2280 


CCTCAACAAG 


CAATCCTAAA 


AACATTGCTG 


TTCAGACAGC 


CTGGGCAAAT 


CAGGCACAAG 


2340 


CCGAGTATTC 


GACTGGTCAA 


AACTACTATG 


AAAGCAAGGT 


GCGTAAAGCC 


TTGGACCAAA 


2400 


ACAAGCGTGT 


CCGTTACCGT 


GTAACCCTTT 


ACTACGCTTC 


AAACGAGGAT 


TTAGTTCCCT 


2460 


CAGCTTCACA 


GATTGAAGCC 


AAGTCTTCGG 


ATGGAGAATT 


GGAATTCAAT 


GTTCTAGTTC 


2520 


CCAATGTTCA 


AAAGGGACTT 


CAACTGGATT 


ACCGAACTGG 


AGAAGTAACT 


GTAACTCAGT 


2580 


AAAAGATACG 


CCTACACTCC 


TATGTCACTT 


ATGGATGTAG 


GAGTTCTTTT 


TACTAGTTTA 


2640 


AGCAGGACTA 


AGACAGGTAC 


TAAGACAAAA 


TAGCAACTTC 


TAAAACTAAC 


TTCCAGTTTT 


2700 


GGGAGAGAGA 


TGGAAGTTAC 


TTTGAGA 








2727 


(2) INFORMATION FOR SEQ ID NO: 102: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

TTTTTTGTAG ATTTAAGTGG GGTGCAATTC CTAAAAAATA AAAAACAATT TTTGAAAATT 60 

ATGTTAGCAG GAATTGCTTC AAATTCGATT TTATCACTTA CAGGTTTACT TGTTTTATTG 120 

TTCACATCGT ATAAATTGCT TGGACTCTTA TTTTTTATCA TTAACTTAGG TATGATTTTT 180 

ATTAATTCAA TTCCTTTTTT TCAGTATGAT AGTGGTATTA TTTTAAGAT A . CTTGAATTCT 240 

AACAATAATA ACTTGAATTT TCAATATATA GTTCAACTTT TAATAGCATT TGTTATTATT 300 

TATTTTCCTT TGAGTCAACT ATTACAGTTT TTGACACCCA ATATTATTGT TCGTAGTATA 360 

GGAGGGGTGG TTGTTTCTAT ACTGCTTTCT ATATTATATA TGATAGGAAG GACGAAATAT 420 

GTTCTACGTA AATAGTTATG TTTTTGCTTA TAAAAAAGAA GGTATAATGT ATTTACGTGG 480 

TCGGAGTATG CGGGAAATAG CTATAGAACC TCAAATTTCG CAAGAATTTA TCAACGATCT 540 

ATTTAATAGT TGTAAGGAAC TATTAGAGAT AGAAGAAGTA TTAGGCAGTA AACTAACATT 600 

TGAACTATAA ATGAACAAAT TTTAATTTCG GATGAGATAG ATATTGATAG TAGATATTCT 660 

AGAACTAAAG GTTACTATTC GTTATTTTAT AATGAAGAGT ATAATAAAAT AC AGAAT AAA 720 

ACAGTATTAG TATTAGGAGC AGGAGTCTTA GGATGTTATA TATCTCTAAG TCTAAGTATG 780 

TATGGAGTGA GGAAACTTAT TGTCGCTGAT TACGATATAA TAGAACCATC AAATTTAAAT 840 

AGGCAAATTC TTTATACAGA GTCGGATGTT GGTAAGGAGA AGATTAATGT TCTTTCTGAA 900 

AAAATACACA AGTATAATTC AGATGTTCAG GTAGTACCTA TTTCTATTAA AGTTTCTTCA 960 

GTAGAAGAAT TAGAAAAAAT TGTTGCGGAA TATGGGAGTA TAGATTTTAT CGTTAAAGCA 1020 

ATTGATACGC CCATTGATAT TATAAAAATT GTCAATCAAT TTGCTGTATC GCATAAGATA 1080 

TCCTACATAT CAGGAGGGTT TAATGGATGC TATCTTATTA TTGATAATAT ATATATCCCT 1140 

ACCATCGGTT CTTGCTTTGG TTGTCGGAAT ATAAACAAAG ATATAAATAA GTACACTTTA 1200 

TCTGATAAGA CAAAGTGGCC GACTACACCA GAGATGCCTG CTATTTTGGG AGGGATAATG 1260 

ACTAATTTAA TAATTAAAAT ATTTCTGGGA TGTTATAATG AAATCCTAAT AGATAACGCT 1320 

TACGTTTATA ATATGAGAAA TCATGCTCTA AGTCAAGAAA AATATGTTCT GGAAAACGGA 1380 

GAATGTCCAA TTTGTAAAAA AATAATAAAG TGAAAGATAA CAATATTAGA GCGAAAACAT 1440 

TTATTCGTTC AGTTTGTTTT TGCTTATTAT CAGGAGGAGT AGCTTTTTTA TCTGCTATTG 1500 

GGCAGTTCAC TGTTATAGAA ACACAATTAA TAGTATTGTT CTTGGGTATT ATTTTTGCTA 1560 

TATATTATGC TTACTACAAT AAAAATATTC AAACATCATT GGAAAATATA GTATGGCTTT 1620 
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TTTCATCGTT TGAGATTTTA TTTTTGCTTG TTAATTTTAG AACATTTATT CAGTTACCAG 1680 

TGGATATTTT TATTGGTATG ATAATATTTT TAATGCTGTG GATATTTATT ATGTTAGGTA 1740 

TAGTGTGTCT TAGTTATTAT ATAACTTTAT TATTTAGCAA GGAGGCTTAG TATGTTTAAA 1800 

AAAATAGGTA TAATGAGCAT TTGCATATAT ATAATTATTT TATACTGCTT GAGAATGTAT 1860 

CGTATTATCA ATAATATTGA AACAATCTTG CTAACGGTTA TATGCTTAAT GTTATTGTTT 1920 

TTTTTAAGAC GTTTATTTGA TAAAGATAAG TAAATAGATG TTAAGTAAAA ATGTAGAATA 1980 

TAAAGGAGGT GCAATGAGTA TGATTGAAGT TAGCCATTTA TCAAAAAGTT- TTGGTGATAA 2040 

AATAGCTTTA AATAATATAA GCTTCACTGT TAAAGAAGGT TAGATTTTTG GATTTTTAGA 2100 

ACCATCTGGT TCTGGAAAGA CCACAACGAT TAATATTCTG ACTGGGCAGT TCCTTGCCGA 2160 

TAAAGGACAA TCTATTATTT TGGGACAAAA ATCTCAAAAT TTAACAAGCG GTGAATTAAA 2220 

GAGAATTGGA TTGGTTAGCG ATACAAGTGG ATTTTATGAG AAAATGTCTC TGTATAACAA 2280 

TCTTCTTTTT TATAGTAAAT TTTATAATAT TAGTAAATCA CGTGTTGATA ATTTGTTAAA 2340 

GCGAGTAGGA TTATATGATA GTCGCAAGAT GGTAGCAGGA AAATTATCCA CTGGAATGAG 2400 

GCAACGAATG CTTTTAGCAC GAGCTCTTAT CAACAACCCC GCTGTACTCT TTCTGGATGA 2460 

ACCGACCTCA GGTCTAGATC CCACAACTTC TCGAACAATT CATGAGTTAA TTTTAGAATT 2520 

GAAAACAGCA GGGACAACGA TTTTTCTAAC GACTCATGAT ATGAATGAAG CAACTCTTTT 2580 

ATGTGATTAT GTTGCCTTAT TAAATAAAGG GAAATTAGTT GAGCAAGGAG CTCCTTCTGA 2640 

ACTCATTCAA AGATATAATA AAGATAAAAA GATTAAGGTT ACAGATTATA ATGGGAATCA 2700 

GATAACTTTT GATTTTACAT CACTAGAACA GGTATCTCAG ACTGATCTGG AAAATATTTT 2760 

TTCAATTCAT TCATGTGAGC CTACTTTAGA AGATATTTTT ATCACATTAA CAGGAGGAAA 2820 

GCTAAATGCT TAAACGGTTT CTGGCTTTGG TATGGTTGCG TTGTCAAATC ATCCTTTCCA 2880 

ATAAGAGTAT TTTATTGCAA GTTTTAGTGC CTTTTGCTTT CACATATTTT TATAAATATC 2940 

TTATGGAAAC ACAGGGGAAG GTCAACGATC AACAGGCATT AGTTCTTTTG ATGATGTGTT 3000 

TACCTTTTTC TTTTTCTTTG GCTGTTGGAA GTCCTATAAC TATTATCTTG TCTGAAGAAA 3060 

AAGAAAAGTA CAATTTACAA ACTCTTCTGT TGAGTGGTGT TAAAGGCTCC GAATACATTT 3120 

TATCAACTAT GTTTCTTCCT TTTTTGCTAA CTTTTGTGAT TATGGGAACT ACTCCTCTTA 3180 

TTTTAGGAGT TACAATTGTA CATACTTTTA ATTATATTAC AATCGTTCTT CTAACCTCTT 3240 

TATCCATCAT TTTATTCTAT TTATTGATAG GTTTAACCGC GAAGAGCCAA GTAGTAGCTC 3300 

AGGTTATCAG TCTTCCTGCT ATGATTTTAG TTGCTTTCTT ACCGATGCTA TCTGGTTTGG 3360 

ATAAGACAGT TGCGAAGATA ACAGATTATA GTTTTATGGG ACTATTTACT AAGTTTTTCA 3420 
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CAAAATGGGA 


GGAATTTTCA 


TGGAATAAAA 


CTCTAATTCC 


TAATCTAACA 


CTACTTATTT 


3480 


GGATTGTTCT 


TCTATTAACT 


TTAATTACGA 


TAACTATTAG 


GAAAAAGAAA 


ATTTCTTAAT 


3540 


TGAGTTATTT 


TAATGATTAT 


AAACACAAGT 


GGGAAGGAAA 


AAATGAACTG 


ATCTTTTTGA 


3600 


CAGCAATTCT 


ACAGAATAGT 


CTTATTGCTA 


TATTTTGATT 


TGAGTGTACG 


AAAAAAGAAA 


3660 


AATAACAATA 


GTGCTCATAC 


TAATTGCAGA 


AGTTTTGGGT 


GATAAGATAA 


CTGATAAATT 


3720 


GCAATAAAAA 


ATGCAACATT 


TTTAAATCTC 


CTCTATAAGT 


GCTTCAAAAA 


GTGCTTCAAA 


3780 


ACCTGTCTTG 


TAATCCAAGT 


ATTTTTGGGG 


ACGGTGATTA 


ATAAGCTAGC 


AAAGCATCAT 


3840 


TAAGGATTTT 


TTCGGTAATT 


GTTGCCAAAT 


CGGTTTAAGA 


AAATACTCAC 


GAAGAAGTCC 


3900 


ATTCGCATTC 


TCATTACTTC 


CCCTTTGCCA 


AGATGAATAG 


GCATCCGCAA 


AATAAAACAG 


3960 


AATTCCCATT 


TGTTCAATTA 


AAGGGTAACA 


AGCAAACTCT 


TTTTCTCTGT 


CCGAAGTGAA 


4020 


AGTCTTTAAC 


TATTCTTTTG 


GAAAGAGTCT 


TGTGAGGTGT 


TCAATAGCAG 


TCAACATGGA 


4080 


TTTAGCTGTT 


TTTACTTGAC 


AAGTGCTAGT 


AGAAATAATA 


GAATAGTAAA 


AAACCTTTAA 


4140 


AGCAGTCCAG 


AGAGGCAGCT 


AAGGTTAGAC 


GGTGAAAGGG 


TGGAGACTAC 


CCATTTTTCG 


4200 


TGGAACCTTG 


CTGTTGGCAG 


GTTCCTTTTT 


TCGTGGCTTC 


TGTTGGCCAG 


ACTCTCTCAC 


4260 


TAGTAAAGGT 


AAAAGGAGAA 


ACCTATGCGA 


GAACATCGTC 


CAATCATTGC 


TCTTGATTTT 


4320 


CCTAGTTTTG 


AGGCGGTCAA 


GGAATTTTTA 


GCTCTTTTCC 


CAGCAGAAGA 


AAGCCTTTAT 


43B0 


CTCAAGGTAG 


GGATGGAGCT 


TTATTACGCA 


GCGGGGCCTG 


AGATTGTGTC 


CTACTTAAAA 


4440 


GGTTTGGGTC 


ATAGTGTCTT 


TTTGGATCTC 


AAACTTCATG 


ACATTCCTAA 


TACAGTCAAG 


4500 


TCAGCCATGA AGATCTTGTC 


TCAGCTTGGT 


GTCGATATGA 


CTAATGTCCA 


TGCGGCTGGT 


4560 


GGTGTAGAGA 


TGATGAAGGC 


GGCGCGTGAA 


GGTCTTGGGA 


GTCAAGCCAA 


ATTGATCGCT 


4620 


GTAACTCAGC 


TCACATCAAC 


GTCAGAAGCT 


CAGATGCAGG 


AGTTTCAAAA 


TATCCAAACC 


4680 


AGTCTGCAAG 


AGTCTGTGAT 


TCACTATGCC 


AAGAAGACAG 


CTGAAGCTGG 


CTTGGATGGT 


4740 


GTTGTTTGCT 


CGGCTCAGGA 


AGTACAAGTC 


ATCAAGCAGG 


CTACCAATCC 


AGATTTTATC 


4800 


TGTCTGACAC 


CAGGGATTCG 


TCCAGCTGGT 


GTTGCAGTTG 


GAGATCAAAA 


ACGAGTCATG 


4860 


ACACCTGCTG 


ATGCCTATCA 


AATCGGCAGT 


GACTATATCG 


TAGTGGGACG 


TCCCATTACC 


4920 


CAAGCTGAGG ATCCTGTTGC AGCTTATCAT GCCATCAAGG ATGAATGGAC ACAGGACTGG 


4980 


AATTAAAGAA 


CTAGATTAGA 


AAAATAAAAG 


GAGAATACCA 


TGACACTTGC 


TAAAGATATC 


5040 


GCTAGCCACC TCTTGAAAAT 


CCAAGCCGTT 


TACCTCAAAC 


CAGAGGAACC 


CTTCACTTGG 


5100 


GCATCTGGTA TCAAGTCACC 


GATTTACACT 


GATAATCGTG 


TGACACTAGC 


CTATCCAGAA 


5160 
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ACTCGTACCC TAATTGAAAA TGGTTTTGTG GAAGCTATCA AAGAAGCCTT TCCTGAAGTA 5220 

GAAGTGATTG CAGGAACTGC AACAGCAGGG ATTCCACACG GAGCCATTAT TGCTGATAAG 5280 

ATGGACTTGC CTTTTGCCTA CATCCGTAGT AAACCAAAAG ACCACGGAGC TGGTAATCAA 5340 

ATCGAAGGTC GCGTAGCTCA AGGTCAAAAA ATGGTAGTGG TTGAAGACCT TATTTCAACG 5400 

GGTGGTTCAG TTCTTGAAGC TGTAGCAGCA GCCAAGCGAG AAGGAGCAGA TGTACTTGGA 5460 

GTTGTAGCGA TTTTCAGCTA CCAATTGCCA AAAGCAGATA AGAACTTTGC AGATGCTGGT 5520 

GTTAAACTTG TGACGCTTTC AAACTATAGC GAGCTTATCC ATCTAGCCCA AGAAGAAGGT 5580 

TACATCACGC CAGAGGGCCT TGATCTTCTA AAACGCTTTA AAGAAGACCA AGAAAATTGG 5640 

CAAGAAGGTT AGGTCAGTAA GATAAAGAGA GACGAGGCTA CCGAGTCTCT TTTACCATTT 5700 

TATTTAAAAT ATGACAG 5717 



(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5558 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

CCTGGACTTT CTAAAATGAA ATCTTGCGAC CTGGATCAAG CCCTTCATGA GCATTTTTCA 60 

GAAGAAGAAT TAGCTGGTCA CTTTCATGTC CTTCTATGGA CTTTTTTTAC AATGGCATTG 120 

CTATCACACC CAATACCTAT CTAAGCGCCT GGTTCGTAAA CTTTATTGCA GCTCTTCCTC 180 

TAAATTTCCT AATTGTTGAA CCAATTGCCC GTTTTATACT AAGTTCTTTT CAGAAACCAT 240 

TTACTGGGGA AGAAGTTGAA GATTTTCAAG ATGATGATGA AATCCCAACT ATTATCTAAG 300 

CCAGTTCTGT AAACTACTAA TATTTGAAAT CCACTTCCTT TTAGGGTGCA ATGGTTATAA 360 

ATGAATTTTT GAGAGGATCA GAATGAAAAA ACTAGCAACC CTTCTTTTAC TGTCTACTGT 420 

AGCCCTAGCT GGGTGTAGCA GCGTCCAACG CAGTCTGCGT GGTGATGATT ATGTTGATTC 480 

CAGTCTTGCT GCTGAAGAAA GTTCCAAAGT AGCTGCCCAA TCTGCCAAGG AGTTAAACGA 540 

TGCTTTAACA AACGAAAACG CCAATTTCCC ACAACTATCT AAGGAAGTTG CTGAAGATGA 600 

AGCCGAAGTG ATTTTCCACA CAAGCCAAGG TGATATTCGC ATTAAACTCT TCCCTAAACT 660 

CGCTCCTCTA GCGGTTGAAA ATTTCCTCAC TCACGCCAAA GAAGGCTACT ATAACGGTAT 720 

TACCTTCCAC CGTGTCATCG ATGGCTTTAT GGTCCAAACT GGAGATCCAA AAGGGGACGG 780 

TACAGGTGGT CAGTCCATCT GGCATGACAA GGATAAGACT AAAGACAAAG GAACTGGTTT 840 
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CAAGAACGAG ATTACTCCTT ATTTGTATAA CATCCGTGGT GCTCTTGCTA TGGCTAATAC 900 

TGGTCAACCA AACACCAATG GCAGCCAGTT CTTCATCAAC CAAAACTCTA CAGATACCTC 960 

TTCTAAACTC CCTACAAGCA AGTATCCACA GAAAATTATT GAAGCCTACA AAGAAGGTGG 1020 

AAACCCTAGT CTAGATGGCA AACACCCAGT CTTTGGTCAA GTGATTGACG GTATGGATGT 1080 

TGTGGATAAG ATTGCTAAGG CCGAAAAAGA TGAAAAAGAC AAGCCAACTA CTGCTATGAC 1140 

AATCGACAGC ATCGAAGTGG TGAAAGACTA CGATTTTAAA TCTTAAAAAC CAAAAAAATA 1200 

CAGTATCCAC ATTCGGTACT GTATTTCTTT TACT CTC ATT CTTAAGTTAA ATTATTAAAA 1260 

TCCCATATTT GGTCTATCCA GCCTTCATAA AAGTCTGGCT CGTGGCAGAC CATAAGGATA 1320 

GATCCCCTAT ATTCTTTGAG AGCGCGTTTG AGCTCATCCT TTGCATCCAC ATCCAAATGG 1380 

TTGGTCGGCT CGTCCAGCAC TAAAACGTTG TTTTCACGAT TCATCAAGAG ACAGAAACGA 1440 

ACCTTGGCTT GCTCTCCCCC TGATAATACT TGAATCTGGC TTTCAATATG TTTGGTTGTC 1500 

AAACCACAAC GGGCAAGGGC TGCACGGACT TCTGCTTGAT TAAGGGCAGG AAAGGCATTC 1560 

CAGACAGCTT CAAGAGGAGT TTGGCGATTA CCGCCTTCTA CTTCCTGCTC AAAATAACCA 1620 

AGTTCTAAAT AATCTCCACG CTCCACTTCC CCAGCGATTG GCGAGATAAT GCCCAAGAGA 1680 

CTCTTCAAGA GAGTTGTTTT TCCAATACCA TTAGCACCAA TAATCGCAAC CTTTTGATTG 1740 

CGTTCGAAGG TAAGATTTAA AGGCTTAGTA AGAGGACGGT CGTAACCAAT TTGCAAGTTC 1800 

TTGGCTTGGA AGATAAAGCG CCCTGGTGTA CGAGCTGGTT TGAAATCAAA GGATGGTTTT 1860 

GGTTTCTCAC TTTGGAGTTC GATAATATCC ATCTTATCCA ATTTCTTTTG ACGAGACATA 1920 

GCCATATTAC GAGTTGCAAC ACGGGCTTTA TTACGAGCCA CAAAGTCCTT GAGGTCTGCA 1980 

ATCTCTTTCT GCTGGCGTTC GTAGGCTGCC TCTAGCTGAG ATTTCTTCAT AGCATAAACT 2040 

TCTTGGAACT GGTAGTAGTC ACCAGAGTAA CGCGTCAGCT GTTGATTTTC CACATGATAG 2100 

ACAATATTAA TAACGTCATT GAGGAATGGA ATATCGTGCG AAATGAGAAC AAAGGCATTC 2160 

TCATAGTTTT GGAGATAGCG CTTGAGCCAA TCAATATGCT CAGCATCCAA GTAGTTGGTC 2220 

GGCTCGTCCA ACAGCAAGAT ATCAGGCTTT TCAAGGAGAA GTTTTGCCAA AAGCACCTTG 2280 

GTTCTTTGCC CACCTGACAA AGAAGTTACA TCCGTATCCA TGCCAAAGTC CATAACACCA 2340 

AGAGCACGCG CTACTTCGTC AATCTTAGCA TCCAAGGTAT AGAAATCACG ACTCTC C AG A 2400 

CGGTCTTGAA GTTCTCCTAC TTCTTCCATG AGAGCATCAA CATCCGCGCC GTCTTCAGCC 2460 

ATTTTCATAT AGAGGTCATT GATACGAGCT TCAGCTTTGA AAAGCTCATC AAAAGCCGTA " 2520 

CGGAGAACAT CACGCACCGA CTGTCTTTCA GCAAGGACAG AGTGCTGATC CAAGTAACCA 2580 
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GCCGTCACAT ATTTGGACCA CTCAACCTTT CCTTCATCTG 
ATACTCATAA AGGTTGATTT TCCTTCACCA TTGGCACCGA 
TTGAGGAGAC GGAAGGACAC ATCTTCAAAA ATTGCACGGT 
TTTTTAACTT CTAAAATACT CATTTTAATT CCTTACCTTG 
AAGGAGCCAA GCCAGATAGC CACCCAAAGT GTTGGTCCAC 
GCGATTGAAA TCAAAGAAAA AGTCCAAGAT TAATTGCGTA 
AAGAAAACTA AAAAGAAGGA CCTTTTTTGT TTTCCGCAAA 
TTGGAAAATC AGAGGAAAAA ACAAGAAGAC ATTGAGGATA 
TAATTGTCCA ATGTCACTCA CTTCGCCCAG TTTCCAGAGA 
AAAAACCAGG CGTCCAAGAT GCTGAATACC TGGAGTTCCC 
TTGAGGAGTA AAGCAAAAAC AGACAATGCA AATGCTATAG 
AATATGATTA TAAGTCTTCT T CATC ATT AA GGATTTACCG 
TCACGTTTCA TTGTGTTAGA GCGCAATTGT CCACAAGCTG 
TCTTGACGAA CCACACAGTT GACCCCTTTT TTCTTAAGCG 
CACTCTTTGG GACTACGGCT ATATTGGTCA TGCTCACTAA 
TTTACATAAG ACAATTTCTT GATGTTCTTG AGCAATTCAG 
ACACCGTCGT TGACTTCATT AAGCATGATA TATTCAAAGG 
TCAATGTAGT ATTCAATAGC AGCAAAGAGT TTTTCAATCG 
ATGATACTTG AACGAAGTTC ATTGTTAGGT GCGTGAAGAG 
ACCCCTTCAT CAGCAAAGTC ACGAATTTTA TGAGCCAAAC 
TGACGAGCAC CGATAGCCAT TCCTTTATCA TCATTGATAG 
TTGTTGTAAT TATCAAAGGG CTCACCGATT CCCATGACAA 
TCCTGACCAC GCTCATCAAA GTATTTCTGA ACCAGCATGA 
TTATTGAGGT CACGTTGCTT CTTAATCAAA CCAGAGGCAC 
CAGCCGACCT GAGTGGTCAC ACAGACAGAT AAACCATAGT 
TCAATTAACA TACCGTCGGG CAATTCAAAG AGATATTTGA 
TGCACAATAC GTTGTTTCAA GGGATTGACC ACAAACTGGT 
TCCTTGGAAA GGTTGGTCAT TTCTTCAAAT GACTGCACAC 
CAGATTTGAT CTGCACGGAA TTTCTTTTCT CCCTGCTCCA 
TGATGTACCA AACTATGAAT TGAGGGTTTC ATTTCTTCTC 



GCAGCATTTT ACCAGTCACG 2640 

CCAGGCCGAT ATGTT CTCCC 2700 

CACCAAAACC GTGACTCAGA 2760 

TTTTTATGTA ATCGTTTATA 2820 

AAATCATCAA TCTCAAAGAC 2880 

CACTCGATTC CAAGACTCAC 2940 

TTTGGAAATA GATAAAGGAG 3000 

TTTTGTAAAA AAATCCAACA 3060 

GAATTGAAAG GAGTCAAAAG 3120 

ACTCCCACGG TAGATTGTTC 3180 

AAAATGACTC CCCAGACCAA 3240 

CTGCGACTGC CTTCTGGCGG 3300 

CGTCAATATC TGTACCATGC 33 60 

TATCATAGAA AGCCAACACG 3420 

CTGGGTTATA AGGAATCAAG 3480 

TCAATTCCAA GGCTTGTTCT 3540 

TTACACGACG GTTTGTTGTC 3600 

GAAAGGCACG GTTAATCTTC 3660 

ACACGGCAAG ATTGACCTGA 3720 

CTGAGGTTGA AACCGTGATG 3780 

TACGAAAGAA ATTCAAGACA 3840 

CGATATGGCT GATGCGTTCA 3900 

TTTGCGCTAC GATTTCACCG 3960 

AGAAGGTACA ACCGATATTA 4020 

GTTGACGCAT GAGTACAGTC 4080 

CTGTACCATC AGCAGACTCT 4140 

CATTGAGCTT AGCAATCAAA 4200 

GTTTACGGTA GAGCCATTCC 4260 

ATACCCATTC CTGCATGGTT 4320 

CTTATTCTCT ACTCACTTCT 4380 
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GACGAATGAC AAAATGACGT TGTCCCTTGT CGTCTTTCTG ACGACGTCTA TTTTTCTTAT 4440 

CTGCATTCGA CTTTCGTTTA GTTTGAGTCG GTTTCTTTCC TTTTCTAGAA GGTGTTTCTT 4500 

CTTCCGTCTT ACGCATTTTC TTGTCAAATG ATGCTCGCTT AGGGGCTTCA TTTTCTAAGA 4560 

CAAAATAGGC ACAACCATAA CTACAATACT CTAAAAGGTA GTCTTGTAAA CGACTGATTT 4 620 

TTTCAAGTTT TTCTTCTGTT CGGTCATCCT TGTAAAAACC TCGTAGGCGA AGCTGTTCGT 4680 

TGCTCCAGTC CCCCACGATA TAATCAAACT TGGTTAATAC TTCTGAAAAA GGCTGATTAA 4740 

AAGTCGTCAC ATCAAAGGCA TCCTTGATAT TTTCAACCAA GGAAAAAGCT ATCCCTTCCG 4800 

TTTCGACCTT GTCCCCGTGT AAATGGAACT CCGGACCAGG AAACTTGTTA TAGTTGTATA 4860 

ATTCAGGTGC AATTTCTTTT CGCATAGATA TCCTTTTTTC ACGATTACTT AATACTTTAT 4920 

TCTACCATAA TTTCTAGCAG TTAGCACGTT TCTCATAAAA ATGAAAAAAG TCTGACGATT 4980 

TTGTCAGACC AGAATCTTAT AACCTAAAAA GAGAAGAACA ATTCTTCCCT CCAACTATCA 5040 

TTATTTAGCA GCTGCGTACA ATTCATCTAC TTTATTCCAG TTGATTACTG AAAAGAAAGC 5100 

TTTGATGTAG TCAGGACGCA CGTTGCGGTA TTTCACGTAG TAAGCATGTT CCCAAACGTC 5160 

CAAGCCCAAG ATTGGTTTTT TACCTTCTGA GATTGGTGTG TCTTGGTTTG CTGTTGAAGT 5220 

CACTTCAAGT TTCCCTTCTT TGTTGACAAC CAACCATGCC CAACCTGAAC CAAAACGAGT 5280 

TGTTGCTGCT GCAGTGAAGG CTGCTTGGAA TTCTTCAAAT GAACCAAATG TTGCATCGAT 5340 

TGCTGCTGCC AGTTCTGCTG AAGGAGCTGT TTTCTCGGGA GTCATCAATT CCCAGAAAAG 5400 

AGCGTGGTTC AAGTGTCCGC CACCATTGTT GATAAGTGCT TGACGGATAT CAGCTGGGAT 5460 

AGATTCTACA TCAGCAAGCA AGGCTTCAAG GTCTTCACCG ATTTCAGGGT GTTTTTCTAA 5520 

AGCTGCATTG GCATTGTTGA CATAAGTTTG ATGGTGTT 5558 
(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6735 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

GGAATTGTAA AT AT CAT ATT GTTTTTGCAC CCAAATATCG TCGTCAAATC ATTTATGGCA 60 

GATACAAAGC TAGTATCGGA AGAATCATAC GTGACTTATG TGAGCGTAAG GGTGTAATAA 120 

TCCATGAAGC GAATGCTTGT TCAGACCATA TTCACATGCT TATCAGTATT CCTCCGAAAC 180 
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TTAGTGTTTC GTCCTTTATG GGCTATTTAA AGGGCAAGAG CAGTTTGATG ATTTTTGATA 240 

AGCATGCGAA TTTAAAATAC AAATATGGCA ATCGCAAGTT TTGGTGTAGA GGCTATTATG 300 

TAGATACGGT AGGCCGTAAT CAGAAAGTGA TAGCTGAATA TATTCAGAAT CAATTACAAG 360 

AAGACAGAGT AGCAGACCAG CTCACGTTAT TCGAGTCAGT AGATCCGTTT ACTGGCGAAA 420 

TAAATAAGAG GAAGTAACTA AGGTGCTTTA GCACCTGCTC GGGAAAGTGG TGCGCGAGGA 480 

AGCTATTTCG GTGGGCCTTT GGCCCTGGCC GGTAGAAGCG GCTTATAGCC GCAGAACAAA 540 

CCACCAGTTC ACACTGGTGG TTTTGATTTA AAAAACTTGA TACATAAAAA TAAAAGTCTA 600 

TATAAAGGAT GGTAAAATTC CTGTTGTCCG ATTTGGACAA TATCCTAAAT AGTTACAATA 660 

TATGGTCTAT ACTTTTTCTT AGGAGAAAGC TAGATGTACA GACGTTTGAG AGATTTGAGG 720 

GAGGATCATG ATCTGCCCCA AAAGCAAATA GCTACAATAC TTTCGTTTAC AAATTCAGCT 780 

TATGCCAAAA TTGAACGGGG TGAGCATGCG TTGACGGCTG ATGTATTGGT TAAACTCTCA 840 

GATTTCTATG ACGTCAGTAC AGACTATTTA TTGGGATTAA CTGATTTTCC TGATAAAATT 900 

CGCTTTAGAA AATAATCTCC TCAATTTCAT AGAGTTTGAA AATGAGTGAG ATTTTTTATT 960 

TGCCCTTTGA CAACTGAATA GCCTAAAATG GTACTTTCCT CATTTGTGGA GCAAATTTGA 1020 

ATGGCTCGCC ATGATAAGAG CGATTTTAAA ATCATCAATA AAATAGAGCG ATACTTTATA 1080 

TGCCATGATA CAAATGATAT ACAATGATAC TTCTGACCGT TCAGCCTGCC AACGTAAAAG 1140 

AGCAGCAAGT GAAATTCTTA TGATGACTTC ATCAGTCATG CCACGTTGAA TGTGTGAGTT 1200 

TGTTAGATAA ACGCAATTAA TCCTCAAAAG GTTCCCCGAA CCTTTTGAGT TCTACAGACG 1260 

CATCACGTGG AGTGTGTAAG CTTGTTGCTA AAAGCGTAAA AACCTTGGAA CGAAAGGAAT 1320 

AATAGACTTT CTGCGAAACA AAAATATAAT ACAATAAAAC TATGAATGAT GAAGCAAGTA 1380 

AACAATTGAG CGATAGCCGT TTCAAGATCC TTGTAGGTGT TCAGCGCACG ACTTTTGAAG 1440 

AGATGTTAGC TGTGTTAAAA ACAGCTTATC AACGTAAACG CGCAAAAGGT GGACGAAAAA 1500 

GCAAATTAAG CCTAGACGAT CTCCTTATGG TAACTATTCA ATACATGCGA GAATAGAGCA 1560 

CTTATGAACA AATTGCGGCT GATTTTGGCA TTCACGAAAG CAACTTAATC CGTCGGAGTC 1620 

AATGGGTTGA AGCAACTCTT ATTCAAAATG GTTTTACGAT TTCAAATTCT GCCTTAATTC 1680 

TGTAAAAACA GTAAAATTCG AAGGATTGTA AGGTAAGAGT TTTTTTCTTT CTGAAAAAAT 1740 

GGTATAATAG CAATCAAAAC TAGAAAATAA AACGGAATTT GGAACAGATT TGTCTGTATC 1800 

CTAGTAGAGT GGTGATACTA TGAAGATTAG TAAGAGGCAC TTATTAAATT ATTCCATCTT 1860 

GATTCCCTAC TTGCTTTTAT CTATTTTGGG CTTGATTGTG GTCTATTCGA CCACCAGTGC 1920 

TATTTTAATT GAAGAAGGCA AGAGCGCCTT GCAGTTGGTT CGAAACCAAG GAATCTTTTG 1980 
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GATTGTTAGT TTGATACTGA TTGCCTTAAT TTATAAATTG AGACTAGATT TTTTGAGAAA 2040 

TGAGCGACTA ATCATTTTAG TTATATTAAT AGAAATGCTT TTATTGTTCT TGGCTCGTTT 2100 

TATTGGTATT TCCGTAAACG GGGCATACGG TTGGATTTCG GTTGCAGGAA TAACTATTCA 2160 

GCCAGCTGAG TACTTAAAAA TCATTATTAT TTGGTATTTA GCTCACCGAT TCTCCAAACA 2220 

GCAAGAAGAA ATAGCTACTT ATGATTTTCA AGTTTTGACT CAAAATCAAT GGCTTCCCCG 2280 

TGCTTTTAAT GATTGGCGAT TCGTTCTCCT AGTTCTGATT GGAAGTTTGG GAATTTTCCC 2340 

TGATTTAGGA AATGCGACTA TTTTAGTCTT GGTTTCCTTG ATT ATG TATA CAGTTAGTGG 2400 

AATCGCTTAT CGCTGGTTTT CAACCATTCT GGCGCTCGTA TCTGCCGCTT CTGTCTTTGT 2460 

CTTGACCACT ATCAGCCTAA TCGGTGTTGA GACCTTTTCA AAAATTCCAG TATTCGGCTA 2520 

TGTAGCCAAG CGCTTTAGTG CCTTTTTTAA TCCTTTTGCC GATCGTGCTG ATGCAGGTCA 2580 

CCAGTTAGCT AATTCTTATT TTGCCATGGT CAATGGCGGT TGGTTTGGTC TAGGTCTTGG 2640 

AAACTCGATT GAAAAACGAG GTTATTTGCC AGAAGCTCAT ACAGACTTTG TCTTTTCTAT 2700 

CGTGATTGAA GAATTTGGCT TTGTTGGTGC CAGTCTTATT TTAGCTCTCT TGTTTTTCAT 2760 

GATTTTGCGG ATTATCTTGG TCGGTATCCG AGCGGAGAAT CCTTTCAATG CCATGGTTGC 2820 

ACTCGGTGTC GGAGGGATGA TGTTGGTTCA GGTATTTGTC AATATCGGAG GGATTTCGGG 2880 

CTTGATTCCA TCTACAGGAG TGACTTTCCC CTTCTTATCC CAGGGTGGAA ATAGTCTTCT 2940 

AGTCTTATCA GTGGCAGTAG CCTTTGTCTT AAATATTGAT GCCAGTGAAA AACGCGCTAA 3000 

ATTGTACCGA GAATTGGAAA ATCAACCAAT GAACCTTCTG TTGAAGTAGG ATAAAGAAAG 3060 

GATAGTTTAT GTCTCTTCAA AAATTAGAAA ATTATAGTAA TAAAAGTGTT GTGCAAGAAG 3120 

AAGTCTTGAT TCTAACAGAA TTACTGGAAG ATATTACTAA AAATATGCTT GCCCCAGAGA 3180 

CCTTTGAAAA AATAATACAG TTGAAAGAAT TATCAACGCA GGAAGATTAT CAAGGTCTAA 3240 

ACCGTCTAGT GACTAGCTTA TCAAATGATG AAATGGTCTA TATTTCACGC TATTTCTCTA 3300 

TCTTGCCTCT TTTGATTAAT ATTTCAGAGG ATGTGGATTT AGCTTATGAA ATCAATCATC 3360 

AAAATAATAT TGATCAGGAC TATTTAGGTA AATTATCTAC AACGATTAAA TTGGTAGCAG 3420 

AAAAGGAAAA TGCCGTTGAG ATCCTAGAAC ACTTGAATGT TGTCCCTGTT TTGACAGCCC 3480 

ATCCAACACA AGTGC AACGC AAAAGTATGT TGGATTTAAC AAATCATATT CATAGTCTTT 3540 

TGCGTAAATA CCGTGATGTT AAGTTGGGGT TGATCAATAA AGATAAATGG TACAATGATT 3600 

TGCGTCGTTA CATCGAAATT ATCATGCAGA CAGACATGAT TCGTGAGAAA AAATTAAAAG 3660 

TGACTAACGA AATCACGAAT GCTATGGAAT ATTATAACAG CTCCTTTTTG AAAGCTGTAC 3720 
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CTCATTTGAC GACGGAGTAT AAGCGCTTAG CGCAAGCGCA TGGTCTGAAT TTAAAACAGG 3780 

CTAAACCAAT CACCATGGGT ATGTGGATAG GTGGTGACCG TGATGGAAAT CCATTTGTTA 3840 

CAGCAAAGAC CTTGAAGCAG TCTGCACTCA CTCAGTGTGA AGTCATCATG AACTACTATG 3900 

ATAAAAAGAT TTACCAACTT TATCGTGAAT TTTCTCTTTC AACTAGCATT GTCAACGTCA 3960 

GCAAGCAAGT CAGAGAAATG GCTCGTCAAT CCAAGGATAA CTCGATTTAC CGCGAAAAAG 4020 

AGCTTTACCG TCGTGCCTTG TTTGATATTC AATCAAAAAT TCAGGCAACT AAAACCTATC 4080 

TGATTGAGGA TGAAGAAGTT GGGACTCGTT ATGAAACCGC CAATGATTTC TACAAGGATT 4140 

TGATTGCCAT TCGAGATTCT CTACTAGAAA ATAAGGGCGA GTCCTTGATT TCAGGTGATT 4200 

TTGTGGAATT ATTGCAGGCA GTAGAGATAT TTGGTTTTTA CTTAGCATCA ATTGATATGC 4260 

GACAAGACTC TAGCGTCTAT GAAGCCTGTG TGGCAGAACT CTTGAAATCA GCAGGAATTC 4320 

ATTCTCGTTA TAGCGAGTTG AGCGAAGAAG AAAAGTGTGA CCTTCTCTTG AAAGAATTAG 4380 

AAGAAGATCC CCGAATTCTT TCTGCGACTC ACGCAGAAAA ATCAGAATTA TTAGCAAAAG 4440 

AATTAGCTAT TTTTAAGACG GCTCGTGTTT TGAAAGATAA GTTGGGAGAT GATGTCATCC 4S0O 

GTCAGACCAT CATTTCACAT GCAACCAGCC TTTCTGATAT GCTAGAATTA GCTATTCTGT 4560 

TAAAAGAAGT AGGACTGGTG GATACGGAAA GGGCGCGTGT TCAGATTGTT CCCCTTTTTG 4620 

AAACAATTGA AGACTTGGAT CATTCAGAGG AAACAATGAG AAAATATCTT TCTCTTAGCC 4680 

TTGCCAAAAA ATGGATTGAC TCACGAAATA ACTACCAAGA AATCATGCTT GGCTACTCTG 4740 

ACAGTAATAA AGATGGCGGT TACTTGTCAT CATGTTGGAC CCTCTACAAG GCTCAACAAC 4800 

AATTGACTGC TATTGGAGAT GAATTTGGCG TTAAGGTTAC CTTCTTCCAT GGTCGTGGTG 4860 

GTACTGTCGG TCGTGGTGGT GGGCCAACCT ATGAAGCCAT TACATCTCAA CCGCTCAAGT 4920 

CTATCAAGGA TCGTATCCGC TTGACGGAGC AGGGTGAAGT AATTGGGAAT AAATACGGTA 4980 

ACAAAGACGC CGCTTACTAT AACCTTGAAA TGCTAGTATC GGCAGCTATT AACCGTATGA 5040 

TTACTCAGAA GAAGAGCGAT ACCAATACCG CAAATCGTTA TGAAACCATT ATGGATCAAG 5100 

TAGTGGACCG TAGTTACGAT ATCTACCGTG ATTTGGTCTT TGGTAATGAG CATTTCTATG 5160 

ATTATTTCTT CGAGTCAAGT CCAATCAAGG CTATTTCAAG TTTTAATATT GGTTCTCGTC 5220 

CAGCCGCTCG TAAGACTATT ACTGAAATCG GTGGTTTGCG TGCCATCCCT TGGGTATTCT 5280 

CATGGTCACA GAGTCGTGTT ATGTTCCCTG GATGGTACGG GGTTGGTTCA AGCTTCAAGG 5340 

AATTTATCAA TAAAAATCCA GAGAATATTG CTATCTTACG AGATATGTAC CAAAATTGGC 5400 

CTTTCTTCCA ATCGCTTCTT TCAAATGTTG ATATGGTTTT GTCAAAATCA AATATGAATA 5460 

TTGCTTTTGA ATATGCTAAA CTTTGTGAAG ACGAGCAAGT TAAGGCCATC TATGAGACTA 5520 
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TTTTAAATGA ATGGCAAGTT ACTAAGAACG TTATCTTGGC TATTGAAGGA CATGACGAAC 5580 

TCTTAGCTGA CAATCCATAT CTAAAAGCTA GTCTGGATTA CCGTATGCCT TACTTTAATA 5640 

TTCTCAACTA TATTCAGTTG GAGTTGATTA AACGCCAACG TCGTGGAGAA TTGTCCAGTG 5700 

ATCAAGAACG ATTGATTCAT ATCACCATCA ACGGAATTGC GACAGGATTG CGTAATTCAG 5760 

GTTGATAATT TTCAAGAGTG AATGCTAAAA GTGAATATCA AAAAAATTCT AATAGACTAT 5820 

TGACAAGTAG TTTAAAAATG ATATAATTTA ACCATTCAGA AAAGTAATCA TACAAACTTT 5880 

TTAGAGAGTC TGTGGTAGCT GAAAACAGAT AAGTGGCAAT GATGAAAATT GGGCTGAATG 5940 

CTATTTAGAA TTTGAAATTA TAAAAATTCG GTAAGCACAC CTTACAGTGC ATCTCGTTAT 6000 

TGCGAGACTG AGCGATAGGG AAATTCCCTA TAATTGAGGT GGTACCGCGC ATCGACGTCC 6060 

TCACACAAGT TTTTTGTGTG AGGATTTTTT TGATGGAGGT TAGTATGGAA AGAAAACGAT 6120 

GGCGTCGCTT GTTTAGATAA GTGAAATATG TTAAAGGAAA TAAAAAGGAG AAACAGAATG 6180 

AAAAATAAAC GTTTAATTGG AATTATTGCT GCATTAGCAG TCTTAGTAGC AGGAAGCTTG 6240 

ATTTATTCTT CAATGAATAA ATCAGAAGCT CAGAATAATA AGGATGAGAA GAAAATAACC 6300 

AAGATTGGTG TGCTTCAATT TGTGAGCCAT CCATCCCTTG ATTTGATTTA TAAAGGGATC 6360 

CAAGATGGAC TTGCAGAAGA AGGATATAAA GATGATCAAG TTAAAATTGA TTTTATGAAC 6420 

TCAGAAGGTG ACCAAAGTAA GGTTGCGACA ATGAGTAAAC AATTGGTTGC AAATGGGAAT 6480 

GACCTTGTGG TTGGTATCGC AACACCAGCA GCCCAAGGGT TGGCTAGTGC AACAAAAGAC 6540 

CTACCGGTTA TCATGGCCGC TATTACAGAC CCAATTGGTG CTAACTTGGT TAAAGATTTG 6600 

AAAAAACCAG GTGGCAACGT TACAGGGGTA TCTGACCACA ATCCAGCTCA ACAACAAGTT 6660 

GAACTCATCA AGGCTCTGAC ACCGAATGTG AAAACAATCG GAGCTCTTTA CTCAAGTAGC 6720 

GAAGACAATT CAAAA 6735 
(2 J INFORMAT I ON FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 6516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

CTAGAGGATC CCAGCAGGTA AATTGGCTTC AGCTGGCAAA AAAGTTGCCC TCGTTGAACG 60 

CAGCAAGGCT ATGTACGGTG GAACTTGTAT CAACATTGGT TGTATCCCAA CTAAAACCTT 120 
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GCTAGTTGCT 


GCTGAAAAGG 


ACTTGTCTTT 


TGAAGAAGTC 


ATTGCTACTA 


AAAACACGAT 


180 


CACTGGTCGC 


CTCAACGGTA 


AAAACTATGC 


GACTGTTGCT 


GGTACAGGCG 


TAGATATCTT 


240 


TGATGCGGAA 


GCTCACTTCC 


TTTCAAATAA 


AGTCATCGAA 


ATCCAAGCTG 


GTGATGAAAA 


300 


GAAAGAACTG 


ACTGCTGAAA 


CAATCGTCAT 


CAACACTGGT 


GCTGTTTCAA 


ACGTCTTGCC 


360 


AATCCCTGGA 


CTTGCTACAA 


GCAAAAACAT 


CTTTGACTCA 


ACAGGTATCC 


AAAGCTTGGA 


420 


CAAATTACCT 


GAAAAACTTG 


GAATCCTTGG 


TGGCGGAAAT 


ATCGGTCTTG 


AATTTGCCGG 


480 


CCTTTACAAC 


AAACTTGGAA 


GCAAGGTCAC 


AGTCCTAGAT 


GCCTTGGATA 


CATTCCTACC 


540 


TCGTGCAGAA 


CCTTCCATCG 


CAGCTCTTGC 


TAAACAATAC 


ATGGAAGAAG 


ATGGCATTGA 


600 


ATTGCTTCAA 


AATATCCATA 


CTACTGAAAT 


CAAAAACGAT 


GGTG ACCAAG 


TGCTTGTCGT 


660 


AACTGAAGAC 


GAAACTTACC 


GTTTCGACGC 


CCTTCTCTAC 


GCAACTGGAC 


GCAAACCAAA 


720 


TGTAGAACCA 


CTTCAACTTG 


AAAATACAGA 


TATTGAACTA 


ACTGAACGTG 


GTGCTATTAA 


780 


AGTAGACAAA 


CACTGTCAAA 


CAAACGTTCC 


TGGTGTCTTT 


GCAGTTGGAG 


ATGTCAACGG 


840 


TGGCCTTCAA 


TTTACTTACA 


TTTCACTTGA 


TGACTTCCGT 


GTTGTTTACA 


GCTACCTTGC 


900 


TGGAGATGGC 


AGCTATACAC 


TTGAAGACCG 


TCTCAATGTG 


CCAAATACTA 


TGTTCATCAC 


960 


ACCTGCACTT 


TCACAAGTTG 


GTTTGACTGA 


AAGCCAAGCA 


GCTGATTTGA 


AACTTCCATA 


1020 


CGCTGTTAAG 


GAAATCCCCG 


TTGCAGCAAT 


GCCTCGTGGT 


CACGTAAATG 


GAGACCTTCG 


1080 


CGGTGCCTTC 


AAAGCTGTTG 


TCAATACTGA 


AACAAAAGAA 


ATTCTTGGAG 


CAAGCATCTT 


1140 


CTCAGAAGGT 


TCTCAAGAAA 


TCATCAACAT 


CATCACTGTT 


GCTATGGACA 


ACAAGATTCC 


1200 


TTACACTTAC 


TTCACAAAAC 


AAATCTTCAC 


TCACCCAACC 


TTGGCTGAGA 


ACTTGAATGA 


1260 


CTTGTTTGCG 


ATTTAAGTTG 


AGATTTAATC 


GTATCGAACA 


GCCCTCTTTG 


GGCTGTTTTT 


1320 


ACTTCTGCGG 


AATCTCAAAT 


CTGTCTTTCT 


CCTCTTTTAT 


GATATAATAG 


AAACATGAAC 


1380 


TTAAAAACTA 
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1440 


GGACGTGGAA 


GTACGCTCCC 


AGGGAAAGTC 


GCCCTTCAAT 


TTGATAAAGA 


TATTTTACAA 


1500 


AACCTAGCTA 


AGAACTACGA 


GATTGTCGTT 


GTCACTGGAA 


CAAATGGAAA 


AACCCTGACA 


1560 


ACTGCCCTCA 


CTGTCGGCAT 


TTTAAAAGAG 


GTTTATGGTC 


AAGTTCTAAC 


CAACCCAAGC 


1620 


GGTGCCAACA 


TGATTACAGG 


GATTGCAACA 


ACCTTCCTAA 


CAGCCAAATC 


TTCTAAAACT 


1680 


GGGAAAAATA 


TTGCCGTCCT 


CGAAATTGAC 


GAAGCCAGTC 


TATCTCGTAT 


CTGTGACTAT 


1740 


ATCCAGCCTA 


GTCTTTTTGT 


CATTACTAAT 


ATCTTCCGTG 


ACCAGATGGA 


CCGTTTCGGT 


1800 


GAAATCTATA 


CTACCTATAA 


CATGATATTG 


GATGCCATTC 


GGAAAGTTCC 


AACTGCTACT 


1860 


GTTCTCCTTA 


ACGGAGACAG 


TCCACTTTTC 


TACAAGCCAA 


CTATTCCAAA 


CCCTATAGAG 


1920 
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TATTTTGGTT 


TTGACTTGGA 


AAAGGGACCA 


GCCCAACTGG 


CTCACTACAA 


TACCGAAGGG 


1980 


ATTCTCTGTC 


CTGACTGCCA 


AGGCATCCTC 


AAATATGAGC 


ATAATACCTA 


TGCAAACTTG 


2040 


GGTGCCTATA 


TCTGTGAAGG 


TTGTGGATGT 


AAACGTCCTG 


ATCTCGACTA 


TCGTTTGACA 


2100 


AAACTGGTTG 


AGTTGACCAA 


CAATCGCTCT CGCTTTGTCA 


TAGACGGCCA 


AGAATAOGGT 


2160 


ATCCAAATCG 


GCGGGCTCTA 


TAATATCTAT 


AACGCCCTAG 


CTGCTGTGGC 


CATCGCCCGT 


2220 


TTCCTAGGTG 


CCGATTCGCA 


ACTCATCAAA 


CAGGGATTTG 


ACAAGAGCCG 


TGCTGTCTTT 


2280 


GGACGCCAAG 


AAACCTTTCA 


TATCGGTGAC 


AAGGAATGTA 


CCCTTGTCTT 


GATTAAAAAT 


2340 


CCAGTCGGTG 


CAACCCAAGC 


TATCGAAATG 


ATCAAACTAG 


CACCTTATCC 


ATTTAGCCTA 


2400 


TCTGTCCTCC 


TTAATGCCAA 


CTATGCAGAT 


GGAATTGACA 


CTAGCTGGAT 


CTGGGATGCA 


2460 


GACTTTGAAC 


AAATCACTGA 


CATGGACATT 


CCTGAAATCA 


ACGCTGGCGG 


TGTTCGTCAT 


2520 


TCTGAAATCG 


CTCGTCGCCT 


CCGAGTGACT 


GGCTATCCAG 


CTGAGAAAAT 


CACTGAAACG 


2580 


AGTAATCTGG 


AGCAAGTTCT 


CAAGACCATT 


GAGAATCAAG 


ACTGCAAGCA 


TGCCTATATT 


2640 


CTGGCAACTT 


ATACTGCCAT 


GCTGGAATTT 


CGTGAACTGC 


TGGCTAGTCG 


TCAGATTGTT 


2700 


AGAAAGGAGA 


TGAACTAATG 


GTTTATACTT 


CACTTTCCTC 


AAAAGATGGC 


AATTACCCCT 


2760 


ATCAGCTCAA 


CATTGCCCAC 


CTCTACGGAA 


ATCTCATGAA 


TACtACGGGG 


ACAATGGAAA 


2820 


CATCCTCATG 


CTCAAGTATG 


TGGCTGAAAA 


ACTGGGAGCC 


CATGTGACCG 


TTGACATCGT 


2880 


TTCTCTCCAT 


GATGACTTTG 


ATGAAAATCA 


CTACGACATC 


GCCTTTTTCG 


GTGGTGGTCA 


2940 


AGACTTTGAA 


CAAAGTATCA 


TTGCAGACGA 


CCTACCTGCT 


AAAAAAGAGA 


GCATTGACAA 


3000 


CTACATCCAA 


AACGACGGTG 


TAGTTCTGGC TATCTGCGGT 


GGTTTCCAAC 


TATTGGGTCA 


3060 


ATATTATGTT 


GAAGCTTCAG 


GAAAACGTAT 


CGAAGGGCTA 


GGGGTCATGG 


GACACTACAC 


3120 


GCTCAACCAG 


ACCAATAACC 


GTTTTATCGG 


TGACATCAAG 


ATTCACAATG 


AAGATTTCGA 


3180 


TGAAACCTAC 


TATGGATTTG 


AAAATCACCA 


AGGTCGTACC 


TTCCTCTCTG 


ATGACCAAAA 


3240 


ACCGCTGGGA 


CAGGTTGTCT 


ATGGAAATGG 


AAACAACGAA 


GAAAAGGTCG 


GTGAAGGGGT 


3300 


TCATTATAAG 


AATGTCTTTG 


GTTCCTACTT 


CCACGGGCCT 


ATCCTCTCTC 


GTAATGCCAA 


3360 


TCTGGCTTAT 


CGCCTAGTTA 


CTACTGCCCT 


CAAGAAGAAA 


TATGGTCAGG 


ACATGCAACT 


3420 


CCCTGCCTAT 


GAGGACATTC 


TCAGCCAAGA 


AATCGCTGAA 


GAGTACAGTG 


ACGTCAAAAG 


3480 


CAAGGCTGAC 


TTTTCTTAAA 


CAAAGGAAAA 


TGATATCAAA 


GAACTCCGTT 


ATCTTGTCGG 


3540 


AGTTTTTTGT 




ACCCTTCTCC 


CTTGCATTTT 


CTCTCATTTT 


TTGCCAAAAT 


3600 


AGAGGGGTAG 


AAAGAAGGTA 


GCATATGTCT 


AAATTACAAC 


AAATCCTAAC 


ATATCTTGAA 


3660 
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TCAGAAAAAC TAGACGTCGC TGTCGTATCT GACCCCGTCA CAATCAATTA CCTCACTGGT 3720 

TTTTACAGTG ATCCCCATGA ACGCCAAATG TTCCTCTTTG TCCTAGCAGA TCAGGAACCT 3780 

CTCCTCTTTG TCCCAGCTCT TGAAGTAGAA CGTGCAAGTA GCACCGTTTC CTTCCCAGTA 3840 

GTGGGCTATG TCGATTCTGA AAATCCATGG CAAAAAATCA AACATGCTCT TCCACAACTT 3900 

GACTTCAAAC GTGTCGCTGT TGAGTTTGAC AATCTCATCT TGACCAAATA CCATGGTTTG 3960 

AAAACAGTTT TTGAGACTGC TGAGTTTGAC AACCTCACTC CTCGTATCCA ACGCATGCGC 4020 

CTCATCAAAT CAGCTGATGA AGTGCAAAAA ATGATGGTTG CAGGTCTTTA TGCTGACAAG 4080 

GCTGTTCATG TTGGTTTTGA CAATATTTCT CTTGATAAGA CTGAGACAGA TATCATCGCA 4140 

CAAATCGACT TTGCCATGAA ACGTGAAGGT TATGAAATGA GCTTTGATAC CATGGTCTTG 4200 

ACTGGTGATA ATGCTGCGAA TCCACACGGC ATTCCAGCAG CTAATAAGGT TGAAAATGAT 4260 

GCTCTTCTCC TCTTTGACCT GGGTGTTCTG GTCAATGGCT ATGCGTCAGA TATGACTCGT 4320 

ACAGTCGCTG TCGGCAAACC AGACCAATTC AAGAAAGATA TTTACAACTT GACTCTTGAA 4380 

GCCCAACAAG CTGCTCTTGA CTTTATCAAG CCAGGTGTGA CTGCTCATGA AGTGGACCGC 4440 

GCTGCCCGTG AGGTCATCGA AAAAGCTGGT TATGGTGAGT ACTTCAACCA CCGTCTCGGG 4500 

CATGGTATCG GTATGGATGT CCATGAATTC CCATCTATCA TGGAAGGAAA 'CGACATGGTC 4560 

ATCGAAGAAG GCATGTGCTT CTCTGTTGAA CCAGGTATCT ATATCCCTGG TAAAGTCGGT 4620 

GTTCGTATTG AAGACTGCGG TGTTGTTACC AAGGATGGCT TCAACCTCTT TACAAGCACC 4680 

AGCAAAGATT TGCTTTATTT TGATTAAACT ATATAGCCCC TATGCTTTCC TTTCAAAATA 4740 

TCTAGGGGCT ATTTTATTGT CATTTTTCTG CTATTATGCT AAAGAAATTG GCTGCAATAA 4800 

TCTAACCCTA AGTGTCTGGA ATGATAACGA GGGTGCTCTC CGCTTTTATC AAAGACAAGG 4860 

GATGAAACCC CAAGAAACAA CAATGGAAAT GATAATTGAT TAAGAAGTCA TCTATCAAAA 4920 

GATGTTAGAA AAAGTTCAAT TTCACTAGAA AATGAGGAAA ATCTCCCCAC AATAAAACGC 4980 

ATAGTATCAG GTATTGTGTA CTGACCCCAA ACAGTTAGAC AATTAATTTA TCCGAAGGAT 5040 

TTAGTTCTGT ACTGCACAGG ACTAAGTCCT TTTAGTTTTA CCTTAATTCG TTTGTTGTTG 5100 

TAGTAATCAA TATAGTCTAT AATGACTTGT TCCAATTGGT TAAGTGATTT AAATGTTTTC 5160 

TCATAGCCAT AAAACATTTC GGATTTTAAA ATGCCAAAGA AAGATTCCAT CATACCGTTG 5220 

TCTTGGCTGT TTCCCTTGCG TGACATAGAT GCTTGAATTC CCTTATTCTC TAGGAACCGA 5280 

TGATAAGAAT CGTGTTGGTA TTGCCAGCCT TGGTCACTAT GGAGAATCGT ATTCTCGTAG 5340 

TGCTTCTCTT TGAATGCCTG TTCCAACATT GTTTGTACTT ATTCTAAATT AGGCGAACAA 5400 

GAAAGATTAA AAGCAATAAT TTCGCTGTTA AAGCCATCTA AAACTGGTGA TAAGTAAAGC 5460 
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TTTTGAGTAC TTGCTGGAAT GGCAAATTCA GTCACATCTG TGTAGCACTT TTCCATTGTT 5520 

TTAGAGCCTT CAAATTGGGC TTGAATGAGA TTCTCTGCCT TCTTACCAAC GTCTCCTTTA 5580 

TGAGAAGAAT ATTTTCGTTT CTTTCGCATT TTAGCTTGTA AATTGAGTAC TTTCATCAAG 5640 

CCTTGAACTC TTTTATGATT TACCAGATAA CCACGATTTC TTAGTTCTAA ATGAACCCGG 5700 

CGATAAGCAT AATTTCCCTT GTGTTCGATA AAGATGGATT GAATTTCAGT TTTAAGCTCT 5760 

TGGTCTTTAT CTGTTTTGTC TAGCTGTTTC AAGTGATAGT AGTAGGTCCA ACGAGCTAGT 5820 

TTAATGGCTT CTAGAAGAAG ATCTAACGAA AACTCAGTCA TTAATTCTTG AACAATTTCT 5880 

GTCTTTCTTC TTTCTCTTTT TCCTCCTTCA ATCGGAGTTC TCTTAACTTT TTTAGGATGG 5940 

CATTCTCCGC TCTCAGGTAC TCTCCCTCTT GTTTTCTCAA CAATAGTATA CCCGTTTTTC 6000 

CTGTATTGTG CTAGCCAGTT AAGAAGTATC GTACGACTTG GGAGACCGTA TTCAAGAGAA 6060 

ACTCTATCTT TAGTCCAGCC TTCATGTCAG ACTTTATTAA CCCCAATTAT TCACCCCAAA 6120 

TCTAAAAACC ATCCAGAATC CTTGCCTTAG CTTAGATCCT GGATGGTTTC TTTTTTCACC 6180 

CAATGGGTGT TTTTTACTAG ACAAAAAAGA GTTTCCCCTT TATGGTATAA GTGTAGAAAA 6240 

AAACACAAAA AGAAAGGAAA CTCACATGAA CAGTTTACCA AATCATCACT TCCAAAACAA 6300 

GTCTTTTTAC CAACTATCTT TCGATGGAGG TCATTTAACC CAGTATGGTG GTCTTATCTT 6360 

TTTTCAGGAA CTTTTTTCCC AGTTGAAACT AAAAGAGCGG ATTTCTAAGT ATTTAGTAAC 6420 

GAATGAmCAA CGCCGCTACT GTCGTTATTC GGATTCAGAT ATCCwTGTCC AGTTCCTCTT 6480 

TCAACTGTTA ACAGGTTATG GAACGGAATA TGCTTG 6516 
(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14654 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION : SEQ ID NO: 106: 



TTTTCAACCC 


ATATCGTGGC 


TCCTGAATAC 


TACTTACTGA 


CAACTATGCT 


ATCAGAGACT 


60 


TCTCTACTTG 


TTTTCTATAT 


CATTTTCATC 


CATAGAAAAC 


AACTCATCCA 


CTTGGGACAT 


120 


ATCTTTAGCT 


ATACTGTTCG 


ATACTCTCTC 


TTTTCACTTT 


CCTTTGTAGC 


AATTTATTTC 


180 


CTGATTAATT 


TCGTGTATCC 


TGTAGATATG 


GTCATTAATT 


TGCCATTTTT 


GATTAATACT 


240 


GGTTTGATTG 


TCTTGCTATC 


AGCTATCTCT 


TATATTAGTC 


TACTTGTCTT 


CACAAAAGAT 


300 
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AGCATTTTCT ATGAATTTTT AAACCATGTC CTAGCCTTAA AAAATAAATT TAAAAAATCA 360 

TAGGAGTTTA AAATGAAACA ACTAACCGTT GAAGATGCCA AACAAATTGA ATTAGAAATT 420 

TTGGATTATA TTGATACTCT CTGTAAAAAG CACAATATCA ACTATATTAT TAACTACGGT 480 

ACTCTGATTG GGGCGGTTCG ACATGAGGGC TTTATCCCTT GGGACGACGA TATTGATCTG 540 

TCCATGCCTA GAGAAGACTA CCAACGATTT ATTAACATTT TTCAAAAGGA AAAAAGCAAG 600 

TATAAGCTCC TATCCTTAGA AACTGATAAG AACTACTTTA ACAACTTTAT CAAGATAACC 660 

GACAGTACGA CTAAAATTAT TGATACTCGA AATACAAAAA CCTATGAGTC TGGTATCTTT 720 

ATCGATATTT TCCCTATAGA TCGCTTTGAT GATCCTAAGG TCATTGATAC TTGTTATAAA 780 

CTGGAAAGCT TCAAACTGCT GTCTTTCAGT AAACATAAAA ATATTGTCTA TAAGGATAGC 840 

CTTTTAAAAG ATTGGATACG AACAGCCTTC TGGTTACTCC TTCGACCGGT TTCTCCTCGT 900 

TATTTTGCAA ATAAAATCGA GAAAGAAATT CAAAAATATA GTCGTGAAAA TGGGCAATAT 960 

ATGGCTTTTA TCCCTTCAAA ATTTAAGGAA AAGGAAGTCT TCCCAAGTGG TACCTTTGAT 1020 

AAAACAATCG ATTTACCCTT TGAGAATTTA AGCCTTCCTG CACCTGAAAA ATTTGATACT 1080 

ATTTTGACAC AATTTTATGG AGATTATATG ACCCTACCAC CAGAAGAAAA ACGCTTCTAC 1140 

AGTCATGAAT TTCACGCTTA TAAATTGGAG GATTAGGATG CAATATTTAG AAAAAAAAGA 1200 

AATTAAAGAA ATTCAACTAG CCCTGCTGGA CTATATTGAT GAGACTTGTA AGAAACATGA 1260 

TATTCCTTAT TTTCTCAGTT ATGGAACCAT GCTTGGAGCC ATCCGCCACA AAGGTATGAT 1320 

TCCTTGGGAT GATGATATTG ATATTTCCCT TTATCGTGAG GATTATGAGC GTTTACTGAA 1380 

GATTATTGAA GAAGAAAATC ACCCTCGCTA CAAGGTTCTT TCCTACGATA CATCTTCTTG 1440 

GTACTTCCAT AATTTCGCAT CGATTTTGGA CACTTCTACT GTTATAGAAG ACCATGTTAA 1500 

GTACAAGCGT CATGATACCA GCCTTTTCAT CGATGTCTTC CCAATTGATC GATTTACAGA 1560 

CTTGAGCATT GTCGACAAGA GCTATAAGTA TGTGGCTCTT CGTCAACTAG CTTATATCAA 1620 

AAAATCACGA GCAGTTCACG GTGATAGCAA ACTAAAAGAT TTTCTTAGAT TATGTAGCTG 1680 

GTACGCTCTC CGATTTGTCA ATCCTCGCTA CTTTTACAAG AAAATTGATC AACTAGTCAA 1740 

AAATGCTGTA ACCAACACTC CTCAATATGA AGGAGGAGTT GGGATCGGTA AGGAAGGGAT 1800 

GAAAGAAATC TTCCCAGTTG ATACCTTTAA AGAACTGATT TTAACTGAGT TTGAGGGCCG 1860 

TATGTTGCCT GTTCCCAAAA AATATGACCA ATTTTTAACC CAGATGTATG GCGATTATAT 1920 

GACACCACCA TCAAAAGAAA TGCAAGAGTG GTATAGTCAT AGCATTAAAG CTTATCGCAA 1980 

AAACTGATTG AGGGGGATTA TACAAACTAC TAAGATAGAG GTTATTCAAA AACATAATTT 2040 

TAGTAGAAAA TGAAATACAT ATTCCCACAA TAAAACGCAT CATATCAAGG TTTTTGAAAA 2100 
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ACCTTGATAT GATGCGTTTT ATAATTTTAA AGACTTTTTT CTATAGTAGA TTGAAATAAG 2160 

ATGCGAACAA ATCAATTAGA AAATTCAAAT TAATTTATAG AAATATTTTA GTATTCCTGT 2220 

GTACTGTTCT -AAATTCAGTC TGCTATATCT TATTTTTCTA TTTAAATCGC TTCTGTAACA 2280 

AAGCTACGAC TTTCAAGTAC CTTAAGCATG GCATTAGCTG TATCTAGCGC TGTGAAGAGG 2340 

GGCACCCCGT GTTCAATGGC TGAACGACGA ATTTGCTCAC CATCTTCGTC AGCAGTTCGT 2400 

TTTGTTCCTA CTGTGTTAAT GATAGCTTGA ATTCTTCCTT TGCGTACAAA ACTTGGGATA 2460 

TCCTTATCGT CATCACCAAT CTTACCAACA GGTTGGGCTT GCAAGCCATG ACTAGCAAAG 2520 

AAGGCTGCTG TCCCTTCTGT CGCAAGGATT CCATAACCAA TGTTTTGGAA ACGACGAGCC 2580 

AAGTTCAAGG CTTCTTCTTT GGCATCATCA GCGATGGTAA AGACGACATT ACCAAAAGTT 2640 

GGCAAGTGTA GATAAGAAGC TTCAAAGGCT TTATAGAGAG CTTTTTCCAA AGTAGCATCA 2700 

GAACCCATAA CTTCACCTGT TGACTTCATT TCAGGACCGA GCAAGCTGTC TACCTTAGCT 2760 

AGTTTGGTAA AGGAGAAGAC AGGTGCCTTG ATATGAACAC GGGTGCTTTC AGGGTAAAGT 2820 

CCATTTTGGT AGCCAAGTTC TGATAAACTT TGACCAAGAA TGAGTTTGGT CGCTACTTGA 2880 

GCCATAGGAA TATTGGTTAC CTTAGATAGG AATGGAACAG TACGGCTGGC ACGTGGATTG 2940 

ACCTCAATAA CGTAGACTTT TTCATCCTTG ATAACAAACT GGATGTTCAT CATTCCAAGG 3000 

CAGTGAAGAC CGATTGCTAA GCGTTTGGTG TAGTCTGCGA TGGTCTCCTG AACCTTTTGC 3060 

GACAAGGTTT GTGGTGGGTA AACAGCCATT GAGTCACCTG AGTGGACACC AGCACGTTCG 3120 

ATATGCTCCA TGATACCAGG AATGAGTACA TTTTTACCAT CTGAAATGGC ATCAACTTCG 3180 

CACTCTTGCC CAACGATATA AGAGTCGACA AGAACTGGGT GGTCTGGACT AGCCTTAACA 3240 

GCAGTTCGCA TGTAAGAACG AAGGTCTTCT TCGTTTTCAA CGATTTCCAT GGCACGTCCA 3300 

CCAAGTACAT AAGATGGGCG GACAAGAACT GGGAAGCCAA TCTTGCGAGC TGCAAGAGCT 3360 

GCTTCTTCTT CATTGGTAGC CGTTTGTCCT GGTGGCTGTG GAATATCCAA TTCTTTGAGA 3420 

GCTTGCTCGA AGAGGTCACG GTCTTCGGCA CGATCTAGGT CAGCAACCTG TGTACCAAGG 3480 

ATGGTCACAC CTGCTTTTGC CAATGGCTCC GCAAGGTTGA TGGCTGTTTG ACCACCGAAC 3540 

TGAACGATAA CTCCCTTTGG TTGTTCCAAG TCAATGACGT TCATAACATC TTCGAATGTC 3600 

AATGGCTCAA AGTAAAGCTT ATCTGATACA GAGAAGTCTG TTGAAACGGT CTCTGGGTTT 3660 

GAGTTCATGA TGATAGCTTC ATAACCAGCT GCCTGGATAG CCTTAACAGA GTGAACGGTT 3720 

GCGTAGTCAA ACTCAACCCC TTGACCGATA CGGATTGGAC CTGAACCTAG GACAAGTACA 3780 

GATTCTTTAT CAGATCTGAT AGATTCATTT TCCCAACCAT AGGTTGAATA GAAATATGGC 3840 
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GTTTCGGAGT 


CGAACTCTGC 


CGCACAAGTG 


TCTACCATCT 


TATAAACTGG 


AACAATCTTG 


3900 


TTTTCCAAGC 


GAAGTTGGCG 


AACTTTATCA 


TCAGTCGTTC 


CCCAGAGTTC 


AGCAATCTTA 


3960 


CGGTCTGAAA 


AACCATTAAG 


TTTGGCTGTT TTCAAAACTT 


CTAAATCTTG 


TGGATGAGCA 


4020 


CCCAATTCTT 


GCTCAATTTC 


AAAGATATGC 


AAGAGTTTAT 


CAAGATAGAA 


GATATCAATT 


4080 


TTTGTAAGCT 


CTGCAATTTC 


TTCAGGTGTG 


TAGCCACGAC 


GAATGGCTTC 


TGATACGTAG 


4140 


AAGAGACGGT 


CATCTTGGGC 


TTTGACAACC 


TTTTCAATCA 


AGGCATCATC 


AGAAACTGCT 


4200 


GCAAGTTCAG 


GTATTTCATT 


GTGGTGCACC 


CCAATTTCAA 


GGGAGCGGCA GGCCTTGAGA 


4260 


AGAGATTCCT 


CGATGTTACG 


ACCGATTGCC 


ATGACTTCTC 


CAGTCGCCTT 


CATTTGTGTA 


4320 


CCGAGACGGC 


GTTCACCCTT 


TTCAAACTTG 


TCAAATGGGA 


AACGTGGAAT 


CTTAGCAACT 


4380 


ACGTAGTCAA 


GGGCTGGTTC 


AAACATGGCA 


TAGGTTGAAC 


CTGTAACTGG 


GTTTATAACC 


4440 


TCATCCAAGG 


TCAAACCTAC 


TGCAATCTTG 


GCAGCCAACT 


TAGCAATCGG 


ATATCCTGTC 


4500 


GCTTTAGAAG 


CAAGGGCTGA 


CGAACGTGAT 


ACACGAGGGT 


TTACTTCGAT 


AACATAATAC 


4S60 


TTGAAGCTGT 


TAGGATCAAG 


AGCTAGCTGA 


ACATTACATC 


CACCTTCAAT 


CTTGAGGGCA 


4620 


CGAATAATGC 


TCAAGCTCGC 


ATCACGAAGC 


ATTTGGTTTT 


CATAGTCTGA 


CATGGTTTGC 


4680 


GCAGGGGCAA 


ATACAATGGA 


ATCCCCTGTG 


TGAATCCCAA 


CTGGGTCAAA 


GTTTTCCATG 


4740 


TTACAAACAA 


CCAAGGCATT 


GTCAGCTGAG 


TCACGCATCA 


CTTCGTATTC 


AATTTCCTTG 


4800 


AAACCGGCAA 


TCGAACGCTC 


AATCAAACAT 


TGGGTAACAG 


GTGACAATTT 


CAAACC ATTT 


4860 


TCAGTGATTT 


CACGCAATTC 


TTTCTCGTTG 


GCACACATAC 


CACCACCAGT 


ACCACCAAGG 


4920 


GTAAAGGCTG 


GACGAACGAT 


GACTGGGTAG 


CCAATTGTCG 


CTGCAAAGGC 


AACTGCTTCT 


4960 


TCTACTGTGT 


TAACAATTTC 


AGATTCTGGA 


ATGGGTTGTT 


CAAGCTCTTC 


CATCAATTGT 


5040 


TTAAAGAGGT 


CACGGTCCTC 


CGCTTGGTCA 


ATGGCAGATA 


ATTTGGTACC 


CAGAAGTTCA 


5100 


ACGCCAAGCT 


CGTCTAGGAT 


ACCATTTTTA 


GATAATTCCA 


TGGCCATGTT 


GAGACCTGTC 


5160 


TGACCACCGA 


GTGTTGGTAG 


CAAGGCATCT 


GGACCTTCCT 


TACGAAGAAT 


ACGTGTCACA 


5220 


AACTCAAGTG 


TAATCGGTTC 


AATGTAAACC 


TTGTCAGCAA 


TTTCCTTGTC 


CGTCATGATG 


5280 


GTTGCAGGAT 


TTGAGTTAAC 


CAAAACAACC 


TCATAACCTT 


CCTCTTTCAA 


CGACAAGCAA 


5340 


GCCTGAGTCC 


CAGCGTAGTC 


AAACTCAGCA 


GCCTGACCAA 


TAATAATCGG 


ACCAGAACCA 


5400 


ATCACCATAA 


TTTTTTGAAT 


ATCAGTACGT 


TTAGGCATAT 


ATAAGATATT 


AAGGGTGTCA 


•5460 


AGCGGACAAA 


GCTAAAATAG 


GAGTTATGAC 


GAAGAACTGT 


CAGTTCTAGG 


AATAACTATC 


5520 


TTTTTAGCAC 


CGTCCGTAGC 


CCGTATTCAG 


TTCAGCAAAT 


ACGGAGCACC 


CTTCTCCTTT 


5580 


CTATTCGTCG 


CCTCTCAGGG 


CGACATTAAA 


TAAGATACAA 


AGGACGAATA 


GAAAGCGATT 


5640 
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GAATTTTAGG 


AAATCAAGGA 


AGGATTGACA ATCCAAGTTG 


GTTTCTCTAC 


ATTCTGAGCT 


5700 


TTCCGTCCGT 


GTTCAGTTAC 


ATA A ITTfTP 


ffi a r*r* » /"•/->rp rp 
\~unl_uiAlji, 1 1 


TTACTCGTTC 


TTAGTTTGAT 


5760 


TGTTTAAAAA 


CTTCCATCAT 


1 L«A i AAAL 


1 V_o 1 LAAA I A 


GGTAGCTAGC 


GTCGTGTGGC 


5820 


CCAGGAGCTG 


CATCTGGGTG 


ulAl luAAljA 


(jAIaAAAULACj 


GTTGGTATCT 


GTGGCGCACA 


5880 


CCTTCCACTG 


ACTTGTCATT 


GATTTCTTCG 


IuUCjI AA I AA 


TCAAGTGCTC 


TGGCAAATCC 


5940 


TCGCGGCTGA 


CTGCATAACC 


A 1 Uu i 1 K.I tivj 


CTGGTGAAGT 


CTACTCGTCC 


TGTTGCGATT 


6000 


TCACGTACCG 


CATGGTTGAA 


TCCACGGTGG 


CCAAACTTCA 


TCTTATAGGT 


CTTAGCCCCG 


6060 


TTTGCCATTG 


CAAAGAGTTG 


GTGTCCCATA 


CAAATACCAA 


AGATTGGAAT 


TTTTCCTTGT 


6120 


ACACCGCGAA 


TCATGTCGAG 


TGCTTGTGGA 


ACGTCTTCTG 


GGTTACCTGG 


ACCATTTGAC 


6180 


AACATAACTC 


CGTCAGGATT 


GAGATGGAGA 


ATTTCTTCAG 


CCGTTGTCGA 


ATAAGGAACA 


6240 


ACTGTCACGT 


TACAGTTGCG 


TTTAGAAAGT 


TCACGTAGGA 


TTGAGTGCTT 


GAGACCAAAG 


6300 


TCCACTAGCA 


CCACGCTCAA 


ACCAACTCCT 


GGAGCTGGAT 


AAGACGTTTT 


AGTAGAAACC 


6360 


TGTTTGATAT 


TGTCTGTCGG 


TAAAACTGTT 


GCTTGGAGCT 


GGTCCGTCAC 


ATGGTCCATA 


6420 


CTGTCCCCAA 


CATGGGTCAA 


GGTTGCACGC 


ATAGTACCAT 


GCTTACGGAT 


AATCTTGGTA 


6480 


AGAGCACGCG 


TATCAATTCC 


rti/^ x * * m/*v^^m 

TGAAATCCCT 


GGAATTTTCT 


TGGCTTTCAA 


AAATTCATCC 


6540 


AAGGTCATTT 


GGTTGCGCCA 


GTTGCTAGCT 


CTACGCGCTT 


CTTCAAAAAC 


AACGACTCCC 


6600 


TTACAAGTTG 


GAATAATGGA 


I ILAiAATCA 


TCACGATTAA 


TACCATAATT 


TCCTACCAAA 


6660 


GGATAAGTAA 


AGGTCAAGAT 


TTGTCCATTA 


1 AA\j At, iUi 1 


CTGTAATGGA 


TTCTTGGTAG 


6720 


CCGGTCATCC 


CTGTATTAAA 


urA^.uA 111 


LL 1 Vj 1 1 AL.A 1 


CAATATCTGC 


TCCGAAGGCC 


6780 


TTGCCTTCAA 


AAACTGTGCC 


ATCTTCTAAT 


ACTAGAATTC 


TTTTTGTCAT 


ATTTTCACCT 


6840 


CTCGTGGACG 


CTCACTGGCG 


TCTTTTAACG 


TCTTGTGTTT 


TAGTTGGCGT 


TTCTACTCGC 


6900 


TAGTACGGAT 


TCTAAGATTG 


CCATTCGAAC 


AAAGACACCA 


TTGGTCATTT 


GTTGGACAAT 


6960 


CCGTGATTTT 


GGTGCTTCAA 


CCAAGTGGTC 


TGCTATTTCT 


ACATCACGAT 


TGATTGGAGC 


7020 


TGGGTGCATG 


AGGATTGCTG 


TTTCTTTCAA 


ACGATCGTAA 


CGTTCTTGAG 


TCAAGCCATG 


7080 


TTGGGCATGG 


TAGTCTTCTT 


TTGAAAATAC 


AGCTCCACTA 


TCATGGCGTT 


CGTGTTGCAC 


7140 


ACGGAGAAAC 


ATCATGACAT 


CAACCTGATC 


AATGATTTCA 


TCAATGGTTA 


CAAACTGTCC 


7200 


ATAGTCTGCA 


AACTCTTGAC 


TTCTCCATTC 


CTCAGGTCCA 


GCGAAAAAGA 


GTTCAGCTCC 


7260 


CAAGCGTTTC 


AAAATCTGCA 


TATTGGATTT 


GGCAACGCGT 


GAGTGGTCCA 


AGTCACCTGC 


7320 


AATAGCAACT 


TTAAGACCCT 


CAAAGTGGCC 


AAATTCCTCA 


TAAATGGTCA 


TCAAATCAAG 


7380 
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CAAGCTCTGG CTAGGGTGTT GGCCCGAACC ATCTCCACCA TTGATGATGG AAGTCGTAAT 7440 

CGTTGGACTA GCAATCAATT CTCTATAGTA GTCGACCTCT , GGATGGCGAA TCACACAGAC 7500 

ATCCACTCCT AAAGCAGACA GAGTCAAAAT GGTGTCATAA AGTGTCTCAC CCTTATTAAC 7560 

CGAGCTAGTC TTCACATCAA AGTCAAGTCG TTCCAATCCA AGTTTAATCT CTGCGACTTC 7620 

AAAGGACTTA TGTGTCCGTG TAGAATCCTC AAAGAAGAGA TTGGAAACAA TCGGATGGTC 7680 

TTCATAGGGA AGCTGGGCTC CATTTTTAAA CTCAATTCCT CGCTTGATCA ATTTCATTAC 7740 

TTGATCGACA GTGAGGTCTT CCATGGACAC CACATGGTTC AATGCTTGTT GATTTTCTGA 7800 

CATGGCTACT CCTTTAACTT TCTAAGCTTC TTCAGTAATC AGAACTCTGT CTTGGTCATC 7860 

AAGTTCTGTC ATCTCTACGA TGATTTCTTC AGAACGACTG GTTGGGATAT TTTTTCCAAC 7920 

GTAATCTGGA CGGATTGGCA ATTCTCTATG TCCACGATCG ACTAGAACTG CTAAACTCAC 7980 

ACGCGCAGGA CGACCATGAC CGACAATATT ATCAATAGCA GGACGGATGG TACGACCTGT 8040 

ATAGAGCACA TCATCCACCA AGATAACTTC GCGGTCTGTC ACATCGACAG AAACCAAAGA 8100 

AGTATCTTCT CCACTTTTAA CATCATCACG GAAAGGTTTA GTATCCAATT CCACAACAGG 8160 

AACTGAAAGA TTTTCTAACT GCTTCAAACG TTCTTGGATT CGGTGGGCAA TAAAGACACC 8220 

ACGAGTTTTA ATACCAGCCA AGACGATCTT ATTCAAATCT TTGTTGCGTT CGATAATCTC 8280 

ATAAGTAATA CGCGTAATCG CTCGTTTGAC GGTCAATTCG TCTACAACTT CTTTTGTTTT 8340 

CATGACAAAC CTCCAAAAAG AAAAGTCTCC TTAAACAAGG AGACTTGAAA TTTATAGCCA 8400 

AGCGAGCCCT ACTGCACACA GTATAGACTT CACCCTTCTA CTTTATCGCG CTCCTTGCCT 8460 

GCCTCACGGG ACAGGTTTAA AGGAATATTT AGTTATCATT TACTATAGCA CAAAGCATGC 8520 

TTAAAATCAA GCAAAAAGTT TCAATGTAGC ATCTTACAAA TTGCTAAAAT CATATAATTG 8580 

TGGGTACTGG TCACACTCTG GATTTTTTGG ATGGCAAATG GCTCTTCCAA AATAAATCAT 8640 

GGCCTGATGG GCAGCTAACC ACTGCTCAGG CGGCAAGATA TCCATGACCC GCTTTTCCAC 8700 

CTCAAGTGGC GTCGCTGATT TTTTGACAAT ATCGTGGTGT TTGCAAATAC GCTCCACATG 8760 

AGTATCCACT GCAAAGGCTG GAATTCCAAA TCCTACACTC ATGACAACAT TGGCTGTCTT 8820 

GCGACCAACA CCTGCCAAAC TCTCCAATTC TTCACGTGTC TGAGGGACTT GACCATCAAA 8880 

ATCGTCTAGT AACTGTTGGG CACATTTTTT AAGGAATTTA GCTTTATTCC GATACAATCC 8940 

CAAGCGAGAA ATATGTGAAG CAATCTCACT CTCTGTCGCT ACAGACATAG CTTGGGGTGT 9000 

TGGAAAGGCA ACAAAGAGAC CTGGTGTGGC CTTATTTACC GCTGCATCTG TCGTCTGGGC 9060 

TGATAACATG ACCGCAACCA GGAGTTCAAA ATGATTGGTA AAATCAAGAC TAGGCTTGGC 9120 

ATCTGGGAAG AGGGCAATGA TTTCTTCTAG CACCTTTCGT GCTCGTTTTT TTGACAAGAC 9180 
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CATTATTCAT CTCCGTCAAA TAGTCCTTGT AAGCCAGCAA AAGGACTGTT TTCTTCTTTC 9240 

TTTACTGCTT TTTGAGCTTG GTATTCTTCC TCTGTCATGA TTTGCCAGTC ATTTCCTGAG 9300 

ATAAATCCTT GACCAGCTTC TTCTTCAGCC GTCAAGACCT TGATAGGAAT GTTTAGCAGG 9360 

ATATTGTCTG ATACACTCTC AGCAAGGTCA AGCTCCCCAT TTTCGATGGG CAAGACCAAG 9420 

TCATCATCTA AAACTTCTTG ATCTAGCTGG TTAGTTGCGC CTTCCATGAA AACTTCCGTG 9480 

ACTGGATAAG ATTCAACTAA CTCAACTGGC TCCATACTGC GACTCGACGC AAGAACAATG 9540 

GTATAAGATA GTTGATAATC TAAGAAATAC ATACGGTCTT CATATTGTAC TTTCCCAACT 9600 

GCAAGGATAT CTTTTACATC TAAAATTTCT TGATTACGTG CACGCAGGTC ATCAACTAAA 9660 

TCTAACGTTT GTTCAAAGTT CAAACCTTCA GACTGCTTAC GAATTTCTTG AATATTTAAT 9720 

TTCATACTTC CTCCATAAAG ATTTACTCTC TTGATTATAC CATGAAAAGG CTACAAATCA 9780 

V 

GCACACCAAA CTTTGTAATT AAAATTCAAA ATTTTAACAT ATTTACTATG ATAGTTTTAT 9840 

TTTTTAGTGC TATACTATAG GGAAAGAGTA CATCAGATCA AGGAGGATGC TCACATGGAA 9900 

GACAAGAAAC TCATTCAACT CCTATCCAAG TTAAATAAAA GCTACCAAAA CTGTAAACAG 9960 

GGTACGGCAG ATGATATTCG ACTACAAGAG CTGCTAAACA CTACTATGCA AGAGCTCAAA 10020 

AAAACGGAAC AGTTGAACAA CAGTATCTTA ATTGATCTTG AGAAATTTTA CCAACCTACC 10080 

AGTCTTCTGA TTGGACTGGG TAGCCTAAAA CTAAACGATC AAGCACGCAC TGCTTGGCGA 10140 

AACTATGATA AATTCCATTA CGATCATGTC AAACACGTAC TAAGTCTCTA TGGACCTGTT 10200 

TTTGAATTTT AGAGCATAGA ATTTCCAGTT TTCTGTTGAC AAAATTTCCT TAAAGGTATA 10260 

ATATAAAGAT ACTAATACTC GGAGGTAAGG GAGACATGAA CAACTAAGTC TATCAAATAA 10320 

AGAACCTTTA TTTAGTAGAT CTTGTTTTTG TCTCTTTTTG TGTGCTCTTT TATGCTCTTT 10380 

TTCTGGCATG TTAATAGAGT TTTTTTGACA TAGACTTTGG GCTCTACTAG GTAAAGTAGA 10440 

GCTTTTTGTT ATGCACTATG AACATTCTAG AAAGGGAAAT CATATGATAA AAATCAATCA 10500 

TCTAACCATC ACACAAAACA AAGATTTACG AGATCTTGTA TCTGACCTAA CCATGACCAT 10560 

CCAAGACGGG GAAAAGGTTG CTATTATTGG TGAAGAAGGA AATGGCAAAT CAACCTTACT 10620 

TAAAATTTTA ATGGGGGAAG CTTTGTCTGA TTTCACTATC AAGGGAAACA TCCAATCTGA 10680 

CTATCAGTCA CTGGCCTACA TTCCTCAAAA AGTCCCTGAG GACCTAAAAA AGAAAACTTT 10740 

ACACGACTAC TTCTTTTTAG ATTCTATTGA TTTAGACTAC AGTATCCTCT ATCGTTTGGC 10800 

GGAGGAATTG CATTTTGATA GCAATCGTTT CGCAAGTGAC CAAGAGATTG GCAATCTATC 10860 

AGGGGGCGAA GCTTTGAAAA TTCAGCTTAT CCATGAGTTA GCCAAACCCT TTGAGATTCT 10920 
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ATTTTTAGAT GAACCTTCAA ATGACCTAGA CCTTGAGACA GTTGATTGGC TAAAAGGCCA 10980 

GATTCAAAAG ACCAGGCAAA CCGTTATTTT CATTTCCCAT GATGAAGACT TTCTTTCTGA 11040 

AACGGCAGAC ACTATTGTTC ACTTGCGACT GGTCAAACAC CGTAAAGAAG CGGAAACGCT 11100 

AGTAGAGCAT TTAGACTATG ATAGCTATAG TGAGCAGAGA AAGGCTAATT TTGCCAAACA 11160 

AAGTCAGCAA GCTGCTAACA ACCAAAGAGC CTACGATAAA ACCATGGAAA AACATCGGAG 11220 

AGTTAAGCAA AATGTAGAAA CTGCGCTTCG AGCTACCAAA GATAGTACTG CCGGTCGCCT 11280 

ATTGGCTAAA AAGATGAAAA CTGTCCTCTC ACAAGAAAAA CGCTACGAAA AGGCAGCTCA 11340 

GTCCATGACT CAAAAGCCAC TTGAAGAGGA ACAAATCCAA CTTTTCTTTT CAGACATCCA 11400 

ACCATTACCA GCTTCTAAAG TCTTAGTCCA ACTGGAAAAA GAAAATTTGT CCATTGACGA 11460 

CCGAGTTTTG GTTCAAAAAC TACAACTAAC TGTCCGTGGC CAAGAAAAAA TCGGTATTAT 11520 

CGGGCCAAAT GGTGTTGGGA AATCAACTCT GTTAGCCAAG TTACAGAGAC TTCTGAATGA 11580 

TAAAAGAGAG ATTTCACTTG GTTTTATGCC ACAAGATTAC CACAAAAAAC TGCAATTGGA 11640 

TTTATCCCCA ATAGCCTATC TCAGTAAAAC TGGGGAAAAA GAGGAACTAC AGAAAATCCA 11700 

ATCTCACCTA GCTAGTCTCA ATTTCAGTTA TCCAGAAATG CAGCATCAAA TTCGCTCCTT 11760 

ATCTGGCGGA CAACAGGGAA AACTCCTGCT TTTGGATTTA GTCCTGCGCA AACCAAACTT .11820 

TCTCCTGCTG GATGAACCCA CACGAAACTT TTCTCCCACT TCTCAACCCC AAATCAGAAA 11880 

ACTCTTTGCT ACCTATCCAG GCGGTCTCAT CACTGTTTCG CATGACCGTC GTTTCTTAAA 11940 

AGAAGTCTGC TCGATCATCT ATCGCATGAC AGAACACGGT TTGAAGCTAG TTAATTTAGA 12000 

AGATTTATAA ATTTGCAACA TAGCAAAAAT CCAGAGACGA CCTCTGGATT CTTTTACATC 12060 

TGTTTTAAAC GTTCAATCCG TTCTGAGATA GGTGGGTGGG TATAAAAGAG TTTTTGGAAC 12120 

CCCCCACCTT TCTTAGGATC ATTGATATAA AGGGCACTGC TAGCATCATC GAGGTGGCGA 12180 

CTCATAGGTT TGCTATTGTC CAACTTATCT AGGGCATTAA TCATTCCCTG GGGATTGCGA 12240 

GTCAGCTCGA CACTAGATGC ATCTGCCAGA AATTCCCTCT GACGAGAAAT AGCGAGCTGA 12300 

ACCAAGGTTG CAGCGAGAGG TGCCAGTACA ATAGCTAGTA GGGAAACCAC TAGCATAATG 12360 

ATTTCAAGAC CATTTCCATC TCGGTCATCA TCACTTCGTC TGCGACCTGC TCCACGCCAC 12420 

CACATCATAC GACCTGGCAT ACTAGAAAGC ATGGTGATAG CACTAGCAAG GGCAACTGCA 12480 

ATAGTCGAAA TACGGATATC ATAATTACGA ATATGACTGA CTTCATGTCC CATAACAGCT 12540 

TCTAGTTCTT CACGATTCAT GATAGCTAGT AGACCTGAAG TCGCAGCAAC AGCOGCATTT 12600 

TGAGGATTAG AACCTGTCGC AAAGGCATTT AAGGCTGGAT CATCAATGAT GAAAACACGG 12660 

GGCATAGGAA TCTGAGCGAC CAGAGCCATA TCTTCCACTA CATGGTAGAG GTCTGGTGCC 12720 
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GTTTGCTCAT CCACCTCACG CGCTCCATTC ATGGACATGA CAATCTCTGT CGATTGAAAA 12780 

ATCATAGACA AAGCGTAGAT AAAGCCGATA ATCAGTGCAA TAACCAAACC ACCAAGTCCA 12840 

GATCTTATAA AGAGATAACC AACCGCATAA CCAACAAGAG CTAAGAGTAG GAAAAATACC 12900 

AGCAACAAAA TCCAGGTTTT TCGTTTATTG CTTGCAATTT GATCAAACAA CATCTTAGTC 12960 

ACCTAAACCG CTAAAATCAA CTTTAGGAAC CGACTTTTCC TCTTCAGGTG TTTGAAGGAA 13020 

ATCTGCCGCT TTAAATCCAA ACATTCCAGC GATAATATTG CTCGGGAAAG TTTCTAATTT 13080 

TACATTGTAG TTGCTGACAA CACTGTTATA GAGTTGACGA GAGTAAGAAA TTTTATTTTC 13140 

TGTGTTTGTC AACTCCTCTT GCAATTTAAC AAAGTTAGCA CTAGCTTTCA AATCTGGATA 13200 

GCTTTCTGCA ACTGCAAAAA TACCTGAAAC CTGACGAGTG AGGGCATCAC TGGCTTTCAT 13260 

AGCTTCTGCT GGTGAAGTCG CTGCCGCCAC TTGGTTACGT AGTTCTGCCA CCTTTTCAAG 13320 

GGTAGAACCT TCATATTTGG CATAACCTTT TACAGTCTCA ATCAAGTTTG GCAAGAGGTC 13380 

ATTGCGACGT TTCAACTGAA CATCAATCTG ACTCCAAGCC TCCTTGGTTT GCATACGATT 13440 

TTTAACCAAA CCGTTATAGC TAACAATCAC AAAAATAACA ATAAGAGCGA TAACTCCAAG 13500 

AATAATCCAA GTCATAATAT AAGTCCTTTC TGCTTTTAGA TTAGTACCAG TATATCAAAT 13560 

TTTCTATGAT TGTGGTAAAA TAAGATGATA CTAAAGAAGG AAATAACTAT GAAACCAAAA 13 620 

ACATTTTACA ACTTGCTTGC CGAGCAGAAT CTTCCACTTT CGGACCAGCA AAAAGAACAA 13680 
TTTGAACGTT ATTTTGAGCT CTTGGTCGAG TGGAATGAGA AGATTAATTT GACGGCGATT . 13740 

ACGGACAAGG AAGAAGTTTA TCTCAAACAT TTTTACGATT CGATTGCACC CATTCTTCAA 13800 

GGTTTGATTC CCAATGAAAC TATCAAACTT CTTGATATCG GGGCTGGGGC AGGATTTCCT 13 860 

AGTCTACCAA TGAAAATTCT CTATCCGGAG TTAGATGTGA CCATTATTGA TTCACTCAAT 13920 

AAG CGCATCA ACTTCCTACA ACTCTTGGCT CAAGAACTGG ATTTGAACGG AGTTCATTTC 13980 

TACCACGGAC GTGCCGAAGA TTTTGCCCAA GACAAGAACT TCCGTGCTCA ATATGATTTT 14040 

GTAACAGCTC GTGCGGTTGC CCGTATGCAG GTCCTATCTG AATTGACTAT TCCCTACCTT 14100 

AAGGTTGGTG GCAAACTATT AGCACTCAAG GCTAGCAATG CGCCTGAGGA ATTATTAGAA 14160 

GCTAAGAATG CCCTCAATCT CCTTTTTAGT AAGGTCGAAG ACAATCTCAG TACGCCCTAC 14220 

CGAATAGAGA TCCGCGCTAT ATCACAGTGG TAGAAAAGAA AAAAGAAACA CCAAATAAAT 14280 

ATCCACGTAA GGCTGGTATG CCAAATAAAC GCCCACTTTA AATTTTTTAG TAAACAAATG 14340 

TTTACAAAAT CAGCCTCGCT CTTTTATTTC TAGGCTCGGG AAAAAATGAT TTACAAAATC 144 00 

AGCCTCGCTC TTTTATTTCT AGGCTCGGGA AAAAATGATT TACAAAATCA TTTTTTTCTG 14460 
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CTATACTATC CTAAGCAAAG GTTTTTAATG TCATCCCGTG AGGTGACGAA GACGCAGAAA 14520 

TATTTAAAAC TCTTTAAAAT CTAAATTTTA AAGAAGTCTT ACTCTGAGGG CCTATTGCTG 14580 

TAAAATAATG GGCTCTTTTT TGATGCCCAA AAGTGAGGTT TATATGAAAC AAGAATCAAC 14640 

TGTTGATTTG TTAC 14654 



(2) INFORMATION FOR SEQ ID NO: 107: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

AGAAAAATCT GCTTTACAGA AAATAAAAAT AATAGGAGAA AATCTATGTC AGATTTGAAA 60 

AAATACGAAG GTGTCATTCC AGCCTTCTAC GCATGTTATG ATGATCAAGG AGAAGTAAGC 120 

CCAGAACGTA CGCGTGCCTT GGTTCAATAC TTCATTGATA AAGGTGTTCA AGGTCTTTAT 180 

GTCAATGGTT CTTCTGGTGA ATGTATCTAC CAAAGCGTTG AAGATCGCAA GTTGATTTTG 240 

GAAGAAGTCA TGGCGGTAGC AAAGGTAAAT TGACCATTAT TGCCCATGTT GCTTGCAATA 300 

ATACTAAAGA TAGTATGGAA CTTGCTCGCC ATGCTGAAAG CTTGGGAGTA GATGCTATTG 360 

CAACGATTCC ACCAATTTAT TTCCGCTTGC CAGAATACTC AGTTGCCAAA TACTGGAACG 420 

ATATCAGTTC TGCAGCTCCA AACACAGACT ACGTGATTTA CAACATTCCT CAATTGGCAG 480 

GGGTTGCTTT GACTCCAAGC CTTTACACAG AAATGTTGAA AAATCCTCGT GTTATCGGTG 540 

TGAAGAACTC TTCTATGCCA GTTCAAGATA TCCAAACCTT TGTCAGCCTT GGTGGAGAAG 600 

ACCATATCGT CTTTAATGGT CCTGATGAGC AGTTCCTAGG AGGACGCCTC ATGGGGGCTA 660 

GGGCTGGTAT CGGTGGTACT TATGGTGCTA TGCCAGAACT CTTCTTGAAA CTCAATCAGT 720 

TGATTGCGGA TAAGGACCTA GAAACAGCGC GTGAATTGCA GTATGCTATC AACGCAATCA 780 

TTGGTAAACT CACTTCTGCT CATGGAAATA TGTACGGTGT CATCAAAGAA GTCTTGAAAA 840 

TCAATGAAGG CTTGAATATT GGATCTGTTC GTTCACCATT GACACCAGTG ACTGAAGAAG 900 

ATCGTCCAGT TGTAGAAGCG GCTGCTGCCT TGATTCGTGA AACCAAGGAG CGCTTCCTCT 960 

AATCTAAAAG GAGGTATTTA TGACATATTA CGTTGCAATT GATATCGGTG GAACCAACAT 1020 

CAAGTATGGT TTGGTTGATC AAGAGGGGCA ACTTCTTGAA TCGCATGAAA TGCCAACTGA 1080 

GGCGCATAAG GGTGGACCTC ATATCTTACA AAAGACCAAA GATATCGTAG CTAGTTATTT 1140 

AGAAAAAGGC CCAGTAGCAG GTGTTGCCAT ATCTTCTGCT GGGATGGTGG ATCCGGATAA 1200 
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GGGTGAGATT TTCTATGCTG GGCCGCAAAT CCCTAACTAC GCAGGCACCC AGTTCAAAAA 1260 

GGAAATCGAA GAAAGCTTTA CTATTCCTTG TGAGATTGAA AATGATGTCA ACTGTGCAGG 1320 

TCTTGCTGAG GCAGTATCTG GTTCAGGCAA GGGAGCAAGT GTGACACTTT GCTTGACCAT 1380 

TGGAACCGGT ATCGGTGGTT GCTTGATTAT GGATAGGAAA GTCTTCCATG GTTTTAGCAA 1440 

TTCAGCCTGT GAAGTCGGGT ATATGCATAT GCAGGATGGA GCTTTTCAAG ACTTGGCTTC 1500 

TACAACAGCT TTAGTGAAAT ATGTAGCTGA AGCCCATGGA GAAGATGTTG ATCAGTGGAA 1560 

TGGCCGTAGA ATTTTCAAAG AAGCCACTGA AGGAAACAAA ATCTGCATGG AAGGTATTGA 1620 

CCGTATGGTT GACTATCTAG GAAAAGGTCT GGCAAATATT TGCTACGTTG CCAATCCAGA 1680 

AGTGGTTATT CTTGGTGGTG GTATCATGGG GCAAGAGGCT ATCCTCAAAC CTAAGATCCG 1740 

TACAGCCTTG AAAGAGGCTT TGGTACCAAG TTTAGCAGAA AAAACACGAT TAGAATTTGC 1800 

CCATCACCAA AATACAGCAG GGATGTTGGG TGCATATTAT CATTTTAAGA CAAAACAATC 1860 

CTAGTTTGGC TCAGCCAAAC TAGGATTTTC TTACACGTTT TTGTCTACGA TAGCCGTTGA 1920 

GTTTTTTATT TTCCCAGTAG CTATTAAAGA TTTTTTCCTT GCTTTCGCGA TTGATTTCCA 1980 

AAAAGTAGGC ATAAATCAAA TCGATAAAGA AGAGCATAGG AAGTTGAGCG GATATTCGTT 2040 

GGATATAGGA GGGTTGGCTG TGGGTGGCTA CAAGAACAGT CTCTGTATAG GTCTGGCTAT 2100 

CTTTATTGGG AACACTTGTA AAGAGTACAG TCTTTGCCCC CATCTCCTTA GCATCTAATA 2160 

GACTATCTAA AATAGAAGGA GTTGAGCCTG AAAGTGAGAA GCCCAGTACT AGACAATTTT 2220 

CATCCATGAT GCTGGTTGTC CAGGCAAAGC CGTCTTGGTC TGTCAAAGCT TCGCAGACCA 2280 

CACCTAGTCG CATAAAACGT AATTTCATTT CACGGGCGAC GAGGCCAGAA CTCCCTGTTC 2340 
CAAAGAAGTA GATACGCTCA GCATCTTCGA TTAGCTGGGC AATTCGTTCT AGTTGGATTT . 2400 

CGTCAATCAA GTCTTGTGTT TGTTCCCTCA TATTGCTATA ACTTCTGAGG ACTCGTTTGG 2460 

TCAGTGGACT GTGCTTGGAG ACTTGGTTGG CTTGATTTTC TGCCTGATGT TGGTATTGGA 2520 

AAATAAATTC TCGGTAGCCA GTAAAGCCAC ACTTTTTAGC AAAGCGGGTC AAAGCAGCTT 2580 

GAGAAATATG TAATTTTTGG GTGACTTGTT GAGAAGATAA ATCATCTGTA ATCGTTTCAG 2640 

CTTGCAAAAA ATAGCGAGCG ATTTCTTGTT CTAGGTCTGT CATTTCTTCA AAATGTGAAT 2700 

CAATGATAGT TGCGATATCT GGTTTGTCCA TAGGGAAAGC TCCTTTACAT GAGTCATACT 2760 

GGAAGACTAG ATCAGAGAAT AGTCACACTT CATTATAACA CATAATATAA GGATAGATAA 2820 

ATAAAAACGC ATCTCTGTTT TAAAAACGAA AAAATCGAAA AAGCTTCTCT CTTTTCCATA 2880 

ATTTTCTACT CAAATTGTGG TACAATTAAG AGTAAGATTT TAAGTTAGAA ATGAGACTGA 2940 
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TTTGTATGAG AAAATTTAAC AGCCATTCGA TTCCGATTCG GCTTAATTTA TTGTTTTCAA 3000 

TCGTCATTTT ACTCTTTATG ACCATTATTG GTCGTTTGTT GTATATGCAG GTTTTGAACA 3060 

AGGATTTTTA CGAAAAAAAG CTAGCTTCAG CTAGTCAGAC CAAGATTACA AGCAGTTCAG 3120 

CCCGTGGGGA AATTTATGAT GCTAGTGGAA AACCTTTGGT AGAAAATACG TTAAAGCAGG 3180 

TTGTTTCCTT TACGCGTAGC AATAAAATGA CGGCTACAGA CTTAAAAGAA ACAGCTAAAA 3240 

AGTTACTGAC TTATGTGAGC ATCAGTTCTC CAAATTTGAC AGAACGCCAG CTGGCGGATT 3300 

ACTATTTGGC TGATCCTGAA ATCTATAAAA AAATAGTGGA AGCTCTCCCA AGTGAGAAAC 3360 

GCTTGGATTC AGATGGCAAT CGTCTATCCG AATCAGAACT GTATAACAAT GCGGTCGATA 3420 

GTGTACAAAC GAGTCAACTA AACTATACAG AGGATGAAAA GAAAGAAATC TATCTTTTTA 3480 

GTCAGTTAAA TGCTGTTGGA AACTTTGCGA CAGGAACCAT TGCGACAGAT CCTCTAAATG 3540 

ATTCTCAGGT GGCTGTTATT GCCTCTATTT CAAAGGAGAT GCCTGGCATT AGTATTTCTA 3600 

CTTCTTGGGA TAGAAAGGTT TTGGAAACTT CCCTTTCTTC TATAGTTGGG AGTGTATCCA 3660 

GTGAAAAAGC TGGTCTCCCA GCGGAAGAAG CAGAAGCCTA TCTTAAAAAA GGCTATTCTC 3720 

TAAATGACCG TGTAGGAACC TCCTATTTGG AAAAGCAATA TGAAGAGACC TTACAAGGAA 3780 

AACGCTCGGT AAAAGAAATC CATCTGGATA AATATGGCAA TATGGAAAGC GTGGATACAA 3840 

TTGAGGAAGG TAGTAAGGGA AACAATATCA AACTGACCAT TGATTTGGCT TTCCAAGATA 3900 

GCGTGGATGC TTTACTGAAA AGTTATTTCA ATTCTGAGCT AGAAAATGGT GGAGCCAAGT 3960 

ATTCTGAAGG TGTCTATGCA GTCGCCCTTA ACCCAAAAAC AGGTGCGGTT TTGTCTATGT 4020 

CAGGGATTAA ACATGACTTG AAAACGGGAG AGTTGACGCC TGATTCCTTG GGAACGGTAA 4080 

CCAATGTCTT TGTTCCAGGT TCGGTTGTCA AGGCGGCGAC CATCAGCTCA GGTTGGGAAA 4140 

ATGGAGTCTT GTCAGGAAAC CAGACCTTGA CAGACCAGTC CATTGTCTTC CAAGGTTCAG 4200 

CTCCCATCAA TTCTTGGTAT ACTCAGGCTT ACGGTTCATT CCCTATCACA GCGGTCCAAG 4260 

CTCTGGAGTA TTCATCAAAT ACCTATATGG TCCAAACAGC CTTAGGTCTT ATGGGGCAAA 4320 

CCTATCAACC CAATATGTTT GTCGGCACCA GCAATCTAGA GTCTGCTATG GAGAAACTGC 4380 

GTTCAACCTT TGGCGAATAT GGCTTGGGTA CTGCGACAGG AATTGACCTA CCAGATGAAT 4440 

CTACTGGATT TGTTCCCAAA GAGTATAGCT TTGCTAATTA CATTACTAAT GCCTTTGGGC 4500 

AGTTTGATAA CTATACGCCG ATGCAGTTGG CTCAGTATGT AGCAACTATT GCAAATAATG 4560 

GTGTTCGTGT GGCTCCTCGT ATTGTTGAAG GCATTTATGG TAATAATGAT AAGGGAGGAC 4620 

TGGGTGACTT GATTCAGCAA CTGCAACCGA CAGAGATGAA TAAGGTCAAT ATATCCGACT 4680 

CCGATATGAG CATCTTGCAC CAAGGTTTTT ATCAGGTTGC CCATGGTACT AGTGGATTGA 4740 
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CAACTGGACG TGCCTTTTCA AATGGTGCCT TGGTATCCAT TAGCGGAAAA ACAGGTACAG 4800 

CCGAAAGCTA TGTGGCAGAT GGTCAGCAAG CAACCAATAC CAATGCGGTG GCCTATGCCC 4860 

CATCTGATAA TCCCCAAATC GCTGTCGCAG TGGTCTTTCC TCATAATACC AATCTAACAA 4920 

ATGGTGTAGG ACCTTCCATT GCGCGTGACA TTATCAATCT GTATCAAAAA TACCATCCAA 4980 

TGAATTAGAA AGGAAATTAT GCTTTATCCA ACACCTATTG CCAAGTTGAT TGACAGTTAT 5040 

TCTAAGTTAC CAGGTATCGG GATTAAGACG GCTACGCGTC TGGCCTTTTA TACGATTGGG 5100 

ATGTCTGCTG ATGATGTCAA TGAATTTGCA AAAAATCTCC TTTCTGCTAA GAGAGAATTG 5160 

ACATATTGTT CTATTTGTGG ACGTTTGACA GACGACGATC CTTGTTCTAT CTGTACTGAT ' 5220 

CCGACTCGTG ACCAGACAAC AATTTTAGTT CTTGAGGATA GTAGAGATGT GGCAGCCATG 5280 

GAAAATATCC AAGAATACCA TGGACTCTAT CATGTCCTTC ATGGCCTCAT TTCTCCTATG 5340 

AATGGTATCA GTCCGGACGA TATCAATCTC AAGAGCCTTA TGACTCGTCT TATGGATAGT 5400 

GAGGTTTCAG AAGTGATTGT GGCGACTAAT GCTACAGCGG ATGGTGAAGC GACTTCCATG 54 60 

TATCTTTCAC GTTTGCTCAA GCCGGCTGGT ATCAAGGTTA CGCGTCTAGC ACGAGGTCTC 5520 

GCTGTGGGAG CGGACATTGA GTATGCGGAC GAAGTGACAC TCTTACGAGC CATTGAAAAT 5580 

CGGACAGAGT TGTAAGTGTA GGCAAATTTA CGAACTCCAT TCATTTATAA AAAATCAAAG 5640 

AGGCTGAAAA TCGTTCCTAT CGGCCTCTTT TTGTATAGTG TGATGAGTAG GCTCAGGTTC 5700 

AAGTTTTAAA AAACCAAGCA AATATGATAT ACTAAAGAGC GAGTATTCTA GTAGAATTAG 5760 

GACAAATAAT ATGAAACAAA CGATTATTCT TTTATATGGT GGACGGAGTG CGGAACGCGA 5820 

AGTCTCTGTC CTTTCAGCTG AGAGTGTCAT GCGTGCGGTC GATTACGACC GTTTCACAGT 5880 

CAAGACTTTC TTTATCAGTC AGTCAGGTGA CTTTATCAAA ACACAGGAAT TTAGTCATGC 5940 

TCCGGGGCAA GAAGACCGTC TCATGACCAA TGAAACCATT GATTGGGATA AGAAAGTTGC 6000 

ACCAAGTGCT ATCTACGAAG AAGGTGCAGT GGTCTTTCCA GTCCTTCACG GG CC AATGGG 6060 

AGAAGATGGC TCTGTTCAAG GATTCTTGGA AGTTTTGAAA ATGCCTTACG TTGGTTGCAA 6120 

CATTTTGTCA TCAAGTCTTG CCATGGATAA AATCACGACT AAGCGTGTTC TGGAATCTGC 6180 

TGGTATTGCC CAAGTTCCTT ATGTGGCTAT CGTTGAAGGC GATGATGTGA CTGCTAAAAT 6240 

CGCTGAAGTG GAAGAAAAAT TGGCTTATCC AGTCTTCACT AAGCCGTCAA ACATGGGGTC 6300 

TAGTGTCGGT ATTTCTAAGT CTGAAAACCA AGAAGAACTC CGTCAAGCCT TAAAACTTGC 6360 

CTTCCGATAT GACAGCCGTG TCTTGGTTGA GCAAGGAGTG AATGC 6405 
(2) INFORMATION FOR SEQ ID NO: 108: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11309 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

CGAGCTCGGG TACCGGGATT TTAAGGAGTT TGATATGTAT AACCTATTAT TAACCATTTT 60 

ATTAGTATTA TCTGTTGTGA TTGTGATTGC AATTTTCATG CAACCAACCA AAAACCAATC 120 

CAGCAATGTA TTTGATGCCA GTTCAGGTGA TTTGTTTGAA CGCAGTAAAG CTCGCGGTTT 180 

TGAAGCTGTA ATGCAGCGTT TGACAGGGAT TTTAGTCTTT TTCTGGCTAG CCATTGCCTT 240 

AGCATTGACG GTATTATCAA GTAGATAAGA AAATAATGGG CAGGACTAGG TCTTTGCCTC 300 

TTTTTATTTT TAAAGGATGT TTGAGAAGGT TTTACAGTAA AAGAAAATTA AAAAATCTAG 360 

AAAGAAAATA TGAAAGATAG AATAAAAGAA TATTTACAAG ACAAGGGAAA GGTGACTGTT 420 

AATGATTTGG CTCAGGCTTT GGGAAAAGAC AGTTCCAAGG ATTTTCGTGA GTTGATTAAA 480 

ACCTTGTCCT TAATGGAAAG AAAGCACCAA ATTCGTTTTG AAGAAGATGG TAGTCTGACA 540 

TTAGAAATTA AGAAAAAACA TGAGATTACC CTCAAGGGGA TTTTTCATGC CCATAAAAAT 600 

GGCTTTGGCT TTGTTAGTCT GGAAGGCGAG GAGGACGACC TTTTTGTAGG GAAAAATGAT 660 

GTCAACTATG CTATTGATGG TGATACCGTC GAGGTAGTGA TTAAGAAAGT CGCTGACCGC 720 

AATAAGGGAA CAGCAGCAGA AGCCAAAATT ATTGATATCC TAGAACACAG TTTGACAACA 780 

GTTGTCGGGC AAATCGTTCT GGATCAGGAA AAACCTAAGT ATGCTGGCTA TATTCGTTCA 840 

AAAAATCAGA AAATCAGTCA ACCGATTTAT GTTAAGAAAC CAGCCCTAAA ATTAGAAGGA 900 
ACAGAAGTTC TCAAAGTCTT TATCGATAAA TACCCAAGCA AGAAACATGA TTTCTTTGTC . 960 

GCGAGTGTTC TCGATGTAGT GGGACACTCA ACGGATGTCG GAATTGATGT TCTTGAGGTC 1020 

TTGGAATCAA TGGACATTGT ATCCGAGTTT CCAGAAGCTG TTGTTAAGGA AGCAGAAAGT 1080 

GTGCCTGATG CTCCGTCTCA AAAGGATATG GAAGGTCGTC TGGATCTAAG AGATGAAATT 1140 

ACCTTTACCA TTGACGGTGC GGATGCCAAG GACTTGGACG ATGCAGTGCA TATCAAGGCT 1200 

CTGAAAAATG GCAATCTGGA GTTTGGGGTT CACATCGCAG AT GTT TCTTA TTATGTGACC 1260 

GAGGGGTCTG CCCTTGACAA GGAAGCCCTT AACCGTGCGA CTTCTGTTTA CGTGACAGAC 1320 

CG AGTGGTGC CAATGCTTCC AGAACGACTA TCAAATGGCA TCTGCTCTCT CAATCCCCAA 1380 

GTTGACCGCC TGACCCAGTC TGCTATTATG GAGATTGATA AACATGGTCG TGTGGTCAAC 1440 

TATACCATTA CACAAACAGT TATCAAGACC AGTTTTCGTA TGACCTATAG CGATGTCAAT 1500 
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GATATCCTAG CTGGCGATGA AGAAAAGAGA AAAGAATATC ATAAAATTGT ATCAAGTATC 1560 

GAACTCATGG CCAAGCTTCA TGAAACTTTA GAAAACATGC GTGTGAAACG TGGAGCTCTC 1620 

AATTTTGATA CCAATGAAGC GAAGATTTTA GTGGATAAAC AAGGTAAGCC TGTTGATATC 1680 

GTTCTTCGGC AGCGTGGTAT TGCCGAGCGG ATGATTGAGT CTTTTATGTT GATGGCTAAT 1740 

N GAAACAGTTG CCGAACATTT CAGCAAGTTG GATTTGCCTT TTATCTATCG AATTCACGAG 1800 

GAGCCTAAGG CTGAAAAGGT TCAGAAGTTT ATTGATTATG CTTCGAGTTT TGGCTTGCGC 1860 

ATTTATGGAA CTGCCAGTGA GATTAGTCAG GAGGCACTTC AAGACATCAT GCGTGCTGTT 1920 
GAGGGAGAAC CTTATGCAGA TGTATTGTCC ATGATGCTTC TTCGCTCTAT GCAGCAGGCT • 1980 

CGTTATTCGG AGCACAATCA CGGCCACTAT GGACTAGCTG CTGACTATTA TACTCACTTT 2040 

ACCAGTCCAA TTCGTCGTTA TCCAGACCTT CTTGTTCACC GTATGATTCG GGATTACGGC 2100 

CGTTCTAAGG AAATAGCAGA GCATTTTGAA CAAGTGATTC CAGAGATTGC GACCCAGTCT 2160 

TCCAACCGTG AACGTCGTGC CATAGAAGCT GAGCGTGAAG TCGAAGCCAT GAAAAAGGCT 2220 

GAGTATATGG AAGAATACGT GGGTGAAGAG TATGATGCAG TTGTATCAAG TATTGTCAAA 2280 

TTCGGTCTCT TTGTCGAATT GCCAAACACA GTTGAAGGCT TGATTCACAT CACTAATCTG 2340 

CCTGAATTTT ATCATTTCAA TGAGCGTGAT TTGACTCTTC GTGGAGAAAA ATCAGGTATC 2400 

ACTTTCCGAG TGGGTCAGCA GATCCGTATC CGTGTTGAAA GAGCGGATAA AATGACTGGA 2460 

GAGATTGATT TTTCATTCGT ACCTAGTGAG TTTGATGTGA TTGAAAAAGG CTTGAAACAG 2520 

TCTAGTCGTA GTGGCAGAGG GCGTGATTCA AATCGTCGTT CGGATAAGAA GGAAGACAAG 2580 

AGAAAATCAG GACGCTCAAA TGATAAGCGT AAGCATTCAC AAAAAGACAA GAAGAAAAAA 2640 

GGAAAGAAAC CTTTTTACAA GGAAGTAGCT AAGAAAGGAG CCAAGCATGG CAAAGGGCGA 2700 

GGGAAAGGTC GTCGCACAAA ATAAAAAGGC ACGCCACGAC TATACAATCG TAGATACGCT 2760 

AGAGGCAGGG ATGGTCCTGA CTGGAACTGA AATCAAGAGT GTACGAGCTG CTCGAATTAA 2820 

TCTCAAGGAT GGCTTTGCTC AAGTGAAAAA TGGAGAAGTT TGGCTGAGCA ATGTTCATAT 2880 

CGCGCCTTAC GAAGAGGGCA ATATCTGGAA CCAGGAACCA GAACGTCGTC GTAAACTCCT 2940 

GCTCCATAAA AAGCAAATTC AAAAATTGGA ACAAGAGATC AAAGGGACAG GAATGACCTT 3000 

AGTTCCCCTT AAGGTCTATA TAAAAGATGG CTACGCTAAG CTTCTTTTAG GACTTGCCAA 3060 

AGGGAAGCAT GACTATGACA AACGGGAGTC TATCAAACGT CGTGAGCAAA ATCGAGATAT 3120 

CGCGCGTGTG ATGAAAGCTG TTAATCAGCG ATAAAAAGAG GAATTGAAAA TGGAAAAATT 3180 

AGTTGCCTAT AAACGCATGC CTTTGTGGAA TAAACAAACA ATGCCTGAAG CTGTTCAGCA 3240 
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AAAGCACAAT 


ACAAAAGTTG 


GGACTTGGGG 


GAAAATTACT 


GTCTTGAAGG 


GAGCTCTCAA 


3300 


GTTTATTGAA 


TTGACAGAAG 


AAGGGGAAGT 


TCTAGCTGAA 


CACCTCTTTG 


AAGCAGGGGC 


3360 


AGACAATCCA 


ATGGCCCAAC 


CTCAAGCCTG 


GCACCGAGTG 


GAAGCTGCCA 


CAGATGATGT 


3420 


GGAATGGTAC 


TTGGAATTTT 


ATTGTAAACC 


TGAGGATTAT 


TTTGCTAAAA AATACAATAC 


3480 


CAATCCTGTT 


CATTCAGAGG 


TCCTAGAGGC 


CATGCAGACA 


GTGAAACAAG 


GGAAAGCTTT 


3540 


GGATTTGGGT 


TGTGGTCAGG 


GGCGTAATTC 


TCTTTTTCTA 


GCCCAGCAAG 


ATTTTGATGT 


3600 


GACGGCTGTA 


GATCAAAATG 


GACTAGCTCT 


TGAAATCTTG 


CAAAGCATTG 


TGGAGCAGGA 


3660 


AGATTTGGAC 


ATGCCTGTTG 


GCCTTTACGA 


TATCAATTCA 


GCTAGCATTG 


AACAAGAATA 


3720 


TGATTTTATC 


GTTTCAACAG 


TTGTTC TCAT 


GTTTCTACAA 


GCGGACCGCA 


TTCCAGCTAT 


3780 


TATTCAAAAT 


ATGCAGGAGA 


AAACCAGTGT 


TGGTGGTTAC 


AACCTTATCG 


TTTGTGCCAT 


3840 


GGACACGGAG 


GATTATCCTT 


GCTCGGTTAA 


CTTCOCATTC 


ACCTTTAAAG 


AAGGAGAACT 




GGCAGACTAT 


TACAAGGATT 


GGGAATTGGT 


TAAGTACAAT 


GAAAATCCAG 


GCCATTTGCA 


J3DU 


CCGTCGCGAT 


GAGAATGGCA 


ATCGTATTCA 


ACTACGCTTT 


GCGACCTTAC 


TAGCTAAGAA 


4020 


AATCAAGTAA 


ACACACATGA 


AGATTAGGAA 


TTTTCCTGAT 


CTTTTTTCTT 


TTTTACGAAT 


4080 


GATATAGAAA 


AGGAGGGAAT 


TCATGTTTGT 


TGCGAGAGAT 


GCTAGGGGAG 


AATTGGTAAA 


4140 


TGTGTTAGAG 


GATAAACTTG 


AGAAGCAAGC 


ATACACCTGC 


CCAGCTTGTG 


GAGGCCAGCT 


4200 


CCATTTGCGT 


CAAGGACCAA 


GTGTACGGAC 


GCATTTTGCC 


CATAAATCCT 


TAAAAGACTG 


4260 


TGATTTTTTC 


TTTGAAAATG 


AAAGTCCAGA 


ACACCTGGCC 


AATAAGGAAT 


CCCTCTATCA 


4320 


* CTGGTTGAAA 


AAAGAGACAA 


AGGTTCAATT 


AGAGTACCCG 


CTTTCAGAAC 


TTAAACAGAT 


4380 


TGCGGATGTA 


TTTGTAAATG 


GCAATCTAGC 


TCTAGAAGTT 


CAGTGTAGTC 


CCTTGCCTCA 


4440 


GAAAGTCCTT 


AAAGAGCGAA 


GTGAGGGCTA 


TCGTAGTCAG 


GGTTACCAAG 


TACTGTGGTT 


4500 




AAACTGTGGC 


TCAAGGAGCG 


TTTGACTCGT 


CTACAGCAAG 


GTTTTCTTTA 


4560 


TTTCAGTCAA 


AACATGGGCT 


TTTATGTTTG 


GGAATTAGAC 


AAGGAAAAAC 


AAGTTTTAAG 


4620 


ACTCAAATAC 


CTGATTTACC 


AGGATCTCCG 


CGGTAAACTC 


CATTATCAAA 


TCAAGGAATT 


4680 


TTCCTATGGT 


CAAGGTAGTT 


TATTGGAAAT 


ATTGCGTCTT 


CCCTATAAGA 


GACAAAAAAT 


4740 


ATCTCATTTT 


ACAGTTTCTG 


AGGACAAGGA 


CATCTGTCGC 


TATATCCGGC 


AACAACTTTA 


4800 


TTATCAAAAT 


CTCTTTTGGA 


TGAAAGAACA 


AGCAGAAGCC 


TATCAAAAGG GAGAAAATAT 


4860 


CCTGACTTAT 


GGACTGAAAG 


AATGGTATCC 


ACAAATTCGA 


CCAATAGTGG 


GCAAATTTTT 


4920 


CCAGATTGAA 


CAAGACTTGA 


CTAGCTATTA 


TCAGCACTTT 


TATACCTATT 


ACCAAAAAAA 


4980 


TCCTCAAAAT 


GATTGGCAAA AGCTTTATCC 


ACCAGCCTTT 


TATCAGCAAT 


ATTTCTTGAA 


5040 
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AAATATGGTA GAATAGAAAG GATGGAGGAA TCTAATGGTA TTACAAAGAA ATGAAATAAA 5100 

TGAAAAAGAT ACATGGGATC TATCAACGAT CTACCCAACT GACCAGGCTT GGGAAGAAGC 5160 

CTTAAAAGAT TTAACAGAAC AATTGGAGAC AGTAGCCCAG TATGAAGGCC ATCTCTTGGA 522 0 

TAGTGCGGAT AACCTACTAG AAATCACTGA ATTTTCTCTT GAAATGGAAC GCCAGATAGA 5280 

GAAGCTTTAC GCTTATGCTC ATATGAAGAA TGACCAGGAT ACACGTGAAG CTAAGTATCA 5340 

AGAGTACTAT GCCAAGGCCA TGACACTCTA CAGCCAGTTA GACCAAGCCT TTTCATTCTA 5400 

TGAGCCTGAA TTTATGGAGA TTAGCGAAAA GCAGTATGCT GACTTTTTAG AAGCTCAACC 5460 

AAAGCTGCAG GTTTATCAAC ACTATTTTGA CAAGCTTTTG CAAGGCAAGG ATCACGTTCT 5520 

TTCACAACGT GAAGAAGAAT TATTGGCTGG AGCTGGAGAA ATCTTTGGTT CAGCAAGTGA 5580 

AACCTTCGCT ATCTTGGACA ATGCGGATAT TGTGTTCCCT TATGTCCTAG ACGATGATGG 5640 

TAAAGAAGTT CAGCTATCTC ATGGGACTTA CACACGTTTG ATGGAGTCTA AAAAACGTGA 5700 

GGTTCGCCGT GGTGCCTATC AAGCTCTTTA TGCG ACTTAC GAACAATTCC AACACACCTA 5760 

TGCCAAAACC TTGCAAACCA ATGTTAAGGT GCAAAATTAC CGTGCTAAAG TTCGTAACTA 5820 

CAAGAGTGCT CGTCATGCAG CCCTCGCAGC GAATTTTGTT CCAGAAAGTG TTTATGACAA 5880 

TTTGGTAGCA GCAGTTCGCA AGCATTTGCC ACTCTTACAT CGCTATCTTG AGCTTCGTTC 5940 

AAAAATCTTG GGGATTTCAG ATCTCAAGAT GTACGATGTC TACACACCGC TTTCATCTGT 6000 

TGAATACAGT TTTACCTACC AAGAAGCCTT GAAAAAAGCA GAAGATGCTT TGGCAGTCTT 6060 

GGGTGAGGAT TACTTGAGCC GTGTTAAACG TGCCTTCAGC GAGCGTTGGA TTGATGTTTA 6120 

CGAAAATCAA GGCAAGCGTT CAGGTGCCTA CTCTGGTGGT TCTTATGATA CCAATGCCTT 6180 

TATGCTTCTC AACTGGCAAG ACAATCTGGA CAATCTCTTT ACTCTTGTTC ATGAAACAGG 6240 

TCACAGTATG CATTCAAGCT ATACTCGTGA AACTCAGCCT TATGTTTACG GGGATTACTC 6300 

TATCTTTTTG GCTGAGATTG CCTCAACTAC CAATGAAAAT ATCTTGACGG AGAAATTATT 6360 

GGAAGAAGTG GAAGACGACG CAACACGCTT TGCTATTCTC AATAACTTCC TAGATGGTTT 6420 

CCGTGGAACA GTTTTCCGCC AAACTCAATT TGCTGAGTTT GAACACGCCA TTCACCAAGC 6480 

AGATCAAAAT GGGGAGGTCT TGACAAGCGA TTTCCTAAAT AAACTCTACG CAGACTTGAA 6540 

CCAAGAGTAT TATGGTTTGA GTAAGGAAGA CAATCCTGAA ATCCAATACG AGTGGGCTCG 6600 

CATTCCACAC TTCTACTATA ACTACTATGT ATATCAATAT TCAACTGGCT TTGCGGCCGC 6660 

CTCAGCCTTG GCTGAAAAAA TTGTCCATGG TAGTCAAGAA GACCGTGACC GCTATATCGA 6720 

CTACCTCAAG GCAGGTAAGT CGGACTATCC ACTTAATGTC ATGAGAAAAG CTGGTGTTGA 6780 
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TATGGAGAAG GAAGACTACC TCAACGATGC CTTTGCAGTC TTTGAACGCC GTTTAAATGA 6840 

GTTTGAAGCC CTTGTTGAAA AATTAGGATT GGCATAAAAT GGTTGAATCG TATAGTAAGA 6900 

ATGCTAACCA TAACATGCGT CGTCCTGTCG TCAAAGAAGA AATTGTAGAC TTGATGCGTC 6960 

AGCGTCAAAA GCAGGTCACAT GGTTTCTTGA AAGAATTGGA AGACTTTGCC CGCAAGGAAA 7020 

ATATTCCTAT TATTCCCCAT GAAACGGTTG CTTATTTCCG TTTTCTTATG GAAACCATGC 7080 

AGCCTAAAAA TATTCTGGAA ATTGGGACGG CTATCGGTTT TTCAGCTCTC TTGATGGCTG 7140 

AACATGCGCC AAATGCTAAG ATTACAACTA TTGATCGTAA TCCAGAAATG ATTGGTTTTG 7200 

CCAAGGAAAA TTTTGCCCAG TTTGACAGTC GCAAGCAAAT CACTCTCCTA GAGGGAGATG 7260 

CGGTGGATGT CTTATCTACA CTGACAGAGT CTTATGATTT CGTCTTTATG GATTCTGCCA 7320 

AGTCTAAATA CATCGTCTTT CTGCCAGAAA TCCTCAAACA TTTGGAAGTT GGTGGTGTGG 7380 

TTGTCTTGGA TGATATTTTT CAAGGTGGTG ATGTTGCCAA GGATATTATG GAAGTCCGTC 7440 

GTGGTCAGCG AACCATTTAT CGAGGCCTTC AAAAATTATT TGATGCAACC TTAGACAATC 7500 

CAGAACTCAC CGCAACATTA GTGCCTTTAG GAGATGGTAT TCTCATGCTT CGTAAAAATG 7560 

TAGCAGATGT TCAACTGTCT GAAAGCGAAT GATTTTCAGA AAAATTTAAG AAAAAATAGT 7620 

AAAATAGATA GAGTAACACT TATCTCAAAG GAGTAGACAT GAAGAAAAAA TTATTGGCAG 7680 

GTGCCATCAC ACTATTATCA GTAGCAACTT TAGCAGCTTG TTCGAAAGGG TCAGAAGGTG 7740 

CAGACCTTAT CAGCATGAAA GGGGATGTCA TTACAGAACA TCAATTTTAT GAGCAAGTGA 7800 

AAAGCAACCC TTCAGCCCAA CAAGTCTTGT TAAATATGAC CATCCAAAAA GTTTTTGAAA 7860 

AACAATATGG CTCAGAGCTT GATGATAAAG AGGTTGATGA TACTATTGCC GAAGAAAAAA 7920 

AACAATATGG CGAAAACTAC CAACGTGTCT TGTCACAAGC AGGTATGACT CTTGAAACAC 7980 

GTAAAGCTCA AATTCGTACA AGTAAATTAG TTGAGTTGGC AGTTAAGAAG GTAGCAGAAG 8040 

CTGAATTGAC AGATGAAGCC TATAAGAAAG CCTTTGATGA GTACACTCCA GATGTAACGG 8100 

CTCAAATCAT CCGTCTTAAT AATGAAGATA AGGCCAAAGA AGTTCTCGAA AAAGCCAAGG 8160 

CAGAAGGTGC TGATTTTGCT CAATTAGCCA AAGATAATTC AACTGATGAA AAAACAAAAG 8220 

AAAATGGTGG AGAAATTACC TTTGATTCTG CTTCAACAGA AGTACCTGAG CAAGTCAAAA 8280 

AAGCCGCTTT CGCTTTAGAT GTGGATGGTG TTTCTGATGT GATTACAGCA ACTGGCACAC 8340 

AAGCCTACAG TAGCCAATAT TACATTGTAA AACTCACTAA GAAAACAGAA AAATCATCTA 8400 

ATATTGATGA CTACAAAGAA AAATTAAAAA CTGTTATCTT GACTCAAAAA CAAAATGATT 8460 

CAACATTTGT TCAAAGCATT ATCGGAAAAG AATTGCAAGC AGCCAATATC AAGGTTAAGG 8520 

ACCAAGCCTT CCAAAATATC TTTACCCAAT ATATCGGTGG TGGAGATTCA AGCTCAAGCA 8580 
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GTAGTACATC AAACGAATAG TCCAAATCAA TGAGTCAGGG AAAAAACTCG ACTTCAGGAA 8640 

AAAATGAAGC AAACATTCCC ACAATAAAAC GCATAGTACA AGGTTTGTAC TGCCCCCCAA 8700 

AAAGTTAGAC AATTAATTTA TCCGAAGGAT TTAGTTCTGT ATTGCACAGA GCTAAGTCCT 8760 

TTTAGTTTTA TCTTAATTCT CTTATTGTTG TAATAATCAA TATAGTCTAT AATGGCTCGT 8820 

TCCAATTGAT TAAGTGATTT AAATGTTTTC TCATAGCCAT AAAACATTTC GGATTTTAAA 8880 

ATGCCAAAGA AAGATTCCAT CCTACCGTTG TCTTGGCTGT TGCCCTTACG TGACATGGAT 8940 

GCTTGAATTC CCTTACTCTC TAGGAAGCGA TGATAAGAAT CGTGTTGATA TTGCCAGCCT 9000 

TGGTCACTAT GGAGAATCGT ATTCTCGTAG TGCTTCTCTT TGAATGCCTG TTCCAACATT 9060 

AACGATCAAT CAATTTAATC ATGTACCTAA GATTAGAATT GTTTATCCCA AATTTATTTG 9120 

AAAGCTTCTC TAAGCTATAT CCTTGTTTTC TAAGTTCATA GATCTGAACT TTATCATCAT 9180 

AAGTTAATTT CATAATAAAA ACACCCCAAA AGTTAGATTT TTTCTGTCTA ACTTTTGGGG 9240 

TGTAGTTCAT GTACACCTGA TATGATGCGT TTTATAATTT TAAAGACTTT TTGACCAGCC 9300 

TCATTTTTTT AACTTGATAC TCAGTGAAAA GCAAAGATTA AACTAGGAAG CTAGCTGTAG . 9360 

GCTGCTCAAA GAACAGCTTT GAGGTTGTAG ATAAAACTTG TGAGGTCACC AACATATATA 9420 

ATGTGAAGCT GACGTGGTTT GAATAGATTT TAGAAGAGTA TGAGTCTGGA AGTTTTAATG 9480 

GATAATGCAA GATTCCATAG AATGGGTAAG CTAGAGTTCT TATGTGAAGA GTTTGGGCAT 9540 

AAACTTTTAC CTTTTCCTCC CTACTCATCT TAGTATAGAA AAGTGAATCT GAAATAGTAC 9600 

ATAACTGCTT CTAAAACATT CTTATAAATT GATTTAAATT CTCAAATCAT ATTATTCAGT 9660 

TCTTATTTCA TTTTGTTCTA CAATCCTGTT GAGAAGACAC GTGTTCATAT CAAAAAGGTA 9720 

TTGGCAAGTT GCAATACCTT TTTACGAGGC TCTGTTGTCT TATTTTTGTT TCAACTGACT 9780 

ATATCTCCTA TGGTTCTAGT TCAGAAGGCT AGGCTATAAT TATGATTGAT AAGAAGTATC 9840 

ATTCCAAGTA TTGGGAGTGA ATGTTTCAAA ATCATGGGTT TCTATAATGG TCAGGCTGGC 9900 

ATTTGCTAGA CCGCCATCTT TACGAAGAAG TGGTTCTTTA TAGCCTAGGA GAGTACGAAG , 9960 

ACTGGCAGTA AGATTGGCGC CGTGTCCGAC AATTAGAATA CGTTCAGCTG GACTATCTTT 10020 

TAATGATTTG ATAAATTGGA TGGTCCGTTG AGTTGTACTA TAGAGGGATT CGGCTCCGAA 10080 

CATTCGAGTG TCAAATTGAG CAAGATTTGA ACGAAAAGCC TGGATTTGTT GCGGGTAAAT 10140 

AGCTTCCAAG GTTGCAATTT TCAAACCTTC TAACTTCCCA AGTTGCCATT CACGGAGATT 10200 

AGGAACGATT TCTAAAGAAC AGGGGGTATA GAGTTGACTT TGGATAATCT CAGCAGATTT 10260 

GACCGCTCGA GGTAAATCAC TTGAATAAAT CTGATCAAAA GGAATTTCCT TGAGATACTG 10320 
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ACCAAGTCGT TTTAGGGTTT CAATGGATTC AGGAAGAAGA GGAGAATCAC CACTAGCACC 10380 

TTGAAAACGA CCTTCTTGGT TCCAGAGGGT ACGACCGTGG CGGACAAAGT AGAGTTTCAT 10440 

TACTTGATGT CCTCCAAAAT ATCTACAAAG TCTGCCTTTA CAAAGCTAGC CAAGTCTTGT 10500 

GGCGCGACGA TAATGCTGTG TCCGACTTCG CCTGCAGAGA CAATCATTTG ATCCAAATCT 10560 

AGAGCAATTT TATCGATAAA AATGGGATAA TTGTGTTTCT GACGAATTCC GACAGGATTA 10620 

TTGGCTCCAT GAATGTAACC AGTTGTTTTT TCTAAGTCCT TTTGTGGAAT CATGCTCACT 10680 

TTTTTATTGC CAGAAATTTT AGCTAGTTTC TTTTCAGACA AGTGCTGAGT GATAGGGACA 10740 

ATTCCGATAA TCGGTCCGGT CTTGTCTCCC AAAAGCGCCA AGGTTTTGAA AATCTGATCT 10800 

CGTTCATAAC CTTGAGGAAG CTCTCCTTCT AGGGCATTGA TTTGAATCCC CTGATGAGGG 10860 

ATAGCTGCTT TAGATAGGAT TTGTTCCACC AATGTTTTTT TGATTTTAAC TTTTTTTGCC 10920 

ATTATTTATA TTTATCCTCC AATTGACTCA TCCAAATACC AAGCCAGATT CCCAGCGCAA 10980 

AGAAGAAGGC GATGATGACA TAACCGACAA GTGAAAGTCC TGTGTATTGG ATACTTTCAG 11040 

CGTTTCCTGC ATTTGGAATT AAGATCAAAA GGGTACTTGA TAGGACGATA CCGATGATGA 11100 

AATGATAGAC GAACTGTTTA CGGAGTTCTT CTAGTTCTCC GTCCGTCCAA GCGTAGGCCA 11160 

CTTCTTCTTT CTTGCCTTTA CCTTTGGACA TCTTGTAAAG AGGTGGGAGG GCAATATAGA 11220 

CATGACCTGC CTCGACTAGC GGACGCATGT AACGGTAGAA AAATGTCAAG AGCAAGGTCT 11280 

GGATATGGGC ACCGTCGGTA TCCGCATCG 11309 
(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5548 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

CCATAGTCTA ACAAGTCTTT GTAAAGGTTT ATCCCTGATT CATGTAAAGA TTGTGTAAAG 60 

AATCAAAAAA AGCCACTTTT GAAAAATGGC TGCTCCTAAA AATAGCTTTA AAAATTATTA 120 

GTCCTGTGCG AAAGATTGGT TAGGAAGAAA AATCGTGAAG CAACTGCCTC TGCCAAGCTG 180 

ACTCGTCACC GTGACTTGGC CACCTAATAA TTGACTGAGT TCTTTGACAA TGGCAAGGCC 240 

AAGACCAGTG CCACCAGTTT GTCTGCTTCG ACCTTTATTA ACTCGGTAAA AACGTTCAAA 300 

AATACGATCC TGCTCTAATT GACTAATACC AATCCCTGTA TCTGATACAG AAATCTTAAT 360 

GCCTTCGTTC ACCTTTTGGG TCTTGACCTC AATTTTTCCC CCTTGTTCAG TGTAACGGAT 420 
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GGCATTGGAT AAAAGATTGA GTAAGATTTG GGAAAGTAAT TGACTATCTG ATACGAGGGT 480 

GACATCATCT GGCACCTGCA CCTTTAGCTG TAAATCCTTC TTCTTGAGCT GAGGTTGCAA 540 

GCTTTGAGTC AAATCCTGTA CAAATTCTGC CAAAGAAAGG GTCGTCCATT GTATAGGCAT 600 

TTGTTGAGCC TTAGATAAGG TAAGAAGATG CTCAACAATA TGCTCAAGAC GCAAACTTTC 660 

TTTGTAAATA ATGTCTAGAA AGTCATCCTT GAGCGCTTCT TCTTCAGCTG ACATCCCCTT 720 

AATGGTTTCA GCAAAGCCCT TAATCGAAGT AACTGGTGTC CTCAATTCAT GGGAGGCATT 780 

TGAGACAAAG GCTAAATTTA ACTTTTCATA AGTTCTAATC GTTGTTAAAT CATATAGCAA 840 

GACGAGCACA GCTTCCACAG ATTGGGTGGG GCTAAAAACG GGAACTGCTG TCACTTCTAA 900 

AATCAAGTCA CCCTCATGAA ACCCACTTAC TTCTTGTTTT AACCTTGTTT TTTGATCAAA 960 

GGCTTGGTGA ACTAAATTCC GAATATCCAT CCGTTTGAGG TCATCAAGTG AACTTATGTC 1020 

GCCGTCCACA TCGGGAAAAT AATGAGGCAG AGAGCGACTG GATAATAACA TCTGACCTTG 1080 

AGCGGAAACT AAAAACGTCC CCATGGTTAG GTGCGACAGA AGAACCTCCA TTGTTTCGGC 1140 

TAGATCCTTG TATTGCTGAT CCTGTTGGGA GACTTTGGTT TTTAGGCCAG ACACATACTG 1200 

AGCCAAAGAC TTTAAGTCTT CTTGCCCTTT TTCTAAAAAG TATTCACTAC TGGTCAAGAG 1260 

AGGTTGGTGC AAGGTCTCAA AAGCAACTTC CCATTTCCAA AGGCAAAAGA GCCAGTAGCC 1320 

ACCTAGTCCC AAAGAAAGGG CTAGAAGAAA GAGACCGATG CCTTTACTGA TCCAAGTTAA 1380 

TGCCATCCCT GCAATCAGAA TGAGGCTAAC ACTTAGATTG ACTAGCCAAA ATTGAAGGTA 1440 

GCGTTTCATC TATAACTCCT TGAACTTATA ACCATAACCC CGAATGGTTC GAATAAATTG 1500 

AGGGGCTTTA GGATTGTCTT CAATTTTTTC CCTCAACTTA CCAATATGAA CGTCCACCAA 1560 

ACGTGTTTCC TGCCCAAAGT CATACCCCCA GATACGTTCC AAAAGACGCT CTCTAGTCAG 1620 

TGTCATGTTG GGATGTTTCA TAAGATAGAG CAAGAGTTCA AATTCTTTTG GGGTCAAACT 1680 

CAGTAACTTA TTCGCCTTGT AGACTTCATG ACGCTCAGGG TATACTTTCA AGGTCCCAAA 1740 

TAGCCAAGAA TCGTCAGCGA TATTATCTGA ATCATCTCCT TCTTGTTCTC CTTTAGTTCG 1800 

CCTGAGGACA GCCTTGACAC GCGCCAGCAA TTCTCTAGGG CTAAAAGGCT TGGTCAGGTA 1860 

GTCATCAGCC CCTAATTCCA AGGCCAAAAC CTTATCAAAT TCATCACTTT TCGCAGAAAC 1920 

CATCATAATT GGAGTTTTGA CGCCTTTGGC TCTCAGCCGC TTACAAACTT CCATGCCATC 1980 

TAATTGTGGT AACATGATAT CAAGCAAGAT AAAATCAAAG GGTTCTGTTT CTGCCAAAGC 2040 

TAAGGCCTTC CGTCCATTTG TCACCAATTG AGTAGAAAAG CCTTCCTTAC TTAAATGGTA 2100 

GTCAAGCAAT TTCAGAATGT GTTCTTCATC ATCCACTAAT AAGACTTGTT TTGTCATCTA 2160 
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TTATCTCCTA TTGGTAACAT TATAACACAA TTATCAGAAA TCCTAACATT GCTAAATCAG 2220 

ATTAAATTTG CCTATCAAGA CTAGTATCTG GTCAAACGCT CAATCATCTC CTTGTGCTCT 2280 

GGATAGGTCG CCAGTAGATC TACCCTTTCA AATAATTCAA AATCCTCAAA TTCAAAACCA 2340 

GGAGCAACAA GACAAGAAAC CAGAGCATCA TCCTTATCAA CTGTTGATCC CCAAATAGTG 2400 

CCCTTAGGAA CACAGTAGTG AAGTTGTTGC CCTTTGGATA TGTCCAGGCC TAAAGTGACT 2460 

GCTTCGTAGT GACCATCTGC TGTAATCATG TGAACAGTAA GTGGGGATCC TGCATGAAAA 2520 

TACCAGATTT CATCTGCTGT CAATCGGTGA AAATGTGAAG GATTCGTTTC TTCTAATAAG 2580 

AAATAAATAC TGGTATAAAG CGCCCTTCCC TTACCAGCAA GGTTTATAGT GTCTGAAGCT 2640 

TTTTTTGTTT GTCTAAAATA GCCACCTTCA ATATGGGGAG CTAACTCTAG AGTTCTTATC 2700 

AAGTCTTCTT TATCCGTCGG AGCCAATGGG TTGAAGTAAC TCTTGTTCAA AGTGGTTTTA 2760 

CGATTTCAAG AACTCCTCTC AGTTCTGAGG ACACGGTAAT GATTGATGCG ACGGAAGTAC 2820 

AAATCAATCG CCCTAAAAAA AGAATTAGCG AATGATTCTG GTAAAAAAAA TGCCACGCTA 2880 

TGAAGGCTCA AGCGATTGTC ACAAGTCAAG GGAGAATTGT TTCTTTGGAT ATCGCTGTGA 2940 

ACTATTGTCA TGATATGAAG TTGTTCAAAA TGAGTCGCAG AAATATCGGA CAAGCTGGTA 3000 

AAATCTTGGC TGACAGTGGT TATCAAGGGC TCATGAAGAT ATATCCTCAA GCACAAACTC 3060 

CACGTAAATC CAGCAAACTC AAGCCACTAA CAGTTGAAGA TAAAGCCTAT AACCATGCGC 3120 

TATCCAAGGA GAGAAGCAAG GTTGAGAACA TCTTTGCCAA AGTAAAAACG TTTAAAATGA 3180 

TTTCAACAAC CTATCGAAAT CATCGTAAAC ACTTCGGATT ACGAATGAAT TTGATTGCTG 3240 

GCATTATCAA TCATGAACTA GGATTCTAGT TTTGCAGGAA GTCTATTATT TGGTTAGGTG 3300 

AATTAGTGAA GCGTTTAGGC AAGTGTCTCT GGTTACGACG TCATGGACTC TAAATCGATT 3360 

ATATTTAGGG GTCATGACTA GTGAAGCAGT TAGCTAGTTC GCATATAAGC GGCTAGCGTC 3420 

TAACAATTAG GAACTTTAGT TCCAATAACT TTAAGATTAC GACGTTTTAG GACATAAATC 3480 

GATCATATTT ATGTCCTAAA ACTAGTGAAG CGCCTAGCCA AAGTCCGAAT AGGATTTGGC 3540 

GTTAGTTACT TAGATTGCTT TGCAATCAAG TAACTTTGGC GATTTACATC TTCTCTGGCG 3600 

CTTCTACTCC AAGCAAGCGA AGGGCTTCTT TGAGAACGAC TGCGGTTGCG TAGCTGAGGG 3660 

CTAGACGGCT GTCGCGTTCT GGGCTTTCAT CCAAGATACG TGTATGTGCA TAGTATTTGT 3720 

TAAAGGATTG AGCCAGGCTA ATTGCAAATT TAGCAATGAT AGAAGGTTCA AAGTTATCTG 3780 

CCGCACGGTT GATAATACGT GGGAAGTCTT GAATGAGTTT AATGATTTCC CAGCTTTCAG 3840 

TATCATTCAA GCTATAGTTG CCAGCTGTTT CTGGTTTGAA ATCGGCTTTG CGTAAGATAG 3900 

ATTGGATACG AGCGTAGGCA TATTGAACGT AAGGTCCAGT TTCACCCTCG AAGGATACCA 3960 
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TAGCCTCTAG GTCGAAGTCG TATCCATTTG TACGGTCGGT TTTGAGGTCA TAGAATTTAA 4020 

TGGCTCCAAT CCCAACAGCA TGTGCTACTT GGTCTTTGTT TTCTAGTTCA GGATTTTTAG 4080 

CCTCGATTTG GACCTTGGCA CGGCTAACAG CCTCTGCAAC AGTAGGCTCT AGCAAGATGA 4140 

CATTCCCTTT ACGAGTAGAG AGTTTCTTCC CTTCTTTTGT AACCAAACCA AAAGGAACGT 4200 

GAGTAATGTC GTCACTCCAG TCGTAGCCCA TCTCTTGCAA GACAGCTTTG AGCTGTTTAA 4260 

AGTGGGCAGA TTGTTCTTGA CCAACGACAT AGATAGATTT AGCAAATTGG TATTCGTTTT 4320 

TACGGTAGAG GGCTGCAGCC AAGTCACGTG TGATATAGAG AGTTGCACCA TCAGACTTCT 4380 

TGATGAGGGC TGGATGTTCA ATTCCATATT TCTCAAGATT CACAACTTGG GCACCTTCTG 4440 

ATTCAAGAAG TAGTCCTTTT TCAGAAAGAA TGTCTACAAC TGCATCCATC TTATCATTGT 4500 

AGAAGGCTTC TCCGTTATAG CTGTCAAATT CAACCTTCAA TTCATTGTAA AGGCGGTTAA 4560 

ATTCCACTAA ACTTTCATCG CGGAACCATT GCCAAAGAGC GAGAGCTTCC TCATCTCCAT 4620 

TTTCAAGTTT ACGGAACCAT TCGCGCGCTT CTTCATCCAA GCTAGGGTCA TTTTCAGCTT 4680 

CAGCGTTGAT GCGGACATAG AGTTTAAGGA GTTCATCGAT TGGATGAGCT TTTACAGCTT 4740 

CTTCGTCGCC CCATTTTTTG TAGGCAACAA TCAACATCCC AAATTGTTTA CCCCAGTCTC 4800 

CCAAATGGTT GACCTTGACC GTTTGATAAC CGATTTTTTG GAAAATATGT GACAAGCTAT 4860 

CTCCGATAAC AGTTGAACGC AGGTGGCCAA TAGAAAATGG TTTAGCGATA TTCGGACTAG 4920 

ACATGTCGAT AACAACATTT TCTTGTTTAC CAATATTTTG GTCAGCATAG TGTTCTTTTT 4980 

CAGTGGTAAC AGCTTGCAAT ACTTGAGCAG AAATGGCAGA TTTATCAAGG AAAAAGTTAA 5040 

l CGTAAGGTCC TGTTGCGACA ACTTTTTCAA AGGCTTGGCT GTTCATTTTT TCAGCCAGTT 5100 

CAGCCGCAAT CATTTGTGGT GCTTTACGTT CGACTTTTGC AAGAGAAAAA GCAGGGAAAG 5160 

CAATGTCTCC CATTTCTGAG TTTTTAGGGG TTTCCAGTAA CTTTAAAATA GCCTCTTGGT 5220 

CCAGGCTATC AATGATGCTA GATAATTCGC TAGCAATCAA TTCTTTTGTA TTCATTAAGA 5280 

GCTCCTTTTT GGACTTTTCT ACTATTTTAT CACAATTTTA AAGAAAGAAG AAAAAATTTT 5340 

TGAAATCTCC TGTTTTTTTG GTATAATATG GTTATAAATA TAGTTATAAA TATGCACGCA 5400 

AGAGGATTTT ATGAGAAAAA GAGATCGTCA TCAGTTAATA AAAAAAATGA TTACTGAGGA 5460 

GAAATTAAGT ACACAAAAAG AAATTCAAGA TCGGTTGGAG GCGCACAATG TTTGTGTGAC 5520 

GCAGACAACC TTGTCTCGTG ATTTGCGG 5548 

<2) INFORMATION FOR SEQ ID NO: 110: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3132 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
t D ) , TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

TACCCGGTAG TCTTAGCAGA CACATCTAGC TCTGAAGATG CTTTAAACAT CTCTGATAAA 60 

GAAAAAGTAG CAGAAAATAA AGAGAAACAT GAAAATATCC ATAGTGCTAT GGAAACTTCA 120 

CAGGATTTTA AAGAGAAGAA AACAGCAGTC ATTAAGGAAA AAGAAGTTGT TAGTAAAAAT 180 

CCTGTGATAG ACAATAACAC TAGCAATGAA GAAGCAAAAA TCAAAGAAGA AAATTCCAAT 240 

AAATCCCAAG GAGATTATAC GGACTCATTT GTGAATAAAA ACACAGAAAA TCCCAAAAAA 300 

GAAGATAAAG TTGTCTATAT TGCTGAATTT AAAGATAAAG AATCTGGAGA AAAAGCAATC 360 

AAGGAACTAT CCAGTCTTAA GAATACAAAA GTTTTATATA CTTATGATAG AATTTTTAAC 420 

GGTAGTGCCA TAGAAACAAC TCCAGATAAC TTGGACAAAA TTAAACAAAT AGAAGGTATT 480 

TCATCGGTTG AAAGGGCACA AAAAGTCCAA CCCATGATGA ATCATGCCAG AAAGGAAATT S40 

GGAGTTGAGG AAGCTATTGA TTACCTAAAG TCTATCAATG CTCCGTTTGG GAAAAATTTT 600 

GATGGTAGAG GTATGGTCAT TTCAAATATC GATACTGGAA CAGATTATAG ACATAAGGCT 660 

ATGAGAATCG ATGATGATGC CAAAGCCTCA ATGAGATTTA AAAAAGAAGA CTTAAAAGGC 720 

ACTGATAAAA ATTATTGGTT GAGTGATAAA ATCCCTCATG CGTTCAATTA TTATAATGGT 780 

GGCAAAATCA CTGTAGAAAA ATATGATGAT GGAAGGGATT ATTTTGACCC ACATGGGATG 840 

CATATTGCAG GGATTCTTGC TGGAAATGAT ACTGAACAAG ACATCAAAAA CTTTAACGGC 900 

ATAGATGGAA TTGCACCTAA TGCACAAATT TTCTCTTACA AAATGTATTC TGACGCAGGA 960 

TCTGGGTTTG CGGGTGATGA AACAATGTTT CATGCTATTG AAGATTCTAT CAAACACAAC 1020 

GTTGATGTTG TTTCGGTATC ATCTGGTTTT ACAGGAACAG GTCTTGTAGG TGAGAAATAT 1080 

TGGCAAGCTA TTCGGGCATT AAGAAAAGCA GGCATTCCAA TGGTTGTCGC TACGGGTAAC 1140 

TATGCGACTT CTGCTTCAAG TTCTTCATGG GATTTAGTAG CAAATAATCA TCTGAAAATG 1200 

ACCGACACTG GAAATGTAAC ACGAACTGCA GCACATGAAG ATGCGATAGC GGTCGCTTCT 1260 

GCTAAAAATC AAACAGTTGA GTTTGATAAA GTTAACATAG GTGGAGAAAG TTTTAAATAC 1320 

AGAAATATAG GGGCCTTTTT CGATAAGAGT AAAATCACAA CAAATGAAGA TGGAACAAAA 1380 

GCTCCTAGTA AATTAAAATT TGTATATATA GGCAAGGGGC AAGACCAAGA TTTGATAGGT 1440 

TTGGATCTTA GGGGCAAAAT TGCAGTAATG GATAGAATTT ATACAAAGGA TTTAAAAAAT 1500 

GCTTTTAAAA AAGCTATGGA TAAGGGTGCA CGCGCCATTA TGGTTGTAAA TACTGTAAAT .1560 
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TACTACAATA GAGATAATTG 


GACAGAGCTT 


CCAGCTATGG 


GATATGAAGC 


GGATGAAGGT 


1620 


ACTAAAAGTC 


AAGTGTTTTC 


AATTTCAGGA 


GATGATGGTG 


TAAAGCTATG GAACATGATT 


1680 


AATCCTGATA 


AAAAAACTGA 


AGTCAAAAGA 


AATAATAAAG 


AAGATTTTAA AGATAAATTG 


1740 


GAGCAATACT 


ATCCAATTGA 


TATGGAAAGT 


TTTAATTCCA ACAAACCGAA TGTAGGTGAC 


1800 


GAAAAAGAGA 


1TGACTTTAA 


GTTTGCACCT 


GACACAGACA 


AAGAACTCTA 


TAAAGAAGAT 


1860 


ATCATCGTTC 


CAGCAGGATC 


TACATCTTGG 


GGGCCAAGAA 


TAGATTTACT 


TTTAAAACCC 


1920 


GATGTTTCAG 


CACCTGGTAA 


AAATATTAAA 


TCCACGCTTA 


ATGTTATTAA 


TGGCAAATCA 


1980 


ACTTATGGCT 


ATATGTCAGG 


AACTAGTATG 


GCGACTCCAA 


TCGTGGCAGC 


TTCTACTGTT 


2040 


TTGATTAGAC 


CGAAATTAAA 


GGAAATGCTT 


GAAAGACCTG 


TATTGAAAAA 


TCTTAAGGGA 


2100 


GATGACAAAA 


TAGATCTTAC 


AAGTCTTACA 


AAAATTGCCC 


TACAAAATAC 


TGCGCGACCT - 


2160 


ATGATGGATG 


CAACTTCTTG 


GAAAGAAAAA 


AGTCAATACT 


TTGCATCACC 


TAGACAACAG 


2220 


GGAGCAGGCC 


TAATTAATGT 


GGCCAATGCT 


TTGAGAAATG AAGTTGTAGC 


AACTTTCAAA 


2280 


AACACTGATT 


CTAAAGGTTT 


GGTAAACTCA 


TATGGTTCCA 


TTTCTCTTAA 


AGAAATAAAA 


2340 


GGTGATAAAA 


AATACTTTAC 


AATCAAGCTT 


CACAATACAT 


CAAACAGACC 


TTTGACTTTT 


2400 


AAAGTTTCAG 


CATCAGCGAT 


AACTACAGAT 


TCTCTAACTG 


ACAGATTAAA 


ACTTGATGAA 


2460 


ACATATAAAG 


ATGAAAAATC 


TCCAGATGGT 


AAGCAAATTG 


TTCCAGAAAT 


TCACCCAGAA 


2520 


AAAGTCAAAG 


GAGCAAATAT 


CACATTTGAG 


CATGATACTT 


TCACTATAGG 


CGCAAATTCT 


2580 


AGCTTTGATT 


TGAATGCGGT 


TATAAATGTT 


GGAGAGGCCA 


AAAACAAAAA 


TAAATTTGTA 


2640 


GAATCATTTA 


TTCATTTTGA 


GTCAGTGGAA 


GCGATGGAAG 


CTCTAAACTC 


CAGCGGGAAG 


2700 


AAAATAAACT 


TCCAACCTTC 


TTTGTCGATG 


CCTCTAATGG 


GATTTGCTGG 


GAATTGGAAC 


2760 


CACGAACCAA 


TCCTTGATAA 


ATGGGCTTGG 


GAAGAAGGGT 


CAAGATCAAA 


AACACTGGGA 


2820 


GGTTATGATG 


ATGATGGTAA 


ACCGAAAATT 


CCAGGAACCT 


TAAATAAGGG 


AATTGGTGGA 


2880 


GAACATGGTA 


TAGATAAATT 


TAATCCAGCA 


GGAGTTATAC 


AAAATAGAAA 


AGATAAAAAT 


2940 


ACAACATCCC 


TGGATCAAAA 


TCCAGAATTA 


TTTGCTTTCA 


ATAACGAAGG 


GATCAACGCT 


3000 


CCATCATCAA 


GTGGTTCTAA 


GATTGCTAAC 


ATTTATCCTT 


TAGATTCAAA 


TGGAAATCCT 


3060 


CAAGATGCTC 


AACTTGAAAG 


AGGATTAACA 


CCTTCTCCAC 


TTGTATTAAG 


AAGTGCAGAA 


3120 


GAAGGATTGA 


TT 










3132 


(2) INFORMATION FOR SEQ ID NO: 111: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14672 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

CGAGATTTCT TTAAATGAAC TACGTGAAAT CTACCCATCA TCCAGATCTG GATATTCTCT 60 

CCTATCTATA AGTAAAGTTT TAGGAGATTT TAATATAAGT TCTCATGCTT TTAAAGCTTC 120 

GGTAAGAGAT TTAAAACCGC TCAGTTTCCC ACTCATTTGC TTCTGGGAGA GTTCTCATTT 180 

TATTATTCTT GAAAAAATTA GTAAAAACAA GTTTTATATT TTAGATCCTG CAAAAGGCAG 240 

GCAGAGAATG TCAATAAGTG AATTTGAAAG GCATTATTCA AATATCATTT TAACATTTAA 300 

AAAGTTAGAT AGCTTTATGT CTCGTAAAGA TAATAAGAAG TCGCCTGTTT TAAAGTATTT 360 

TTTTAAGTAT AGGAATAAGC TAGGGATTTT ATTTTTTGTA AC AG C ATT AT TGTATGTAAT 420 

ACAATCATTA GTACCTATAG CTAATAGATA CATAATTGAC ACGAATTTCA AGGACGATTC 480 

GTATTCGTCT AGAATGTTAT TTACTATATT ATTTATATTT ACTGTTTCAT TCTCACTAAT 540 

GTATTTATTA AGACAGATAT ATGTTGCATC CTTAAAATAT ATAATGGATA AAGAGATTAG 600 

CTATGATTTT ATGAAACATT TGATATATTT ACCTTACAGT TTTTATGAAA AACGTACTTT 660 

AGGGGATATA CTTTTTAGAG CTAACTCTAT TGTTTATATA AGAGAAATAC TATCAAATAA 720 

TTTTATAGCA GCTATACTTG ATTTGTTAAT GATTGTGGTT TATGCTGTGG TTTTATTTAG 780 

CTTTTCTAAG TACATGGTAA TCTTTTTAAT ATCACTAAGT CTAGCTCTAT CTATTGTAAT 840 

GTATCCAATC ATAAAAATCT CAAAAAATTT AATTGATAAA AATATAAAAG AAAAGGTTAA 900 

TGTTCAAAAT ATTACTTCCG AAGTAATTTC TAAAAATAGT GATATTAAGC TAACTGGAGA 960 

AGAGGAATTT TGGATTAACA AATGGGATAA TTTTAATACA AAACAGCTCA TCATAGGTCG 1020 

AAAACTTGAT ATACATTTAT CAATTGTTAG TAGTATAACG AATGTTTTAC AAATTATTCT 1080 

CCCTGTTTTG ACCCTTATTG TAGGTGTAAA TATAAAAACA TTCGAACAAT TGACGTTAGG 1140 

ACAAATTGTA GCAATAAGTA CAGTCTCACC ATACTTTATT TCTCCTATAA TTTCTTTAAG 1200 

TGATAACTAT ATACAATTAA TGTTATTAAA GGGATATTTT TTAAGAATAG AGGATGTGTT 1260 

TAATACTAAA TCCGAATTAA TTCCAGAAAG AGTCAGTCAA GATATAAAAT TTGATAAAAA 1320 

. AATAGAATTA AAAGATATTT GGTATAAATA TGGATTATTT GATGATTATG TTTTGAAAGG 1380 

AATAAATGTT ACTATTAAAA AAGGAGAAAC TGTTGCTATT GTTGGAGAAT CAGGTTCAGG 1440 

TAAGAGTACA TTAGCTAAAA TTTTATTAGG TTTATTAGAA CCTAATATTG GTTCAATAGA 1500 

AGTTGATGGA GTAGAAAAAG AAGAAATTGG TCAAACATTG TATAGAAAGA TTTTTGGAGC 1560 
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AGTGTTACAA AATTCAACCC TAAGTTATGG TACCTTAAGA GAGAATTTGA CATTTGGACA 1620 

CTTTGTTTCA GATGAAGAAT TAATGACAAA TCTAAATTCA ATTGGTCTTA GCAATGTAGT 1680 

TAAATCTTTA CCTCTTGGAT TAGAGACAAT CATCGCTGAA GAAGGTAATA ACTTTTCTGG 1740 

AGGGCAGCAG CAAATGATAC TTTTAGCTCG TTGTCTTTTG TCGAAACCTT CGGTAGTTGT 1800 

TTTGGACGAA GCAACAAGTA GTTTAGATAA TTTATCTCAA CAAATTACAA CTTCTTACTT 1860 

AAGTGAAATC GGTACCACTA AGATTTTAAT TGCCCATCGA CTAGATACTA TCAAGTCTGC 1920 

AGATAAGATC TTAGTAATGC ATAATGGTGA AATTGTAGAG ATTGGGACCC ATAGAGAACT 1980 

TCTTGAACTA GGAGGCATTT ATAAGCAATT GTATTCAAAT AATTAGTTTT TGATTAAAAG 2040 

GGTAAATTTA TGAAGATTAT GAAAAAAAAA TATTGGACTT TAGCGATATT ATTCTTTTGT 2100 

TTGTTCAATA ATTCTGTTAC TGCTCAAGAA ATACCTAAAA ATCTTGATGG CAATATAACT 2160 

CACACTCAGA CTAGCGAAAG TTTTTCTGAA TCTGATGAAA AACAGGTTGA CTATTCTAAT 2220 

AAAAATCAAG AAGAAGTAGA CCAAAATAAA TTTCGTATTC AAATCGATAA GACAGAATTA 2280 

TTTGTAACAA CAGATAAACA TTTAGAAAAA AACTGTTGTA AATTGGAACT TGAACCACAA 2340 

ATAAATAACG ATATTGTTAA CTCTGAAAGT AATAATTTAC TAGGCGAAGA TAATTTAGAT 2400 

AATAAAATTA AGGAAAATGT TTCTCATCTA GATAATAGAG GAGGAAATAT AGAGCATGAC 2460 

AAAGATAACT TAGAATCGTC GATTGTAAGA AAATATGAAT GGGATATAGA TAAAGTTACT 2520 

GGTGGAGGCG AAAGTTATAA ATTATATTCT AAAAGTAATT CTAAAGTTTC AATTGCTATT 2580 

TTAGATTCAG GAGTCGATTT ACAAAATACT GGATTACTGA AAAATCTTTC AAATCACTCA 2640 

AAAAACTATG TCCCCAATAA AGGATATTTA GGAAAAGAGG AGGGAGAGGA AGGAATAATA 2700 

TCAGATATTC AAGATAGATT AGGTCATGGT ACGGCTGTTG TAGCTCAAAT TGTAGGGGAT 2760 

GACAATATTA ATGGAGTAAA TCCTCACGTT AATATTAACG TCTATAGAAT ATTTGGTAAG 2820 

TCGTCAGCTA GTCCAGATTG GATTGTAAAA GCAATTTTTG ATGCTGTAGA TGATGGCAAT 2880 

GATATTATCA ATCTTAGTAC TGGACAATAT TTAATGATTG ATGGAGAATA TGAGGACGGA 2940 

ACAAATGATT TTGAAACATT TTTGAAGTAT AAAAAGGCTA TTGATTACGC GAATCAAAAA 3000 

GGAGTAATTA TAGTAGCTGC ATTAGGGAAT GACTCCCTAA ATGTATCAAA TCAGTCAGAT 3060 

TTATTGAAAC TTATTAGTTC ACGCAAAAAA GTAAGAAAAC CAGGATTAGT AGTTGATGTT 3120 

CCAAGTTATT TCTCATCTAC AATTTCGGTC GGAGGCATAG ATCGCTTAGG TAATTTATCA 3180 

GATTTTAGCA ATAAAGGGGA TTCTGATGCA ATATATGCGC CTGCAGGCTC AACATTATCT 3240 

CTTTCAGAAT TAGGACTTAA TAACTTTATT AATGCAGAAA AATATAAAGA AGATTGGATT 3300 
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TTTTCGGCAA CACTAGGAGG ATATACGTAT CTTTATGGAA ACTCATTTGC TGCTCCTAAA 3360 

GTTTCTGGTG CGATTGCAAT GATTATTGAT AAATACAAAT TAAAAGATCA GCCCTATAAT 3420 

TATATGTTTG TAAAAAAATT CTGGAAGAAA CATTACCAGT AAAAAATGGT ATAAAAGTGT 3480 

TAAATATACC AAACGTATTG AGATATGATT TGAATATGTT ACAATTAGAA TATAAAAATG 3540 

AACAAAGTTG GGATAGTTTC ATAGATAATG TTAATTTAAT TGAGTTGGAA GAGAGAATTC 3600 

AAACTACTAT TGGAATTAAA CAAATAAACA CACACAATAT TATTACTATT GCCCGAGAAG 3660 

GGTACTCTCA AAATTATTTA CCTAACACTT CAGAAAATAC ATATAATTCA TTACAAGTCA 3720 

GTTTAGTTGG AGTATTACTA CTTTTTATAA GTATGGTAAA TATTTTATGG GCTAAAAAAA 3780 

GTAAATGAAA ATAAAATTTG GAGCCCTCTG AAAAAGTAAG TCCTACAGTT CAACTAAAAT 3840 

GAGTCAAAAG ATGAATCACC TTGATGTAGG GGAGTTTGTC TTATTGCTGC CTGAACACCT 3900 

CCGTTCAGAG GAAGAACATT ATAAATCTGT TTTTGAAGAC GACTTAACCA GTCGCATATC 3960 

TAGTCAAGAT GAACGACAGC AAATGACTGC TACGGTAGGT TATTTAGAAT CAGGTCAGGA 4020 

TCGTTTTGTG TATAATACGA CCCCTATTTC TTACCAGCAG TTTTTGAAAG ATCCAATCAT 4080 

CATTGTTATA ACACCCCAAT CAACTGGTCC ACAGTCCATT TTGTTTTGGA TAGACGCAGT 4140 

ACAGAACTAC GTTCTCTTTA ATCAATTGTC TGATGCCCAG GAGCTTATCC AGAGACAAGG 4200 

CATTGAAAAT TGGGTCTCAG AAATGCAAAC AGGTTACCAC AACTACATCA CATTATTGGA 4260 

TAATATCCAG AGGGAACGTT GGGTAATGCT AGCAGGAGCT GTGCTTGGGA TTGCAACTTC 4320 

AATCTTGTTG TTTAACACTA TGAATAGGCT CTACTTTGAA GAATTTAGAC GTGCCATTTT 4380 

TATCAAACGC ATTGCAGGTC TCAGGTTCTT AGAAATCCAT CGCACTTATC TCTTTGCTCA 4440 

ACTGGGTGTG TTTTTACTGG GATTTGTTGC GAGTGTATTT CTTCAGGTAG AGATAGGAGT 4500 

TGCTTTCTTA GTCTTGTTAC TCTTTACTGG TCTATCTCTT TTACAGTTAC ATGTCCAAAT 4560 

GCAGAAAGAA AACAAGATGT CCATGCTTGT TTTGAAGGGA GGTTAATATG ATTGAACTTA 4620 

AACAGGTGAG TAAATCTTTT GGAGAACGAG AGTTATTTTC GAATCTTTCA ATGACATTTG 4680 

AGGCTGGAAA AGTCTATGCC TTAATTGGTT CAAGTGGTAG CGGAAAAACA ACCTTGATGA 4740 

ACATGATTGG GAAATTAGAA CCTTATGATG GGACGATTTT TTACCGAGGT AAAGACTTGG 4800 

CCAATTATAA ATCAAGTGAT TTTTTCCGTC ACGAATTGGG CTACCTCTTC CAGAACTTTG 4860 

GCTTAATTGA AAACCAAAGT ATTGAAGAAA ACCTTAAGCT AGGTCTCATT GGTCAAAAGT 4920 

TGAGTCGGTC GGAACAGCGG TTGAGGCAGA AGCAGGCTTT AGAACAGGTC GGCCTGGTTT 4980 

ATCTTGACCT AGATAAGCGC ATCTTTGAGT TATCGGGCGG AGAATCGCAA CGGGTTGCCT 5040 

TGGCAAAAAT TATCTTAAAG AATCCACCCT TTATTCTGGC AGATGAGCCA ACAGCTTCAA 5100 
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TAGACCCAGC AACCTCTCAG TTGATTATGG AGATTTTGCT ATCTCTTCGA GATGATAATA 5160 

GGCTAATCAT TATCGCAACA CATAATCCGG CAATTTGGGA GATGGCTGAT GAAGTGTTCA 5220 

CGATGGATCA TCTGAAATAA AAATCCTTGT TTTTAATTGC ACGATGAGTT ACTGAAATAT 5280 

TATCATGAAT CAAGAATTGG AGTTAATTTA GAATTGTACT TAATTTAGAA TTGTACTTTA 5340 

TTAATATTGA GGTAACTTTT TCTTGATAAA GGAAGAAATA ATGGAGAGGA AGTTAGAATG 5400 

AAAAAATTCG ACAATTATAT TATTGAGAAG CCTTGCGATT CTAATTCAGA TAAACTGCAA 5460 

AAAATCTTAA TAATTGAAAG TTTGGTAGAT GATATTTTGC AATTTTCTCT CAGAATCAAT 5520 

AATAGTGTAG GAGAGATTTT CCTCCTACAA CCGTTTTAAA AGAAAACTAT CTTTATTCCA 5580 

TGTTATTTTG AGGAAGATAT TGTGAAAGTC AAAGATGATG ATAAAGTTGA GTGGAATTTG 5640 

TTAGAATTTC AAAAATTTAG AGCATTTTTG GCTTAGTAAT CTGTGTTGAA GGCTCAAAAC 5700 

CTATGGTAAA AAAGTAGCTT TGAAAACGTA TTGCCTCCAA AGATTTAGTT AAATAATGAT 5760 

TTAACACAAA AAGAAATTAT TGAAGTTCTG GAAAGATGTT GTTTCAGTAT TGAGAAAAGG 5820 

TGGGAAAAAC TTGCGATTTT CACAGAGAAA GGAAGAAAAA GTATAGAAAT ATAGTCAATT 5880 

GAAACAAGAA CAGGATAAAA GAACCTTTTG TGCCATATTT TTCTCCTTTC GCTTTACAAT 5940 

TGGATTGAAC ACCTTTATTG TATCGCGTTT GGAGTTTTTT TGGTATAACC TTCGACGCAC 6000 

ACCCGCATAG CGGGTGTTTT TTTTGTCTCG CACCTAACGG AGCGAGACAA ACTAATAGTC 6060 

ACTTAATCAA AAAACGCACC ATATCAAAAA CTAAAAAGTT TGATATCATG CGTCATGTCT 6120 

TAAACTAATT GACTATACTT TCTATTCAAA TGAGCTTTTA ACCAATTGAT TGAGCCAATC 6180 

CACTCTTAAA ACCAAAGAGC AATTTCTCGC TTAGCTGACT CTTCTGAATC TGAACCATGT 6240 

ACAACATTTT GGATAATCTC ATTTTCTCCA GCAGCTTTTG CAAAATCACC TCGAATAGTG 6300 

CCTGGTAAAG CTTCTTCTGG ACGAGTTGCA CCCATCATGG TCCGCCAAGT TTCGATTACT 6360 

TTGGGACCAG AAATGACACC CACAAGAACT GGACCTGAAG TCATGAATTC ACGAATCGGT 6420 

GGGTAAAAAC TCTGACCAAC CAAGTCCTGA TAGTGCTGGT CAATCAACTC TTCTGAAACC 6480 

TGTGAACGAA ACTCCAATTT TTCGATTGTA AATCCACGTT GTTCGATGCG CTTTAACACT 6540 

TCACCCACTA GCCCTCTTTT TACACCATCT GGTTTGATGA TAAAGAATGT TTGTTCCATA 6600 

CCCGTCTCCT TTGTCAGCTT CTTTCTTTTA TTTTACCACA TTTCGTGGAA AAATGGAGAA 6660 

AGTTTTCAGA AGAGAGAATG AGAGAACCCT CGGGTTCTCT CATTCTCTCT TATTCTACTG 6720 

TTTCTTCCAC AGTTTCAACG GCAGTATCCA CAACTACTTC TGTTGTTTCT TCATTTCCTT 6780 

CTTCCTCTAC TGGAGGATTA AGGTATTCTT CTTCGTTGAC AGCATGTGGT TCAAGGTTAC 6840 
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GGTAACGGGC CATACCAGTA CCAGCTGGGA TGATCTTACC GATGATAACA TTTTCTTTAA 6900 

GTCCAAGGAG ATGGTCTTTC TTACCACGGA TAGCTGCGTC AGTAAGGACA CGAGTTGTTT 6960 

CCTGGAAGGA AGCCGCTGAC AAGAAACTGT TTGTTTCAAG TGAGGCTTTG GTAATTCCCA 7020 

TAAGGACTGG GCGACCTGTC GCTGGAACTC CACCTGCGAT AAGGACATCT TTGTTGGCAT 7080 

CTGTAAAGTC ATTGATATCC ATGAGGGTAC CCATGAGAAG ATCTGTATCA CCTGGATCCA 7140 

TGACACGGAC TTTACGGATC ATTTGACGAA CCATTACCTC GATGTGTTTG TCACCGATTT 7200 

CTACCCCTTG GCTACGGTAA ACTTTTTGTA CTTCACCGAG AAGGTACGTT TCAACTGACA 7260 

AGACATCACG AACTGCAAGG AGACGTTTTG GTTGGATAGA ACCTTCTGTC AGAGCAGCAC 7320 

CACGCGCTAC TTGGCCCCCA ACTTCGACAC GCATACGAGC TGTAAATGGA ACGACATATT 7380 

CACCTTCGCC AGTTTCACCC TTAACAAAGA CTTTCTTGGT ACGAGTTGAT GCATCTTCTT 7440 

CGATAGCAGT AACTTGTCCT TTAACCTCTG TAATAACCGC TTCCCCTTTA GGATTGCGGG 7500 

CTTCAAAGAT TTCTTGGACA CGAGGAAGAC CCTGAGTGAT ATCGGTATTT GAGGCAACCC 7560 

CACCTGTGTG GAAGGTACGC ATTGTAAGCT GTGTACCAGG TTCCCCGATA GATTGGGCAG 7620 

CGATTGTACC AACTGCTTCA CCAACTTCAA CCGCATCACC AGTCGCCAAG TTGATACCGT 7680 

AACAGTGACG GCAGACACCG TGACGAGTGT TACATGTAAA TACAGAACGG ATAGTCACTT 7740 

CTTCCACACC AGCATTGACA ATTTCACGCG CCTTGTCTTC TGTAATCAAT TCATTTGGAC 7800 

CAATAATCAC TGCACCAGTT TCTGGATGTT TAACAGTTTT CTTAGTGTAA CGACCGTTGA 7860 

GACGCTCTTC GAGAGACTCG ATCATCTCTT TTCCTTCTGC GATAGAACGG ATCAAGAGAC 7920 

CACGGTCAGT TCCACAGTCG TCCTCACGGA TGATAACGTC TTGGGCAACG TCGACCAAAC 7980 

GACGAGTCAA GTAACCTGAG TCGGCTGTCT TAAGGGCCGT ATCGGTCATA CCTTTACGAG 8040 

CACCGTGAGT TGAGAAGAAC ATTTCCAATA CCGACAAACC TTCGCGGAAG TTTGAAAGGA 8100 

TTGGCAATTC CATGATACGT CCATTCGGAG CAGCCATCAG ACCACGCATA CCGGCAAGCT 8160 

GTGAGAAGTT TGAGATGTTA CCACGGGCTC CAGAGTCCAT CATCATAACG ATTGGGTTCT 8220 

TAGGATCTTG GTTAGCAATC AAGCGTTTCT CAAGTTTTTC ACGGGCAGCA CGCCATTCAG 8280 

CTGTAACAGC ATTGTAACGC TCGTCGTCTG TGATCATACC ACGACGGAAT TGTTTGGTGA 8340 

TTTGTTCGAC ACGTTTGTGT GATTCTTCAA TGATTTCAGC CTTGTCATCA ACGACTGGGA 8400 

TATCGGCAAT ACCCACTGTC AATCCTGCAA GAGTTGAGTG GTGGTAACCG AGGTTCTTCA 8460 

TGCGGTCAAG TAGGGCAGAA GTTTCTGTCG TACGGAAACG TTTGAAGATT TCAGCGATGA 8520 

TATTTCCAAG GTTTTTCTTC TTGAATGGAG GGTTGAGCTC AAGATTGCTG ATAGCTTCCT 8580 

TGATATCTCC ACCAAGTGGC AAGAAGTATT TAGCTGGAAC ACCTTCTGTC AAGTTGGCAT 8640 
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CAACCACTAT TTCTTCCCTT TCTATTCTAA GTGAACTGCT TATTCTTGTT CAGCAGCTTC 10440 

TTCTGTTGCT TCCGCTTTTG TTGCTTTCTC AGCTTCTTCA GCTTCAAAGG CTGCTTTAGC 10500 

CTCTTGGGCT GCTTTTTCGC GGGCTTTTTC AAGGTCATCT ACGTGGATGA CATCTTCGTC 10560 

CATTCCTTCA TCCAAGTCGC GAAGTTCCAC TTCTTGGTCA TCTTCGTCTA GGACACGCAT 10620 

GTCAAGACGA AGAGATTGCA ATTCTTTGAC AAGAACTCGG AAGGATTCTG GAACACCTGG 10680 

TTTTGGAATT GGTTTGCCTT TTGTAATAGC TTCATAGGCT TTCAAACGTC CGTTGATATC 10740 

GTCCGACTTG TAAGTCAAGA TTTCTTGAAG GACATTTGAC GCACCGTAGG CTTCAAGAGC 10800 

CCAAACCTCC ATCTCACCGA AACGTTGTCC ACCAAACTGA GCCTTACCTC CGAGTGGTTG 10860 

TTGGGTAACA GTTGAGTATG GTCCGACTGA ACGCGCGTGC AATTTATCAT CAACCATGTG 10920 

GTGGAGTTTG ATCATGTACA TGACTCCGAC AGAAACACGG TTATCAAACG GTTCACCAGT 10980 

ACGTCCATCG TAAAGGATCG TTTTGGCATC GCTATCCATA CCTGCTTCTT TAACAGTTGA 11040 

CCAAAGATCT TCAGAACTTG CTCCATCAAA GACTGGTGTA GCGATGTGAA TACCAAGAGT 11100 

ACGAGCTGCC ATACCAAGGT GAAGCTCCAT AACCTGACCG ATATTCATAC GTGATGGTAC 11160 

CCCAAGTGGG TTCAACATGA TGTCGACTGG AGTTCCGTCT GGAAGGTAAG GCATGTCTTC 11220 

TACAGGAACG ATACGAGAGA CAACCCCTTT GTTTCCGTGA CGTCCGGCCA TTTTATCTCC 11280 

GACCTTAATC TTACGTTTTT GAGCGATGTA AACACGAACC AACATGTTAA CACCTGATTG 11340 

CAACTCATCT CCATTTACAC GTGTAAAGAT CTTAACATCA CGAACGACAC CATCGGCACC 11400 

GTGTGGTACA CGAAGAGAAG TATCACGCAC TTCACGAGAC TTGTCTCCAA AGATAGCGTG 11460 

CAAGAGACGT TCTTCAGCTG AAAGATCTTT CTCACCCTTA GGTGTTACTT TACCTACAAG 11520 

AATATCACCT TCTTTAACCT CAGCACCAAT ACGGATAATC CCCATTTCGT CAAGGTCTTT 11580 

GAGGGCATCT TCACCAACGT TTGGAATTTC GCGAGTGATT TCTTCAGGCC CAAGCTTTGT 11640 

ATCGCGCGTT TCTGATTCGT ATTCTTCAAG GTGAACAGAT GTGTAGACAT CGTCCTTCAC 11700 

CAAGCGTTCG CTCATGATAA CGGCATCCTC GAAGTTGTAA CCTTCCCAAG TCATGTAGGC 11760 

AACGATTGGG TTTTGTCCAA GCGCCATTTC TCCATTTTCC ATAGAAGGTC CGTCAGCGAT 11820 

GAAATCGCCT TTTTCAACGA CATCACCAAC TTTTACGAGA GTGCGTTGGT TGTAAGCAGT 11880 

ACCTGAGTTT GAACGACGGA ATTTTTGGAT GTGGTAAACA TCCAATGAAC CATCTTCACG 11940 

ACGAACTTCT ACCTTGTCAG CATCTGCGTA AGTAACTTTA CCATCATACT GAGCAATCAC 12000 

AGCCGCACCA GAATCGTGGG CTGCTTGGTA TTCCATACCA GTACCAACGT AAGGTGCCTG 12060 

AGGATTAATC AATGGCACAG CCTGACGTTG CATATTGGCT CCCATGAGGG CACGGTTGGA 12120 

GTCATCGTTT TCCAAGAAAG GAATACATGC TGTCGCAACG GCAACTACCT GTTTTGGTGA 12180 
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AACGTCCATG TAGTCAACAA TATTAGCTGG ATACTCTTGG TTGACCCCTT GGTGACGTCC .12240 

CATGACAATC TTCTCAGCAA AGGTTCCATC TTCATTCAGA CGAGAGTTAG CCTGAGCTAC 12300 

AGTATATTCA TCTTCTTCAT CAGCTGTCAA CCAAACAATT TCGTTCGTGA CAACACCTGT 12360 

TTCACGGTCA ACCTTACGGT ATGGTGTTTG AACAAAACCA TATTTGTTCA AGTGTCCATA 12420 

AGATGACAAG TTATTGATCA AACCGATGTT AGGTCCTTCA GGTGTCTCGA TTGGACACAT 12480 

ACGACCATAG TGAGTGTAGT GCACGTCACG TACTTCATAT CCAGCACGGT CACGAGTCAA 12540 

ACCACCAGGT CCTAAGGCTG ACAAACGGCG TTTGTGAGAC AACTCAGAAA GCGGGTTGTG 12600 

TTGGTCCATG AACTGTGACA ACTGTGATGA ACCAAAGAAT TCTTTAACTG CAGCTGTTAC 12660 

AGGACGGATA TTGATAATTT GTTGTGGTGT CAAGACTTCA TTGTCCTGAA CAGACATACG 12720 

TTCACGGACA TTACGTTCCA TACGAGAAAG TCCCAAACGT ACTTGGTTGG. CAAGCAATTC 12780 

ACCAACCGCA CGGATACGAC GATTTCCAAG GTGGTCGATA TCATCTACAC GGCCAAGTCC 12840 

TTCAGCCAAG TTGAGGAAGT AGCTCATCTC AGCAAGGATA TCTGCAGGAG TCACCGTACG 12900 
AACCTTGTCA TCTGGGTTAG CATTACCAAT GATCGTTACG ACGCGATCTG GATCAGTTGG . 12960 
AGCAATAACC TTGAATTTTT GAAGAACAAC AGGCTCAGTC ACAACGGCTG CATCGTTTGG , 13020 

GATGTAGACA ATCTTGTTCA AGTCGCCATC CAAATGGCTT TCAATGCTTT CAATCACGCT 13080 

ACGAGTCATA ATCGTACCAG CTTCTACCAA GATTTCTCCA GTTTCAGGGT CTACCAATGG 13140 

CTCTGCAATG GTTTGGTTGA GCAAACGTGT TTTAACATTG AGTTTTTTAT TGATTTTGTA 13200 

ACGACCAACT GCTGCCAAGT CATAACGACG TGGGTCAAAG AAGCGAGCTA CAAGCAAGCT 13260 

ACGTGAGCTT TCAGCCGTCT TAGGCTCACC TGGACGAAGG CGTTCGTAAA TTTCTTTCAA 13320 

GGCTTCGTCT GTACGAGAGT CCATTGGATT CTTGTGGATA TCTTTTTCAA CAGTGTTGCG 13380 

AACCAATTCG CTGTCACCAA AGATATCAAA GATTTCATCA TCACCTGAGA AACCAAGAGC 13440 

ACGAACCAAG GTTGTAAATG GAATCTTACG AGTACGGTCG ATACGAGTGT AGGTGATATC 13500 

TTTTGAGTCG CTTTCAAGTT CCAACCAAGC TCCACGGTTA GGGATAACAG TTGAACCATA 13560 

GCCCACCTTA CCATTTTTGT CTACTTTGTC GTTAAAGTAA ACACCTGGTG AGCGGACCAA 13620 

CTGAGAAACG ATAATACGTT CACCACCATT GATGATGAAA GTACCCATTT CTGTCATGAT 13680 

TGGGAAATCA CCAAAGAAAA CTTCTTGGGT CTTGATTTCG CTTGTTTCTT TATTGATCAA 13740 

ACGGAAGGTT ACAAAAATTG GTGCTGAGTA GCTAGCATCG TGGATACGAG CTTCTTCTAG 13800 

CGTATATTTT GGTTCCTTGA TTTCATATCC AACAAATTCC AACTCCATTG TGTCTGTGAA 13860 

GTTTGAAATT GGCAATACAT CTTCAAACAC TTCCTTAAGA CCGTGGTCTA GGAAAGCTTT 13920 
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GAATGAGTCA GTTTGAATTT CAATCAAATT TGGTAAGTCA AGAACTTCTT TGATTCTTGA 13980 

AAAACTACGA CGGGTACGAT GTTTCCCGTA TTGAACGTCA TGTCCTGCCA AGATGATTCT 14040 

CCTTTGTAAA TAAGTTCCAA GCCTTGTCAA TCAGGCTTTT CTAATCGTCA TATGGTTGTA 14100 

AACCCCTTAT CACCGTGTCC TCTTGACGAA TTTTCAGAAT CTTTAAGCCT CTGTTACAAA 14160 

TGCTCAAAAT CTTGAAAAAA AGCACAAAAA GAGCAGCTAA ATCTGACTTT TTCAGAAGAT 14220 

TTAACTGCTG TGAGCCTTGT CTGGACAATA TTTCAGACAA AACCTACGAC AAATGATTAC 14280 

CCATATTATA CCCTATTTAG CTAGATTTTT CAAGGGGTTT CAGTAGGTTT TTGGTAAATT 14340 

TTTTCCCATA GAAAACTTGG CATCACATTC GAATCACGCT ATGGTACAAA AAACTGAAAA 14400 

AACTATTGAC TGAAAATCAT TTTCAAGGTA TAATAATAAA CGTTAAGGCG GTATAGCCAA 14460 

GTGGTAAGGC ACGGCTCTGC AAAAGCTTGA TCGTCGGTTC AAATCCGTCT ACCGCCTTCT 14520 

ATAACTTGAT TTATCAGGTT TCAAATGAAC AGAAAGCCCA ATTTGAAGGG CTTTTTTTAT 14 580 

TTTCCCTCGA ATAAATACGT ATAACTTTAA AAACTTTTGG AGCGAGTTTG TGGCAGAGTT 14640 

CTTTCCATGG CATAATTCCC TTTTGAAATC AG 14672 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7902 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

AGGAGACTAT TCAAGCCCAA ATTgAGTAGC CCAGCAAAGA CTGTATAGAC TGTGATACGT 60 

TTTTCATAGC CATTGGTAAA GAGAATTTGG GAACCAAGAA TGGTATCTAA GGCCAGGATA 120 

ATCGTACGAA AAGCGAAGAG AGAGGTCAAG ATGCCGCCTC CGATATATTT TTCACTACCG 180 

TAAAGTAGGA TGGCATTTGG TCCTAAAACC ATGAGTCCAA AACTCAGTGG AATGATAAAG 240 

AAGTTAAAGA TTCGACTACC TCTATTAACC AGAGAAACAT AGGCTTCTTT GTCTCCTTTC 300 

CCCAGATAGT AACTGAGACG AGGCACACTC ACTCCAATTG CACCTGTTAC AACCCCAGCT 360 

ATAACGGTCA CAATTCGCTG AGCTATGGTA TAGTAACTAA CGTTGACATC AATCCCTGTT 420 

TTAACGAGGA AGAGGCGATC TAAAAAAGTG AAGAGCATAT TGGCATTGGC AAAGACTAAC 480 

ATGGCTGTCA GAGGGAGAAA GAGTGGTTTA AAATCACTTA GGTGAATTTT AACAAGTTTG 540 

ATGTCTCTTT TAATCCAAAA ATAACTAATC AGGTAGTTAA TCAGCGTCGA TAAACTCATC 600 

ACAAGTGTAT AGACAACAAT ATCGTGTTCA TTTTTAACAA ATAAGAAAAT AGAGACCAGC 660 
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ATCAGGATAC GGATGAAGGC AGTTTTGTAA AAGAGAAAAC TGTAATTTTC CAGAGCTTCA 720 

TTGACCCATT CGATTGAAAA AATCTGGGCA ATGAGTTGAA TCCCCATAAC AAGGTAGACC 780 

TTTTTGACGA TTGGATTATC AGTAAAGAAG AGAGGATAGG CTAGGATATA GACAGCAGTG 840 

GTCAAAATCG TACAAGCGAT GCACAAATAA AAAAGACTAG AAAAGGTTCT GTTAAGATCT 900 

TTTTTGTTAT CCTTGACATT ACTGATAGCC CTTAAACCGT AGTTATAGAC ACCATAAGTT 960 

GCAAAGGGCA AGAAAAATGA CAAAATAGTG TCGACTGAGT TGAAGTAACC ATAGTCAGTT 1020 

CGGTCCAAGA CACGCGCGAC ATAGGTTCCA GTTAGGATGG GAAAAATAAT ATTCAAGACA 1080 

CGAATTCCCA TGTAAGATAG AGCATTTAAT TTTATACTTT TCATTCAATT TACCTCGTTT 1140 

TTCATTATAT CATAAAGTTA GCTAATAAGA AATGAAGGGC AGTAAGTCAA GTAATCACTT 1200 

TGAAGTTTCA AATCTTAAGT TTTAAGTTTT CTTTAAGGAA AGTATATTAT TCTGAAGGAC 1260 

TCTAAAATTT CGCAGCCATT TATTAGTAAT TGCTACAGAA TTCCTAGTCA TTACTAGAAA 1320 

TGGACTAGTT TCTTTGAATA ATAGAACTGC ATAATTCTCC TATTCTAGAA GGGGAGGACC 1380 

AGTATTTCTT TTATGATAGG ACTAGATTGT GGTATAATAG AGAGAATAAG TTTTTTTAGT 1440 

AAGACAAAGG AGAAAATAGA TGATTTATGC AGGAATTCTT GCCGGTGGAA CTGGCACACG 1500 

CATGGGGATC AGTAACTTGC CAAAACAATT TTTAGAGCTA GGTGATCGAC CTATTTTGAT 1560 

TCATACAATT GAAAAATTTG TCTTGGAGCC AAGTATTGAA AAAATTGTAG TTGGTGTTCA 1620 

TGGAGACTGG GTTTCTCATG CAGAAGATCT TGTAGATAAA TATCTTCCTC TTTATAAGGA 1680 

ACGTATCATC ATTACAAAGG GTGGTGCTGA CCGCAATACA AGTATTAAGA ACATCATTGA 1740 

AGCCATTGAT GCTTATCGTC CGCTTACTCC AGAGGATATC GTTGTTACCC ACGATTCTGT 1800 

TCGTCCATTT ATTACACTTC GCATGATTCA GGACAATATC CAACTTGCCC AAAATCATGA 1860 

CGCAGTGGAC ACAGTGGTAG AAGCGGTTGA TACTATCGTT GAAAGTACCA ATGGTCAATT 1920 

TATTACAGAT ATTCCAAATC GTGCTCACCT TTATCAAGGA CAAACACCTC AAACATTCCG 1980 

TTGCAAGGAC TTCATGGACC TTTATGGATC TCTTTCTGAT GAAGAGAAGG AAATCTTGAC 2040 

AGATGCATGT AAAATCTTTG TGATCAAAGG AAAAGATGTG GCTTTGGCCA AAGGTGAATA 2100 

CTCAAATCTG AAGATTACAA CCGTAACAGA TTTGAAGATT GCAAAAAGTA TGATTGAGAA 2160 

AGACTAGTAA AATGATTAAT CAAATTTATC AACTAACTAA GCCTAAGTTT ATCAATGTCA 2220 

AATATCAGGA AGAGGCTATT GACCAAGAGA ATCATATCCT TATCCGTCCC AACTACATGG 2280 

CTGTCTGTCA TGCGGATCAG CGTTACTATC AGGGAAAACG TGATCCCAAG ATTTTGAATA 2340 

AAAAGCTTCC AATGGCAATG ATTCACGAGT CATGTGGAAC CGTCATTTCT GACCCGACCG 2400 
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GAACCTACGA GGTTGGTCAA AAAGTTGTCA TGATTCCCAA TCAGTCTCCT ATGCAGAGTG 2460 

ATGAAGAATT CTATGAAAAC TACATGACAG GGACCCATTT CTTGTCTAGT GGATTTGATG 2520 

GCTTTATGAG AGAGTTTGTT TCTCTCCCTA AAGATCGTGT GGTGGCTTAT GATGCTATTG 2580 

AAGATACGGT TGCAGCCATT ACAGAGTTTG TCAGTGTGGG CATGCACGCT ATGAATCGTC 2640 

TATTGACTCT TGCTCATAGC AAGCGGGAGC GGATCGCCGT TATTGGAGAT GGAAGTTTAG 2700 

CTTTTGTGGT TGCCAATATT ATCAACTATA CTTTGCCAGA AGCAGAGATT GTGGTTATTG 2760 

GTCGTCATTG GGAAAAGTTG GAACTCTTCT CATTTGCCAA AGAATGCTAT ATTACGGATA 2820 

ATATTCCTGA AGATTTGGCC TTTGACCATG CTTTTGAATG TTGTGGTGGT GATGGTACTG 2880 

GACCAGCTAT TAATGACTTG ATTCGCTACA TTCGTCCTCA GGGAACGATT CTCATGATGG 2940 

GAGTTAGCGA ATATAAAGTC AATCTCAATA CTCGCGATGC CTTAGAAAAG GGCTTGATTT 3000 

TGGTTGGGTC ATCTCGTTCT GGTCGCATTG ATTTTGAAAA TGCTATCCAA ATGATGGAAG 3060 

TCAAGAAATT TGCCAATCGT CTTAAAAATA TCCTTTATCT AGAAGAACCT GTAAGAGAAA 3120 

TTAAAGATAT TCATCGTGTC TTTGCAACCG ATTTAAACAC AGCCTTTAAA ACAGTGTTTA 3180 

AGTGGGAAGT ATAAGTACTG GAGGTTAATT GTGGAGAAAA TCATTAAAGA AAAAATTTCT 3240 

TCCTTACTTA GTCAAGAAGA GGAAGTCCTC AGTGTTGAAC AACTGGGTGG AATGACCAAT 3300 

CAAAACTATT TGGCCAAAAC AACAAATAAG CAATACATTG TTAAATTCTT TGGTAAAGGG 3360 

ACAGAAAAGC TTATCAATCG ACAAGATGAA AAGTACAATC TTGAACTACT AAAGGATTTA 3420 

GGCTTAGATG TAAAAAATTA TCTTTTTGAT ATTGAAGCTG GTATCAAAGT AAATGAGTAT 3480 

ATCGAATCTG CGATTACGCT TGATTCAACG TCAATCAAGA CCAAGTTCGA CAAAATTACT 3540 

CCAATATTAC AAACTATTCA TACGTCTGCT AAGGAATTAA GAGGAGAATT TGCTCCTTTT 3600 

GAAGAAATCA AAAAATACGA ATCCTTGATT GAAGAACAAA TTCCTTATGC CAACTATGAA 3660 

, TCTGTTAGAA ATGCAGTCTT CTCCTTAGAG AAAAGACTGG CTGACTTAGG TGTTGACAGA 3720 

AAATCTTGTC ATATCGATTT GGTGCCTGAA AACTTTATCG AATCACCTCA AGGACGACTT 3780 

TATTTGATTG ACTGGGAATA TTCATCAATG AATGATCCAA TGTGGGATTT GGCTGCCCTC 3840 

TTTTTAGAGT CTGAATTCAC TTCCCAAGAG GAAGAAACTT TCTTATCTCA CTATGAGAGT 3900 

GACCAAACAC CGGTTTCTCA TGAAAAGATT GCTATTTATA AAATTTTACA AGATACTATT 3960 

TGGAGTCTAT GGACTGTCTA TAAGGAAGAG CAAGGTGAAG ATTTTGGTGA CTATGGTGTG 4020 

AATCGTTACC AAAGAGCTAT TAAAGGTTTG GCTTCTTATG GAGGTTCAGA TGAAAAGTAA 4080 

AAACGGAGTT CCTTTTGGCC TTCTCTCAGG TATTTTCTGG GGCTTGGGTC TAACGGTTAG 4140 

TGCTTATATC TTTTCGATTT TTACAGATTT GTCACCCTTT GTGGTGGCTG CAACTCATGA 4200 
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TTTTTTGAGC 


ATCTTTATCT 


TACTAGCTTT 


TCTCTTGGTA 


AAAGAAGGGA 


AAGTTCGCCT 


4260 


CTCAATTTTC 


TTAAATATTC 


GCAATGTCAG 


TGTTATCATC 


GGAGCCTTGC 


TAGCAGGCCC 


4320 


TATCGGTATG 


CAGGCCAATC 


TTTATGCAGT 


TAAGTATATC 


GGAAGTTCTT 


TAGCTTCATC 


4380 


TGTATCGGCT 


ATTTACCCTG 


CGATTTCAGT 


TCTATTGGCT 


TTCTTCTTTT 


TGAAGCACAA 


4440 


GATTTCGAAA AATACTGTAT 


TTGGGATTGT 


CTTGATTATT 


GGAGGGATTA 


TTGCTCAGAC 


4500 


L1A1 AAWaTT 


GAACAGGTTA 


ATTCTTTCTA 


CATTGGGATT 


CTTTGTGCTT 


TGGTTTGTGC 


4560 


TATTGCATGG 


GGAAGTGAGA 


GTGTTCTTAG 


CTCTTTTGCC 


ATGGAAAGTG 


AATTGAGTGA 


4620 


AA rCbAAGCC 


CTCTTAATCC 


GTCAAGTAAC 


TTCGTTCTTG 


TCCTATCTTG 


TGATTGTGCT 


4680 


CTTCTCTCAT 


CAGTCATTTA 


CTGCAGTAGC 


CAATGGACAA 


TTGCTAGGTC 


TCATGATTGT 


4740 


TTTTGCAGCC 


TTTGATATGA 


TTTCCTACTT 


GGCTTATTAT 


ATCGCTATCA 


ATCGCTTGCA 


4800 


ACCAGCCAAG 


GCTACAGGCT 


TGAACGTGAG 


CTATGTAGTA 


TGGACGGTCT 


TGTTTGCAGT 


4860 


TGTTTTCTTG 


GGTGCACCGC 


TAGATATGCT GACCATTATG ACGTCACTTG 


TCGTCATTGC 


4920 


TGG AGTTTAT 


ATTATTATTA 


AAGAATAAAG 


GAGATTCGTG 


TGAAAGCCAT 


TATCTTAGCA 


4980 


GCGGGATTGG 


GAACTCGCTT 


GCGTCCTATG 


ACTGAAAATA 


CCCCTAAAGC 


CTTGGTTCAG 


5040 


GTTAATCAAA 


AACCTTTGAT 


TGAGTACCAA 


ATTGAGTTTC 


TCAAAGAAAA 


AGGAATCAAT 


5100 


GACATCATCA 


TCATTGTTGG 


TTATCTTAAA 


GAACAATTCG 


ATTACTTGAA 


AGAGAAATAC 


5160 


GGTGTTCGTC 


TCGTTTTCAA 


TGATAAATAC 


GCTGACTACA 


ATAACTTTTA 


CTCTCTCTAT 


5220 


CTTGTAAAAG 


AAGAATTGGC 


CAACAGCTAT 


GTTATTGATG 


CTGACAATTA 


TCTCTTTAAA 


5280 


AATATGTTCC 


GCAATGATTT 


GACACGTTCG 


ACTTATTTTA 


GTGTTTATCG 


TGAAGATTGT 


5340 


ACCAACGAAT 


GGTTCTTGGT 


TTATGGAGAT 


GACTACAAGG 


TTCAAGACAT 


TATTGTTGAT 


5400 


AGCAAGGCAG 


GTCGCATCCT 


TAGTGGTGTA 


TCCTTCTGGG 


ATGCTCCAAC 


TGCAGAAAAG 


5460 


ATTGTCAGCT TTATCGACAA GGCTTATGTA AGTGGTGAAT TTGTTGATCT 


CTATTGGGAC 


5520 


AATATGGTTA AGGATAATAT 


CAAAGAGCTA 


GATGTCTATG 


TTGAAGAATT 


AGAAGGCAAT 


5580 


AGCATTTATG 


AGATCGATAG TGTCCAAGAC 


TATCGTAAAT 


TAGAAGAAAT 


TCTTAAAAAC 


5640 


GAAAATTAAA 


GATTCCAACA 


TCTGACAAAA 


TAGTCGGATG 


TTTTTTGATT 


TTTTACGAAC 


5700 


TTTTACGAAT 


AGATAGATGA 


GTAGAAAAAG 


AAATGGAGTT 


ATTTATGAAA 


ATCACAAACT 


5760 


ATGAAATCTA 


TAAGTTAAAA 


AAATCAGGTT 


TGACCAATCA ACAGATTTTG 


AAAGTGCTAG 


5820 


AATACGGTGA 


AAATGTTGAT 


CAGGAGCTTT 


TGTTGGGTGA 


TATTGCAGAT 


ATCTCAGGTT 


5880 


GCCGTAATCC 


AGCCGTTTTT 


ATGGAACGTT 


ATTTTCAGAT 


AGACGATGCG 


CATTTGTCGA 


5940 
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AAGAGTTTCA AAAATTTCCA TCTTTCTCTA TTTTAGATGA CTGTTATCCT TGGGATTTGA 6000 

GTGAAATATA TGATGCGCCT GTACTTTTAT TTTACAAGGG AAATCTTGAC CTCCTGAAAT 6060 

TCCCGAAGGT AGCGGTCGTG GGCAGTCGTG CTTGTAGCAA ACAGGGAGCT AAGTCAGTTG 6120 

AAAAAGTCAT TCAAGGCTTG GAAAATGAAC TGGTTATTGT CAGTGGTCTG GCCAAGGGCA 6180 

TTGACACAGC AGCTCATATG GCAGCTCTTC AGAATGGCGG AAAAACCATT GCAGTGATTG 6240 

GAACAGGACT GGATGTGTTT TATCCTAAAG CCAATAAACG CTTGCAAGAC TACATCGGCA 6300 

ATGACCATCT GGTTCTAAGT GAATATGGAC CTGGTGAACA ACCTCTGAAA TTTCATTTTC 6360 

CTGCCCGTAA TCGCATCATT GCTGGACTTT GTCGTGGTGT GATTGTAGCA GAGGCTAAGA 6420 

TGCGTTCAGG TAGTCTCATT ACGTGTGAGC GAGCAATGGA AGAAGGACGC GATGTCTTTG 6480 

CTATTCCTGG TAGCATTTTA GATGGACTAT CAGACGGTTG CCATCATTTG ATTCAAGAAG 6540 

GAGCAAAATT GGTCACCAGT GGGCAAGATG TTCTTGCGGA ATTTGAATTT TAAAAATGAC 6600 

CTAAGCTAGA ATTCTAAGAA AAAATCAATT TTAAGAGAAA ATGAACCCAA CATTTCCATA 6660 

ATAAAACGCA TATTAGCAAG TTTTTAACAC TTGATAATAT GCGTTTTTTC TAAGTGGATT 6720 

AGTAGAGTAG AGGATTTTTC TCATATAATA CTCTTCGAAA ATCTCTTCAA ACTACGTCAG 6780 

CTTCCATCTG CAACCTCAAA ACAGTATTTT GAGCgaCTtC GTCAGTCTTA TCTACAACCT 6840 

CAAAGCAGTG CTTTGAGCAA CCTGTGGCTA GCTTCCTAGT TTGCGCTTTG ATTTTCATTG 6900 

AGTATAAGGG AAAGTATAGT GAATTGAAAT AAGATGTGAA CAACTCTATC AGGAAAGTCA 6960 

AATTAATTTA TAGAAATATT TTAGCAGCCA AGGTGTACTG TTATAGATTC AATTACACTA 7020 

TAATTTAGTG TAATTGAGAA AGGAGAAATG ATTGTGATTG ATGTTGGCTA GGTTATGTTC 7080 

AATGATTCCT ACCGTCTCAA ATCTTGTCAG TAAGGAAAAA TAAATTCTTC AAAAGTAGAG 7140 

ATTACAAGGC TTGTTTAAGA AAGAATTCAA AGACCTTGAC AAATAAAAAT AAAATGGTTA 7200 

TTATAAAAAA TGGTCTGAAA TAGATGATGA TACTTTTCGA AAATCTCTTC AAATACGTCA 7260 

GCTCAGCTTT GCCTTGCTGT GTTTTGAGCA AGCTACGGTT AGCTTCCGAG TTTGATTT-'C 7320 

ATTTACTAGA AATGAAACTG ATGAGAGATA TCAGTAGACA TTTGAGTCAG GATATTATGG 7380 

AAAATGATAA AAAGAGCTCG TGAGATTGGC ATATCAGACT ACTAAAGTAT TGAGTTTGTT 7440 

AGGATTTTAG CGACTAGTTA GCTGGGAAAG GAAGATATTT GTGACAAATA ATAAACTGTA 7500 

TTCGTTGATA GAATTTAGAA ATAAAATATA TGAAGAATTA GAACTTTCCA GAAGTGATTT 7560 

AGCGATTTTA CTATGTGCCA TGCTTATCGC CTCTATCGGA TTAAATATGG ATTCGACTCC 7620 

CGTGATTATT GGAGCCATGT TAATCTCTCC TTTGATGACA CCTATTCTGG GAGTGGGGCT 7680 

CTCTCTAGCT ATATTTGATT TTAAATTGTT AAGAAAATCT TTTAAAATAT TAGCTATTCA 7740 
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AATTCTTGCC AGTCTAATAG CTTCAACACT TTATTTTTAT CTTTCTCCCA TTTCGTATGC 7800 

TAGTTCGGAG ATTGTTGCTA GAACCTCTCC GACTATTTGG GATGTTCTCA TTGCTTTTGT 7860 

AGGAGGGATA GCAGGTATCA TTGGTGCTAG GAAAAAAGAG AC 7902 



(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 16627 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

GAAGTTGAAA TGGCCAGCTG ATGAGCAATA TCGGTCATAG AAATCTTCTC AATCAACTTT 60 

TGCGCAATTT TTTGGTTGAT AATACGAGGA ATTTGGTGAT TTTTCTTGAC GATAGAAGTT 120 

TCAGCGACCA TCATTTTTGA ACAGTGATAG CACTTGAAAC GACGCTTTCT AAGTAGAATT 180 

CTAGTAGGC A TACCAGTTGT CTCAAGGTAA GGAATCTTAG ACGGTTTTTG AAAGTCATAT 240 

TTCTTCAATT GGTTTCCGCA CTCAGGGCAA GATGGGGCGT CGTAGTCCAG TTTGGCGATG 300 

ATTTCCTTGT GTGTATCTTT ATTGATGATG TCTAAAATCT GGATATTAGG GTCTTTAATG 360 

TCTAGTAATT TTGTGATAAA ATGTAATTGT TCCATATGAA TCTTTCTAAT GAGTTGTTTG 420 

GTCGCTTTTC ATTATAGGTC ATATGGGACT TTTTTTCTAC AATAAAATAG GCTCCATAAT 480 

ATCTATAAGG GATTTACCCA CTACAAATAT TATAGAGCCA AAAATCCTTT GTTTACTAAA 540 

CAAGGGATTT TTCTTTTGTC TCTGCTCCTT TTTTGATATA ATAGTTCTAT GTTAAAATCA 600 

GAAAAACAAT CACGTTATCA AATGTTAAAT GAAGAATTGT CCTTCCTATT GGAAGGCGAA 660 

ACCAATGTTT TGGCTAATCT TTCCAACGCC AGTGCTCTCA TAAAATCACG TTTTCCTAAT 720 

ACCGTATTTG CAGGCTTTTA TTTGTTCGAT GGAAAGGAAT TGGTTTTAGG CCCCTTCCAA 780 

GGAGGTGTTT CCTGCATCCG TATTGCACTA GGCAAGGGTG TTTGTGGTGA GGCAGCTCAC 840 

TTTCAGGAAA CTGTTATTGT TGGAGATGTG ACGACCTATC TCAACTATAT TTCTTGTGAT 900 

AGTCTAGCTA AAAGTGAAAT TGTGGTGCCG ATGATGAAGA ATGGTCAGTT ACTTGGAGTT 960 

CTGGATCTGG ATTCTTCAGA GATTGAGGAT TACGATGCTA TGGATCGAGA TTATTTGGAA 1020 

CAATTTGTCG CTATTTTGCT TGAAAAGACA GCATGGGACT TTACGATGTT TGAGGAAAAA 1080 

TCTTAATGTA TCAAGCACTT TATCGAAAAT ATAGAAGTCA AAACTTCTCC CAGTTAGTTG 1140 

GTCAAGAAGT TGTGGCTAAG ACTCTTAAAC AAGCGGTGGA GCAAGAGAAA ATAAGTCACG 1200 
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CTTATCTTTT TTCTGGTCCT CX3TGGAACGG GAAAAACCAG TGTTGCTAAA ATCTTTGCCA 1260 

AGGCTATGAA CTGTCCCAAT CAAGTGGGTG GCGAACCTTG CAATAACTGC TATATTTGTC 1320 

AAGCAGTGAC GGACGGTAGT TTAGAAGATG TCATTGAAAT GGATGCAGCT TCTAATAATG 1380 

GGGTAGATGA AATTCGCGAA ATTCGTGATA AATCTACCTA TGCGCCTAGC CTTGCTCGTT 1440 

ATAAGGTTTA TATCATAGAT GAGGTTCACA TGCTGTCTAC AGGGGCTTTT AATGCCCTCC 1500 

TAAAGACGCT GGAAGAACCA ACACAGAATG TAGTCTTTAT TTTGGCCACT ACTGAATTGC 1560 

ACAAGATTCC TGCTACTATT CTATCCCGTG TGCAACGTTT TGAGTTTAAA TCAATTAAGA 1620 

CACAGGATAT TAAGGAACAT ATTCACTATA TCTTAGAAAA AGAAAATATC AGTTCTGAAC 1680 

CAGAGGCTGT GGAAATCATT GCCAGACGGG CGGAAGGTGG AATGCGGGAC GCCTTGTCTA 1740 

TTTTGGATCA AGCCCTGAGT TTGACACAGG GAAATGAGCT GACGACTGCT ATCTCTGAAG 1800 

AAATTACTGG CACCATTAGC CTATCAGCCT TGGATGATTA TGTGGCGGCC TTGTCTCAAC 1860 

AGGATGTTCC CAAAGCTTTG TCTTGCTTGA ATCTTCTTTT TGACAATGGT AAGAGCATGA 1920 

CTCGTTTTGT GACCGATCTT TTGCACTATT TAAGAGACTT GTTAATTGTT CAAACAGGGG 1980 

GAGCAAATAC TCATCATAGT TCAGTCTTTG TAGAAAATTT GGCACTTCCT CAAAAAAATC 2040 

TGTTTGAAAT GATTCGCTTA GCAACAGTGA GTTTAGCAGA TATTAAGTCT AGTTTGCAAC 2100 

CCAAGATTTA TGCTGAAATG ATGACCGTCC GTTTGGCGGA AATCAAGTCC GAACCAGCTC 2160 

TATCAGGAGC GGTTGAAAAT GAAATTGCTA CGCTGAGACA GGAAGTTGCC CGTCTCAAAC 2220 

AAGAGCTTTC TAATGTAGGT GCGGTTCCTA AACAAGTTGC ACCAGCTCCT AGTCGACCAG 2280 

CTACGGGCAA AACAGTCTAT CGTGTCGATC GCAATAAAGT GCAATCTATC TTACAAGAGG 2340 

CCGTCGAAAA TCCTGATTTA GCACGTCAAA ATTTAATTCG TTTGCAGAAT GCCTGGGGAG 2400 

AGGTAATTGA AAGTCTAGGT GGGCCGGACA AGGCTCTGCT AGTTGGTTCT CAACCGGTTG 2460 

CTGCCAATGA ACACCATGCT ATTCTTGCTT TTGAGTCTAA CTTCAATGCT GGTCAAACTA 2520 

TGAAACGAGA CAATCTCAAT ACCATGTTTG GTAATATCCT CAGTCAGGCG GCAGGTTTTT 2580 

CACCTGAGAT TTTAGCTATT TCCATGGAGG AATGGAAAGA AGTTCGCGCA GCCTTTTCAG 2640 

CCAAAGCCAA ATCTTCTCAA ACTGAAAAAG AAGTAGAAGA AAGCCTGATT CCAGAAGGAT 2700 

TTGAATTTTT GGCTGATAAA GTGAAGGTAG AGGAAGACTA AAGAAAGATT TCATGATACA 2760 

ATAAGTTTAT GAATAAACAA CAATTTATTA TTATGGCGCT GTTTACAGCT GCTGAGACCT 2820 

ATTTTTTCAA TGAAGCCTGG ATGACTGGCC GCTATATTAT GGCAGCCTTT TGGGCAATTT 2880 

TACTCTTTAG AAATTTCCGA GTCAGTTATG TGATGGGCAA AATCGTTGAT GTCATCGATC 2940 

AGCATTTTAA TAGGAAAGAC TAGCCCTCAG CTTCCAGACA AAATCAAAGC CTTTTAGGCT 3000 
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TTTTTTTGTT ATACTAGAAA AGTATATTTA TAGAATTTTT GCTCTATTTC TGGGGAAATC 3060 

AGACGTTTTT CTAGTAAGTA CTGTAAAAGT TTTGAAAAAG AAAGGAACTA TCATGTCAGT 3120 

ATTAGAGATC AAAGATCTTC ACGTTGAGAT TGAAGGAAAA GAAATTTTAA AAGGGGTTAA 3180 

CCTGACCCTG AAAACAGGAG AAATTGCCGC TATCATGGGA CCAAATGGTA CAGGTAAATC 3240 

GACTCTTTCT GCCGCTATCA TGGGAAATCC AAACTATGAA GTAACTAAAG GTGAAGTTTT 3300 

GTTTGATGGC GTAAACATCC TTGAGTTGGA AGTGGATGAG CGTGCGCGTA TGGGACTTTT 3360 

CCTTGCTATG CAATACCCAT CAGAAATCCC TGGAATTACC AATGCTGAGT TTCTTCGTGC 3420 

CGCTATGAAT GCGGGTAAAG AAGATGATGA GAAGATTTCA GTTCGTGAGT TTATTACTAA 3480 

GCTAGATGAA AAAATGGAAT TGCTCAACAT GAAAGAAGAA ATGGCAGAGC GTTACCTCAA 3540 

CGAAGGCTTC TCTGGTGGTG AGAAAAAACG CAATGAAATT CTTCAACTTT TGATGTTGGA 3600 

GCCAACATTT GCTCTTTTGG ACGAGATTGA CTCAGGTCTT GATATTGACG CTCTTAAAGT 3660 

TGTGTCTAAA GGTGTCAATG CCATGCGTGG TGAAGGTTTT GGTGCTATGA TCATCACTCA 3720 

CTACCAACGT CTTTTGAACT ATATCACACC TGATGTGGTA CACGTGATGA TGGAAGGTCG 3780 

TGTTGTCCTT TCTGGTGGTC CAGAATTGGC TGCGCGTTTG GAACGTGAAG GATACGCAAA 3840 

ATTAGCTGAA GAACTTGGCT ACGACTACAA GGAAGAATTG TAATTCCCTC GTATCTTTTA 3900 

GGAGAAGTAA ATGACTAGAG AAAATATTAA ACTTTTTTCA GAAATGCACG CTGAACCAAG 3960 

CTGGTTGGCT GATCTCCGTC AAAAAGCTTT TGACAAGATT GAGACTTTGG AATTACCAGT 4020 

TATTGAGTGT GTCAAATTCC ACCGTTGGAA TCTGGGTGAT GGAACGATTA CAGAAAATGA 4080 

GCCATCAGCA AATGTTCCAG ATTTCACAGC TTTAGATCAT CACTTGAAGT TGGTGCAAGT 4140 

AGGAACTC AA ACTGTTTTCG AACAAACTCC AGTTGAGTTA GCTGAACAGG GTGTTGTCTT 4200 

CACAGACTTT CACTCAGCTT TAGAAGAAAT TCCAGAGCTG ATCGAAGAAT TCTTCATGTC 4260 

ATCTGTTAAG TATGATGATG ACAAGTTGGC GGCTTACCAC ACAGCTTACT TTAACAGTGG 4320 

TGCTGTACTC TATATTCCAG ATAACGTAGA AATCACAGAG CCAATTGAAG GAATTTTCTA 4380 

CCAAGATAGC GATAGCAATG TGCCGTTTAA CAAGCATATT ATGATTATCG TTGGTAAAAA 4440 

TTCTAAGATT AGTTATCTGG AGCGTTTAGA GTCACGCGGT GAAGGAAGTG ACAAAGCAAC 4500 

TGCCAATATC ACAGTGGAAG TGATTGCACG TTCTGGTGCG CAAGTCAAGT TTGCTGCTAT 4560 

CGACCGTCTA GGTGAAAACG TCACTGCCTA CATTAGCCGT CGTGGTAAAT TAGGCAACGA 4620 

TGCAAGTATT GACTGGGCTA TCGGTGTCAT GAACGAAGGA AATGTCGTTG CTGATTTTGA 4680 

TAGTGACTTG ATTGGTAATG GTAGCCATGC TGACCTCAAG GTTGTAGCTC TTTCAAGTGG 4740 
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TCGTCAGGTA CAAGGGATTG ATACTCGTGT AACTAACTAT GGCTGCAACT CAATCGGAAA 4800 

CATTCTACAA CATGGGGTTA TCCTTGAAAA AGCAACTTTG ACTTTCAATG GTATCGGCCA 4860 

CAT CATC AAG GGTGCTAAGG GAGCAGATGC GCAACAAGAG AGCCGTGTTC TCATGCTTTC 4920 

AGACCAAGCG CGTTCAGATG CTAACCCAAT TCTTTTGATT GATGAAAATG ACGTAACTGC 4980 

AGGCCATGCA GCCTCTATTG GTCAGGTAGA TCCAGAAGAT ATGTACTACC TCATGAGTCG 5040 

TGGCTTGGAT AAGGCAACTG CAGAGCGTTT GGTTGTTCGT GGTTTCCTTG GATCTGTTAT 5100 

CGTGGAGATT CCAGTCAAGG AAGTTCGTGA TGAAATGATT GCAACTATCG AAGAGAAATT 5160 

GTCAAAACGC TAAGGGGCAG CCTATGTTAG ATGTAGAAGC GATTCGCAAG GATTTTCCAA 5220 

TTTT AG AT CA GATTGTCAAT GATGAACCTC TGGTCTATCT GGACAATGCT GCGACGAC AC 5280 

AAAAACCACT AGTAGTTCTG AAAGCTATTA ACAGCTACTA TGAGCAGGAC AATGCCAATG 5340 

TTCACCGTGG TGTCCATACC TTAGCGGAAC GAGCGACAGC TTCTTATGAA GCTGCTCGTG 5400 

AAACCATTCG TAAGTTTATT AATGCAGGCT CTACAAAGGA AGTTCTCTTT ACCAGAGGAA 5460 

CGACAACCAG CCTTAACTGG GTGGCACGCT TTGCTGAGGA AATTCTCACT GAGGGAGACC 5520 

AGGTCTTGAT TTCAGTAATG GAACACCATT CTAATATCAT TCCATGGCAG GAAGCTTGTC 5580 

GAAAGACTGG AGCAGAGCTT GTCTATGTCT ATCTTAAAGA CGGTGCCTTG GATATGGAGG 5640 

ATTTGCGAGC TAAATTGACT GATAAGGTTA AATTTGTTTC CCTAGCTCAT GCCTCCAATG 5700 

TTCTTGGTGT GGTCAATCCG ATCAAGGAAA TCACTCAATT AGCCCACCAA GTTGGGGCAA 5760 

TTATGGTAGT GGATGGTGCT CAATCTACAC CTCATATGAA GATTGATGTC CAGGACTTGG 5820 

ATCTGGACTT TTTCGCCTTT TCGGGTCACA AGATGGCTGG TCCGACTGGT ATCGGTGTCC 5880 

TTTACGGCAA AGAAAAGTAT CTTGAGCAAA TGTCTCCAGT AGAATTTGGC GGCGAGATGA 5940 

TTGATTTTGT CTACGAGCAA TTTGCTAGTT GGAAGGAATT GCCTTGGAAA TTTGAGGCTG 6000 

GAACGCCAAA TATGGCAGGA GCTATTGGAC TTGCGACTGC AGTTGATTAT CTGGAAAAGA 6060 

TTGGTATGGA TGCCGTTGAA GCTCATGAAC AGGAATTGAT TGCGTACGTC TATCCAAAAC 6120 

TGCAGGCAAT TGAGGGATTG ACCATTTACG GTTCTCAGGA TTTGGCTCAA CGTTCGGGTG 6180 

TTATTGCCTT TAACCTAGGT GATCTCCATC CTCACGATCT TGCGACGGCT CTGGATTATG 6240 

AAGGAGTGGC TGTTCGTGCT GGTCACCATT GTGCGCAACC CTTGCTTCAG TATTTGGAAG 6300 

TCCCAGCAAC AGCTCGTGCA AGTTTTTATA TCTACAATAC CAAGGCAGAT TGCGACAAAC 6360 

TAGTCGATGC CCTACAAAAG ACAAAGGAGT TTTTCAATGG CACTTTCTAA ACTAGATAGC 6420 

CTTTATATGG CAGTGGTAGC AGACCATTCG AAAAATCCAC ATCACCAAGG GAAGTTAGAA 6480 

GATGCTGAGC AAATCAGTCT CAACAATCCG ACTTGTGGGG ATGTCATCAA CCTCTCTGTC 6540 
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AAGTTTGATG CAGAGGACCG TTTGGAAGAT ATTGCTTTTC TAAATTCAGG ATGCACGATT 6600 

TCAACTGCTT CTGCTAGTAT GATGACAGAT GCCGTTTTAG GAAAAACCAA ACAAGAAATT 6660 

TTAGAACTGG CGACTATTTT TTCTGAAATG GTTCAAGGGC AAAAAGATGA GCGTCAAGAC 6720 

CAACTTGGAG ACGCGGCATT CTTGTCAGGT GTTGCCAAAT TCCCTCAAAG AATCAAGTGT 6780 

GCAACCCTAG CTTGGAATGC CCTTAAGAAA ACAATTGAAA ATCAAGAAAA ACAGTAAGAC 6840 

AAGTTTCTTT TGTCTTATGA ATTATTAGAA ATGAAGAAAG AAAGGATACT ATGGCTGAAG 6900 

AAAGAGTAGA ACCAAAACCA ATTGACCTTG GTGAATATAA ATTTGGTTTC CATGACGATG 6960 

TAGAGCGTGT CTTATCGACA GGAAAAGGAC TCAACGAAGG TGTTATTCGT GAATTATCTG 7020 

CTGCTAAGGG TGAGCCTGAG TGGATGTTGG AGTTCCGTTT GAAGTCTTAT GAAACCTTCA 7080 

AAAAAATGCC CATGCAAACT TGGGGAGCAG ACTTGTCAGA GATTGACTTT GATGACTTAA 7140 

TCTACTACCA AAAACCATCT GACAAACCAG CCCGTTCTTG GGATGATGTA CCTGAAAAGA 7200 

TTAAAGAAAC CTTTGAACGT ATCGGGATTC CAGAAGCTGA ACGTGCTTAT TTAGCAGGGG 72 60 

CTTCTGCCCA GTACGAGTCA GAAGTGGTTT ACCACAACAT GAAGGAAGAG TTCCAAAAAT 7320 

TAGGTATTAT CTTTACAGAT ACAGATTCCG CACTCAAGGA ATACCCAGAC TTATTTAAAC 7380 

AATACTTTGC GAAGTTGGTA CCGCCGACAG ATAACAAGTT GGCAGCCCTC AACTCAGCAG 7440 

TATGGTCGGG TGGAACTTTT ATCTACGTGC CAAAAGGTGT CAAGGTAGAT ATTCCACTTC 7500 

AAACTTATTT CCGTATCAAT AACGAAAATA TAGGTCAGTT CGAACGTACC TTGATTATCG 7560 

TTGATGAGGG AGCAAGCGTC TACTACGTAG AAGGATGTAC AGCACCAACA TATTCAAGCA 7620 

ATAGCTTACA CGCTGCCATT GTAGAAATTT TTGCTTTGGA CGGAGCTTAT ATGCGTTATA 7 680 

CAACTATCCA AAACTGGTCT GATAACGTCT ATAACTTGGT AACAAAGCGT GCTAAGGCTC 7740 

AAAAGGATGC CACTGTTGAG TGGATTGATG GAAACTTGGG TGCCAAAACG ACTATGAAAT 7800 

ATCCATCTGT TTACCTTGAT GGAGAAGGAG CGCGTGGTAC CATGCTCTCT ATCGCCTTTG 7860 

CTAATGCAGG GCAACACCAA GACACGGGTG CTAAGATGAT TCACAATGCT CCACATACCA .7920 

GCTCGTCTAT TGTGTCTAAA TCCATCGCTA AAGGTGGAGG AAAGGTTGAC TACCGTGGAC 7980 

AAGTCACCTT TAACAAGAAC TCTAAGAAAT CTGTTTCCCA CATTGAATGT GATACCATTA 8040 

TCATGGATGA CTTGTCAGCA TCAGATACTA TTCCATTTAA TGAAATTCAC AACTCGCAAG 8100 

TGGCTTTGGA ACACGAAGCC AAAGTATCTA AGATTTCAGA AGAGCAATTG TATTATCTCA 8160 

TGAGCCGTGG ATTGTCAGAA TCTGAGGCAA CTGAAATGAT TGTCATGGGA TTTGTAGAAC 8220 

CCTTTACAAA AGAACTTCCA ATGGAATACG CAGTTGAGCT GAACCGCTTG ATTAGCTATG 8280 
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AAATGGAGGG ATCAGTTGGA TAAAATTTGA TTTTATACTC TTCGAAAATC TCTTCAAACC 8340 

ACGTCAGCAT CGCCTTACCG TATGTATGGT TwCTGAtTCG TCAGTTTCAT CTACAACCTC 8400 

AAAACAGTGT TTTGAGCAAC tGCGGCTAGC TTCCTAGTTT GTTCTTTGAT TTTGAGTATT 8460 

AGATTTACTC AAAATCAAGG ATTTTGAAGA TGAACTTGTA TCAAAAAATC GCGGTTTAAA 8520 

ATCGCGATTT TTTATAATTT CTCGTTAACA AAGCGGACAA ACTGATTCCA CCAAACTTTT 8580 

AAGAAGAAGG CTTTTTCAAT TTTCTTGTCT GCTACCATTT CGAAACTAGG GCGCTCTGTG 8640 

GTGATGTAAC CTTGACCAAT CAAGTCCTTG TCTTCATAAG TCAAATGGCC AACCACTGTT 8700 

CCAGCTTCAA GTGGTGCTGG GATTGCTTTG GAATCAGGTG TGAATTGAAC AGATTGGGAA 8760 

GATTGATTCC CAACACGTTC GATTAGATAG ATATCCTCTG GAGCCACTGC AGTTACTGTA 8820 

TCTTCTTTTC CATCTTGTAC AGGGGCTTTG CTATCTTGAT AGGCATCGCC TTGTTGAACG 8880 

ATTTTGCGAA GTGTAAATGT AGAAGAAATA TAATCCATTA GGGAAGATGT AGCTGTAAAT 8940 

CGAGCGTAAG GATTATTGTC TTGATGATCT GCATTTAAAA CAACTGTGAT GACTCTCATG 9000 

CCTTTTTCGA CAGTAGTACC AACAAAAGAC TCTCCAGCCT TATCTGTTGT TCCTGTTTTT 9060 

AGCCCATCAA AACCACCACG GTAAGCAGGC ATACCTTCTA ACATGTAGTT GGTTGAAGTG 9120 

ATTGTCATCC CAGCAAAAGT AGAAGAAGGT TTTTTGGTGA TTTCTAAGAC TTGTGGGTAT 9180 

TTTTTGATGA GGTTGCGAGC AACGATAGCG ACATCATAAG CACTAAGCTT ATTTTCCTCA 9240 

TCTTTTTTAG AACCTGGGTA AATGTTATCC CCTAGAGTTT CATTGTTAAG ACCTGTCGTA 9300 

TTGACAACAG TGGCATCCTG AATTCCCCAT TCCAAGAGTT TTGCCCGCAT CATATCGACG 9360 

AAATCTTTTT CTGAGCCAGC AATTTTCTCA GCTAGGGCAA TAGCGGCGCT GTTGGCACTA 9420 

GATACCAGAG TTGCTTCAAG CAACTCTTCG ACAGTATAAT TACGGGCCTC CATAGGAATA 9480 

TTACTGGCTT CAGAATTTGT CGTCAATTGA TAAGGATAAT CAGAAATATC TACAGGAGTG 9540 

GAGAGGGTAA TACTTCCGTT TTCCAAAGCT TCATAGACCA GATAAACAGT AATCAATTTT 9600 

GTTATGGAAG CAATTTCGAC AGGTTGCGTT GCATCCTTCT CATAGAGAAT TTTACCAGTA 9660 

TTTGCCTCAA CAGCAATCGC ATGTTTAGCG GCAATGGTAA AATCTTGAGC AACAGCAGTA 9720 

GAAGCACCCC CTAAAAGAGA GACAGTTAAC AAAGTTAAAA ATATTTTTTT CATAGTAGTC 9780 

TTATTCTATC ATAAAGAAAA AAAATATTCT TGCTTTAATA ATTCATCTGT TAAGCTTTTT 9840 

GAAAATATGG TAAAATAAAG TAAGGGAGGT AACTCATGTT TCGTAGAAAT AAATTATTTT 9900 

TTTGGACCAC AGAAATTTTA CTCTTAACCA TCATCTTTTA CCTATGGAGA CAGATGGGGT 9960 

CTTTGATTAA CCCTTTTGTT AGCGTGCTTA ATACAATTAT GATTCCATTT TTATTAGGGG 10020 

GCTTTTTTTA TTATTTGACA AACCCTATTG TTACTTTCTT AAATAAAGTC TGTAAACTCA 10080 
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ATCGTTTGCT TGGTATTTTA ATTACCTTGT GTACTTTGGT CTGGGGAATG GTCATAGGTG 10140 

TTGTCTATCT CTTACCTATT TTGATTAATC AGTTATCTAG TTTGATTATA TCTAGTCAAA 10200 

CTATTTATAG TCGAGTACAA GACTTAATCA TAGACTTATC TAATTATCCT GCGCTCCAGA 10260 

ATTTGGATGT AGAAGCTACA ATTCAGCAGT TAAACTTATC CTATGTTGAT ATTCTTCAAA 10320 

ATATCCTAAA TAGCGTATCA AATAGTGTGG GGAGCGTCTT GTCAGCTCTT ATCAGTACTG 10380 

TTTTGATTTT GATTATGACT CCAGTTTTTT TGGTTTATTT CTTATTAGAT GGACATAAAT 10440 

TCTTGCCCAT GCTTGAAAGA ACGATTCTAA AGAGGGATCG CTTGCATATT GCAGGCTTAT 10500 

TAAAGAATTT AAATGCGACG ATTGCTCGCT ATATTAGTGG AGTTTCGATT GACGCAATCA 10560 

TTATAGGTTG TTTGGCTTAT ATTGGCTATA GTATTATTGG TTTAAAATAT GCTTTAGTTT 10620 

TTGCCATTTT TTCTGGTGTA GCCAATTTAA TTCCTTATGT GGGGCCAAGT ATTGGTTTGA 10680 

TTCCTATGAT CATCGCAAAT ATATTCACTG ATCCCCATAG ACTGCTGATT GCAGTGATTT 10740 

ATATGCTTGT TGTTCAGCAG GTAGATGGCA ATATCTTATA TCCTCGAATC GTAGGAAGTG 10800 

TTATGAAGGT TCATCCAATC ACGATTTTAG TTTTACTTTT GTTGTCAAGC AATATCTATG 10860 

GTGTAGTTGG AATGATTGTC GCAGTGCCAA CCTATTCTAT CTTGAAAGAA ATTTCTAAGT 10920 

TCTTATCCCA TTTGTATGAA AATCATAAAA TAATGAAAGA ACGAGAAAGA GAATTAGCTA 10980 

AGTAAAAGTC AGGAGAACCC TGATTTTTCT TTACTGGAAG TGGCCTTTAG ATTAGAAGAC 11040 

TGAAAATAAG TTAAAGTCTT AAACTAATTT TCACAGCTAA GAATAGTAGA AGTTAATCTG 11100 

ATAAAAATCG AAAAAACCAG TGGAATTCTG TGTCAGGGTA AGTTCCACTG GTTTTCATAG 11160 

TCTATTAAAG TTCGAATGAA ACCTATTTAT AGTAGATTGA AACTAGAATA GTACACCTCT 11220 

AATTCTAAAA CATTGTTAGA AATCGATTTG ACTGTCCTGA TCTATTGGTT CTATTCTTAT 11280 

TTCATTTTAC TATATTTTGG TGCAATAAGT GAAAAGTAGT CCGAATAATA TAAGGATTGA 11340 

TTTTATAGTT TTTAAACTCA AATGAATTGA AATAAAGAGA GTACGAAAAT TCTCATCTGA 11400 

AAGTATTTTA GAATAATTCT CTTCGTGAAT TTCTTCAAAA CAGATAGCTT CATCTTAGGT 11460 

ATGTGATTTC TTTTTGCATT TTTGAGTTAG ATAAGGTATA ATGATTTTAT TGTCTTTTGG 11520 

GGTCGTTACG GATTCGACAG GCATTATGAG GCATATTTTG CGACTCGTGT GGCGACGTAA 11580 

ACGCTCAGTT AAATATAACT GCAAAAAATA ACACTTCTTA CGCTCTAGCT GCCTAAAAAC 11640 

CAGCAGGCGT GACCCGATTT GGATTGCTCG TGTTCAATGA CAGGTCTTAT TATTAGCGAG 11700 

ATACGATTAA GCCTTGTCTA GCGGTTTGAT AAGAGATTGA TAGACTCGCA GTTTCTAGAC 11760 

TTGAGTTATG TGTCGAGGGG CTGTTAAAAT AATACATAAC CTATGGTTGT AGACAAATAT 11820 
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GTTGGCAGGT GTTTGGACGT GGGTTCGACT CCCACCGGCT CCATTATTCC TTTGCATTCT 11880 

TTTGCATTCC TTGGTAAAAC GTTGTTAAAT CAACGTTTTT TATTTTTATC TTTGGTATTC 11940 

CTTTGCATTC TTTTGCTAAA AAGGGAGTCA CAAACAGACC CTATTTTAAA AAAGGATAGA 12000 

AAAAAGGATA CAACATTTGT CGCATCCTAA AAATAATCTT TTTTCGACGG AAGACATGGG 12060 

ATTCGAACCC ACGCACGCTA TTACACGCCT ACCGCGTTTC CAACACGGCC TCTTAAGCCT 12120 

CTTGAGTAAT CTTCCAATAC TTACTCAAAT AGTCTACCAT AAAGGCTCTT ATCTTGCAAT 12180 

AAAAATTCTA GAAATAAGAA AAATGATAGA TTTTGAAAGA AAATGATAAA AAATGCTTGA 12240 

CTTCGAAAGA AAGTATGATA GAATGAATAG TGTAAACGAT AACAGGAGGT GATTCAGTGT 12300 

TAAAAACAGA ACGTAAACAA CTAATTTTAG AGGAGTTAAA TCAACATCAT GTAGTTTCTC 12360 

TAGAAAAATT AGTTAGTTTG CTAGAAACGT CAGAATCAAC GGTTCGAAGA GACTTGGATG 12420 

AGTTGGAAGC GGAAAACAAG CTTCGTCGTG TGCATGGTGG AGCAGAACTC CCCTACTCCT 12480 

TACAGGAAGA AGAAACCATT CAAGAAAAAT CTGTCAAAAA CCTTCAAGAA AAGAAATTGC 12540 

TGGCTCAGAA AGCAGCCTCT CTCATTAAAG AAAAAGATGT CATCTTTATC GATGCTGGAA 12600 

CAACAACTGC TTTTTTGATT CATGAATTGG TCAATAAGAA TGTTACAGTT GTGACCAACT 12660 

CCATTCACCA TGCCGCTCAG TTGGTTGAAA AGCAGAwTCC AACTGTCATG GTTGGAGGAA 12720 

ACGTCAAGAC GGCGACAGAT GCTAGTATCG GGGGCGTTGC TCTTAACCAG ATTAACCAAT 12780 

TGCACTTTGA CCGTGCCTTT ATCGGAATAA ATGGTGTTGA CGATGGCTAT TATACGACTC 12840 

CTGATATGGA GGAGGGAGCT GTGAAAAGAG CTATTTTGGA GAATGCCAAG CAGACCTACG 12900 

TCTTGGTGGA TTCGTCAAAA ATTGGACAAA CTTGCTTTGC CAAGGTAGCC CCACTCAAAC 12960 

GCGCTATCGT TATCACTAGT CAAGGGCATG AGCTCTTGCA GGTTATTAAG GAGAAAACGG 13020 

AGGTAATAGA AGTATGATTT ATACAGTCAC ACTCAATCCA TCCATTGACT ATATCGTTCG 13080 

TTTGGACCAA GTCAAAGTTG GTAGTGTCAA TCGTATGGAC AGTGATGATA AGTTTGCTGG 13140 

TGGGAAAGGA ATCAATGTCA GCCGTGTCTT GAAACGTTTG AATATACCAA ATACAGCGAC 13200 

GGGATTTATC GGTGGCTTTA CTGGTAAATT TATCACAGAT ACTTTAGCAG AGGAAGAAAT 13260 

CGAGACACGT TTTGTCCAGG TGGCAGAAGA TACTCGTATC AATGTTAAAA TCAAAGCAGA 13320 

CCAAGAAACA GAAATCAACG GAACGGGTCC AACTGTTGAA TCGGTTCAGC TAGAAGAATT 13380 

GAAAGCTATT TTATCTAGTC TGACAGCAGA AGATACAGTT GTCTTTGCAG GTTCAAGTGC 13440 

TAAAAATCTA GGCAATGTTA TCTATAAGGA TTTGATTTCC TTGACGCGCC AGACTGGTGC 13500 

GCAAGTGGTC TGTGACTTTG AAGGACAGAC CTTAATTGAT AGTTTGGACT ACCAGCCTCT 13560 

TCTTGTAAAA CCAAACAATC ATGAACTTGG AGCGATTTTT GGGGTTAAAC TCGAAAGTTT 13620 
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AGATGAAATT GAGAAATACG CTCGTGAGTT ACTGGCTAAG GGTGCTCAAA ATGTTATTAT 13680 

CTCTATGGCT GGTGATGGTG CCCTTCTTGT CACATCTGAG GGAGCTTACT TCGCTAAACC 13740 

AATCAAAGGA ACAGTCAAAA ATTCAGTTGG AGCTGGTGAT TCTATGGTTG CTGGATTCAC 13800 

AGGTGAATTT GTCAAATCAA AAGACGTAGT AGAAGCCTTC AAATGGGGAG TGGCTTGCGG 13860 

AACGGCAACT ACCTTCTCAG ATGACTTGGC AACGGCGGAA TTTATTAAAG AAACATATGG 13920 

AAAAGTTGAG GTAGAAAAAC GATGAAAATT CAAGACCTAT TGAGAAAAGA TGTCATGTTG 13 980 

CTAGATTTGC AGGCAACTGA AAAAACAGCT GTCATCGACG AGATGATTAA AAATTTGACA 14040 

GACCACGGTT ATGTAACAGA TTTTGAAACA TTTAAAGAAG GAATTTTGGC GCGTGAAGCT 14100 

TTGACTTCTA CTGGTTTGGG TGATGGAATC GCAATGCCTC ACAGCAAAAA CGCTGCTGTC 14160 

AAAGAAGCGA CAGTTCTATT TGCTAAGTCA AATAAGGGTG TTGACTACGA GAGCTTGGAT 14220 

GGACAAGCAA CTGACCTCTT CTTCATGATT GCAGCTCCAG AAGGTGCCAA TGATACTCAC 14280 

TTGGCAGCCT TGGCAGAATT GTCTCAATAC TTGATGAAAG ACGGTTTTGC AGACAAACTT 14340 

CGTCAAGCAA CATCTGCAGA CCAAGTTATC GAACTTTTTG ACCAAGCTTC AGAAAAAACT 14400 

GAGGAACTTG TTCAAGCACC TGCTAATGAC TCTGGTGACT TTATCGTAGC TGTTACAGCT 14460 

TGTACAACAG GTATTGCCCA CACTTACATG GCCCAAGAAG CCCTTCAAAA AGTAGCTGCT 14520 

GAAATGGGGG TTGGTATCAA GGTCGAAACC AACGGTGCTA GCGGTGTTGG AAATCAACTA 14580 

ACTGCAGAAG ATATCCGTAA GGCTAAAGCT ATTATCATTG CAGCAGACAA GGCCGTTGAA 14640 

ATGGATCGAT TTGATGGAAA ACCATTGATC AATCGTCCAG TTGCTGACGG TATCCGTAAG 14700 

ACAGAAGAGC TAATTAACTT GGCTCTTTCA GGAGATACTG AAGTCTACCG TGCCGCTAAT 14760 

GGTGCCAAAG CTGCAACAGC CTCTAACGAA AAACAAAGCC TTGGTGGTGC CTTGTACAAA 14820 

CACTTGATGA GTGGTGTATC TCAAATGTTA CCATTCGTTA TCGGTGGTGG TATCATGATT 14880 

GCCCTTGCCT TCTTGATTGA CGGTGCTTTG GGTGTTCCAA ATGAAAACCT TGGCAATCTT 14 940 

GGTTCTTACC ATGAGTTAGC TTCTATGTTC ATGAAAATTG GTGGAGCTGC CTTTGGTTTG 15000 

ATGCTTCCAG TCTTTGCGGG TTATGTTGCC TACTCTATTG CTGAAAAACC GGGTTTGGTA 15060 

GCAGGTTTCG TGGCTGGTGC TATTGCCAAA GAAGGTTTTG CCTTTGGTAA AATTCCTTAT 15120 

GCCGCAGGTG GTGAAGCAAC TTCAACTCTT GCAGGTGTCT CATCTGGTTT CCTAGGTGCC 15180 

CTTGTTGGTG GATTTATCGC AGGTGCCTTG GTTCTTGCCA TCAAGAAATA CGTTAAAGTT 15240 

CCTCGTTCAC TCGAAGGTGC TAAATCAATC CTTCTATTGC CACTTCTTGG AACAATCTTG 15300 

ACAGGATTTG TTATGCTAGC TGTGAATATC CCAATGGCTG CAATCAACAC TGCTATGAAT 15360 
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GACTTCCTAG GCGGTCTTGG AGGAGGTTCA GCTGTCCTTC TTGGTATCGT CCTTGGTGGA 15420 

ATGATGGCTG TTGACATGGG TGGACCAGTT AATAAAGCAG CTTATGTCTT TGGTACAGGT 15480 

ACGCTTGCAG CAACTGTTTC TTCAGGTGGT TCTGTAGCCA TGGCAGCAGT TATGGCTGGA 15540 

GGAATGGTGC CACCACTTGC AATCTTTGTC GCAACTCTTC TTTTCAAAGA TAAATTTACT 15600 

AAGGAAGAAC GTAACTCTGG TTTGACAAAC ATCATCATGG GCTTGTCATT TATCACTGAG 15660 

GGAGCGATTC CATTTGGTGC CGCTGACCCA GCTCGTGCGA TTCCAAGCTT CATCCTTGGT 15720 

TCAGCAGTAG CAGGTGGACT CGTTGGTCTT ACTGGTATCA AACTCATGGC GCCACACGGA 15780 

GGAATCTTCG TTATCGCCCT TACTTCAAAT GCTCTCCTTT ACCTCGTTTC TGTCTTGGTA 15840 

GGAGCAATCG TAAGTGGTGT GGTTTATGGT TACCTACGCA AACCACAAGC ATAAAAAATA 15900 

GAAAAATGAA AAGATTGGAC CGTTTGGTGC AGTCTTTTTC TCTTCCCGAA ATGCCTGTGA 15960 

AATATGGTAT AATAGAAGAA TGGCAAACAA GAATACAAGT ACAACAAGAC GGAGACCGTC 16020 

TAAAGCAGAA CTGGAAAGAA AAGAAGCGAT TCAACGAATG TTGATTTCGT TAGGAATTGC 16080 

GATTTTATTG ATTTTCGCAG CCTTCAAATT AGGGGCTGCA GGTATAACCC TTTATAATTT 16140 

AATTCGCTTG CTAGTGGGTA GCCTAGCTTA TCTGGCGATA TTCGGCCTAT TAATCTATCT 16200 

CTTCTTTTTC AAGTGGATAC GAAAACAGGA AGGACTCTTA TCTGGCTTTT TCACCATATT 16260 

TGCTGGCTTA CTCTTGATTT TTGAGGCCTA CTTGGTTTGG AAATATGGTT TGGACAAGTC 16320 

CGTTCTAAAA GGGACCATGG CTCAGGTTGT GACAGATCTG ACTGGTTTTC GAACGACTAG 16380 

CTTTGCTGGA GGGGGCTTGA TCGGGGTCGC TCTTTATATT CCAACAGCCT TTCTCTTTTC 16440 

AAATATCGGA ACTTACTTTA TTGGTTCTAT CTTGATTTTA GTGGGTTCTC TCCTAGTCAG 16500 

CCCTTGGTCT GTTTACGATA TTGCTGAATT TTTCAGTAGA GGCTTTGCCA AATGGTGGGA 16560 

AGGGCACGAG CGTCGAAAAG AGGAACGCTT TGTCAAACAA GAAGAAAAAG CTCGCCAAAA 16620 

GGCTGAGAAA GAGGCTAGAT TAGAACAAGA AGAGACTGAA AAAGCCTTAC TCGATTTGCC 16680 

TCCTGTTGAT ATGGAAACGG GTGAAATTCT GACAGAGGAA GCTGTTCAAA ATCTTCCACC 16740 

TATTCCAGAA GAAAAGTGGG TGGAACCAGA AATCATCCTG CCTCAAGCTG AACTTAAATT 16800 

CCCTGAACAG GAAGATGACT CAGATGACGA AGATGTTCAG GTCGATTTTT CAGCCAAAGA 16860 

AGCCCTTGAA TACAAACTTC CAAGCTTACA ACTCTTTGCA CCAGATAAAC CAAAAGATCA 16920 

GTCTAAAGAG AAGAAAATTG TCAGAGAAAA TATCAAAATC TTAGAAGCAA CCTTTGCTAG 16980 

CTTTGGTATT AAGGTAACAG TTGAACGGGC CGAAATTGGG CCATCAGTGA CCAAGTATGA 17040 

AGTCAAGCCG GCTGTTGGTG TAAGGGTCAA CCGCATTTCC AATCTATCAG ATGACCTCGC 17100 

TCTAGCCTTG GCTGCCAAAG ATGTCCGGAT TGAAGCACCA ATCCCTGGGA AATCCCTAAT 17160 
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CGGAATTGAA GTGCCCAACT CCGATATTGC CACTGTATCT TTCCGAGAAC TATGGGAACA 17220 

ATCGCAAACG AAAGCAGAAA ATTTCTTGGA AATTCCTTTA GGGAAGGCTG TTAATGGAAC 172 BO 

CGCAAGAGCT TTTGACCTTT CTAAAATGCC CCACTTGCTA GTTGCAGGTT CAACGGGTTC 17340 

AGGGAAGTCA GTAGCAGTTA ACGGCATTAT TGCTAGCATT CTCATGAAGG CGAGACCAGA 17400 

TCAAGTTAAA TTTATGATGG TCGATCCCAA GATGGTTGAG TTATCTGTTT ACAATGATAT 17460 

TCCCCACCTC TTGATTCCAG TCGTGACCAA TCCACGCAAA GCCAGCAAGG CTCTGCAAAA 17520 

GGTTGTGGAT GAAATGGAAA ACCGTTATGA ACTCTTTGCC AAGGTGGGAG TTCGGAATAT 17580 

TGCAGGTTTT AATGCCAAGG TAGAAGAGTT CAATTCCCAG TCTGAGTACA AGCAAATTCC 17640 

GCTACCATTC ATTGTCGTGA TTGTGGATGA GTTGGCTGAC CTCATGATGG TGGCCAGCAA 17700 

GGAAGTGGAA GATGCTATCA TCCGTCTTGG GCAGAAGGCG CGTGCTGCAG GTATCCACAT 17760 

GATTCTTGCA ACTCAGCGTC CATCTGTTGA TGTCATCTCT GGTTTGATTA AGGCCAATGT 17820 

TCCATCTCGT GTAGCATTTG CGGTTTCATC AGGAACAGAC TCCCGTACGA TTTTGGATGA 17880 

AAATGGAGCA GAAAAACTTC TTGGTCGAGG AGACATGCTC TTTAAACCGA TTGATGAAAA 17940 

TCATCCAGTT CGTCTCCAAG GCTCCTTTAT CTCGGATGAC GATGTTGAGC GCATTGTGAA 18000 

CTTCATCAAG ACTCAGGCAG ATGCAGACTA CGATGAGAGT TTTGATCCAG GTGAGGTTTC 18060 

TGAAAATGAA GGAGAATTTT CGGATGGAGA TGCTGGTGGT GATCCGCTTT TTGAAGAAGC 18120 

TAAGTCTTTG GTTATCGAAA CACAGAAAGC CAGTGCGTCT ATGATTCAGC GTCGTTTATC 18180 

AGTTGGATTT AACCGTGCGA CCCGTCTCAT GGAAGAACTG GAGATAGCAG GTGTCATCGG 18240 

TCCAGCTGAA GGTACCAAAC CTCGAAAAGT GTTACAACAA TAAAAAAATA GCTTCTTTCC 18300 

AAGTTTGGAG GGAAGCTATT TTAGTGGCTA TTGATTGCTT TTATTTTCTG AAGTTGGCGC 18360 

ATTGGACTGT TTTTCGTTTT CAGTAGCAGG TTTACTTGAA GCAGGAGTAG AAGAGTCCTG 18420 

AGTTGCTGTT TTCTGATCTT CTTTTTTCTC TTCCTTGACG CTAGATTTTG GTGTTTCCTC 18480 

TTGCTGTGTT TTTTCTTGAC TAGTGTTAGT CTCTTTAGTT GGACTGGTGT TTTCCTTAGG 18540 

GGATTCCTTT TGGATTTCTT TGACAATGGT TGTCGTCTGG CTTGTCGTAG GTTCTTTTTT 18600 

AATATTTTTG TTATTATCCA AGGCGTT 18627 
(2) INFORMATION FOR SEQ ID NO: 114: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2560 base pairs ( 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS : double 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

TAAAATACGT TACCTTGCTT CTGCACGTTC AGCAGGTAAG TCATTGAAAT TTAAAGATCA 60 

AGATATTACA ATTGAAGAAA CGACTGAAAC AGCTTTTGAA GGAGTTGATA TTGCTCTCTT 120 

TTCAGCAGGT AGTTCTACAT CAGCTAAGTA TGCACCATAC GCAGTAAAAG CTGGCGTGGT 180 

AGTAGTAGAT AATACATCTT ATTTCCGTCA AAATCCAGAT GTTCCTTTGG TTGTTCCAGA 240 

GGTCAATGCT CATGCACTTG ATGCTCACAA CGGAATCATT GCCTGCCCTA ATTGTTCAAC 300 

AATTCAAATG ATGGTGGCTC TTGAGCCGGT TCGCCAAAAA TGGGGCTTGG ACCGTATCAT 360 

TGTTTCAACT TATCAAGCCG TTTCAGGTGC TGGTATGGGA GCAATTCTTG AGACACAACG 420 

TGAACTTCGT. GAAGTCTTGA ATGATGGTGT GAAACCACGT GATTTGCATG CGGAAATCTT 480 

GCCTTCAGGT GGTGACAAGA AACATTATCC TATCGCCTTT AACGCTCTTC CACAAATTGA 540 

TGTTTTCACT GATAATGATT ACACGTACGA AGAGATGAAG ATGACCAAGG AAACTAAGAA 600 

AATTATGGAA GATGATAGCA TTGCAGTATC TGCAACATGT GTGCGTATTC CAGTCTTGTC 660 

AGCTCACTCT GAGTCTGTTT ATATCGAAAC AAAAGAAGTG GCTCCAATCG AAGAAGTAAA 720 

AGCAGCTATC GCAGCCTTCC CAGGTGCTGT TCTTGAAGAT GATGTAGCTC ATCAAATCTA 780 

TCCTCAAGCT ATCAATGCAG TTGGTTCGCG TGATACCTTT GTTGGTCGTA TCCGTAAAGA 840 

CTTGGATGCA GAAAAAGGAA TTCACATGTG GGTTGTTTCA GATAACCTTC TCAAAGGTGC 900 

TGCTTGGAAC TCAGTTCAGA TTGCTGAAAC TCTTCATGAA CGTGGATTGG TTCGTCCAAC 960 

AGCCGAATTG AAATTTGAAT TAAAATAGTC ATATCGTTTA GGAGTTCAGA TGAACTCCTT 1020 

CTTTGAAATA GAGAGGTGTT TTCGTGTCTT ATCAAGATTT AAAAAAATGT AAAATCATTA 1080 

CAGCCTTTAT TACCCCCTTC CATGAGGATG GTTCCATTAA CTTTGATGCT ATTCCAGCCT 1140 

TGATTGAGCA TTTATTGGCC CATCATACGG ATGGAATTCT TCTCGCAGGA ACGACTGCTG 1200 

AGAGTCCAAC TTTGACCCAC GATGAGGAGT TGGAGTTGTT TGCGGCTGTA CAAAAGGTTG 1260 

TCAATGGACG CGTTCCTTTG ATTGCGGGTG TAG G TACT AA TGATACGCGT GACTCTATTG 1320 

AGTTTGTCAA AGAAGTAGCG GAATTTGGTG GTTTCGCAGC TGGGCTTGCT ATTGTTCCTT 1380 

ACTACAACAA ACCTTCTCAA GAAGGGATGT ATCAGCACTT TAAGACTATT GCAGATGCTT 1440 

CTGACCTACC AATTATTATC TATAACATTC CAGGGCGTGT AGTTGTCGAA TTGACTCCAG 1500 

AAACCATGCT TCGCTTGGCT GACCATCCAA ATATTATCGG TGTCAAAGAA TGTACTAGCT 1560 

TGGCTAATAT GGCTTACTTG ATTGAGCACA AGCCTGAAGA GTTCTTGATT TATACAGGTG 1620 

AGGATGGAGA TGCTTTCCAT GCCATGAACC TTGGGGCGGA TGGGGTTATT TCTGTTGCCT 1680 
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CTCATACAAA TGGGGATGAA ATGCACGAGA TGTTTACTGC GATTGCAGAA AGCGATATGA 1740 

AGAAAGCCGC AGCAATTCAG CGTAAATTCA TTCCTAAGGT TAATGCTCTC TTCTCTTATC 1800 

CAAGTCCTGC TCCAGTTAAG GCAATTCTTA ACTATATGGG ATTTGAAGCT GGACCCACTC 1860 

GTCTACCTCT TGTTCCAGCA CCAGAAGAAG ATGCCAAACG CATTATCAAG GTTGTCGTAG 1920 

ATGGCGACTA CGAAGCAACT AAGGCAACTG TAACAGGGGT CTTAAGACCA GATTACTAAT 1980 

AAAGACAATA AAATCCGGCT CTTTGTCAAC TGTAGTGGGT TGAAGTCAGC TAAGCTCGAG 2040 

AAAGGACAAA TTTTGTCCTT TCTTTTTTGA TATTCAGAGC GATAAAAATC CGTTTTTTGA 2100 

AGTTTTCAAA GTTCCGAAAA CCAAAGGCAT TGCGCTTGAT AAGTTTGATG AGATTATTGG 2160 

TCGCTTCCAA TTTGGCGTTT GAATAGGGTA GTTGAAGGGT GTTGACGATT TTCTTTTTGT 2220 

CCTTTAGAAA GGTTTTAAAG ACAGTCTGAA AAATAGGATG AACCTGCTTC AGATTGTCCT 2280 

CAATGAGTCC GAAAAATTTC TCCGGTTCCT TATTCTGAAA GTGAAACAGC AAGAGTTGAT 2340 

AGAGCTGATA GTGATGTTTC AAGTTTTGTG AATAGCTCAA AAGCTTGTTT AAAATCTCTT 2400 

TATTGGTTAA GTGCATACGA AAAGTAGGAC GATAAAATCG CTTATCACTC AGTTTACGGC 2460 

TATCCTGTTG AATGAGTTTC CAGTAGCGCT TGATAGCCTT GTATTCGGGA TTTTCGATGA 2520 

AACTGATTCA TGATTTGGAC ACGCACACGA CTCATAGCAC 2560 



(2) INFORMATION FOR SEQ ID NO: 115: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11303 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

TATTGGATTT CCCTTGCAAT CAGTTTATGG GACAAGCACC CGGCAGCGCA GAGGAAATCA 60 

ACGCCTTCTG TAGCCTACAT TTTCAAACCA CCTTCCCACG TTTTGCCAAG ATTAAGGTCA 120 

ACGGTAAGGA AGCAGACCCT CTCTATGTCT GGTTACAAGA CCAGAAATCC GGCCCACTAG 180 

GAAAACGAGT CGAATGGAAT TTCGCTAAGT TTCTCATCGG TCGAGATGGG CAAGTCTTTG 240 

AACGCTTTTC TTCAAAAACA GACCCAAAAC AAATTGAAGA GGCGATACAA ACT CT ACT AT 300 

AATTCACAAT CTCACTATGA TTAGGTTTCC TTTAACCTGA TGAATAGTGA GATTTTTTGA 360 

TGGGCTTTGA CTTAAATAGA AAAACACCCC ATGATATGAA ACATGAAGTG TTGTAAAGTC 420 

TATGTTGTAG GTGCTTATTT CACAATTTCA ATGTGACCAG TGATAACGAA TACCATACAG 480 
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AATCTTCATA TACACTAAAC AAATGACTTT CTAATTATTT CAATTAGTTT TGGCTAGTAA 540 

ATATCATTTC CAACAAACGC CCTCTCAATT CCTTATCCTG ATGATGCAAG ATATTCATTA 600 

AGTCATGAGA GTTTTTCGCA TTGATGAATT GATTTAACAA TCTATCTTTT AATTCATATG 660 

GAAGAGAAGC TGTCTTTAGT AGTCTAAAAA CTTCGTCATT TAAAGATGTC CTTTTATTAT 720 

CTTTCCATTC AAATTTAGCT GTATCATTCT TATTTGGCAA TTCAATTATA GACACATTCG 780 

TTCCTTTAAA ATGAATTCTA TGTTTTCTAT TGCTTGGAAC GATACTAGAA TCTCCTTGTA 840 

ATGCTAACTC TACCATTCCC ATTTCCCAAT CGATTGATAA TCTTGTTTTA TATCTTTGAC 900 

CATTTTGATC TTCAAGCATT TCAAAAGAAT GTTGTTTTCC TGGGAATACA TACCAATCTA 960 

CAACTTCAGG TAAATCAACA CCCATACCTA TCTCAGAACC AACCAAGGGA ATGATTGCAC 1020 

CACTTTTTGC AAACACAGGC GTAGTCGAGA TGTCCCTATA AACACTTAAC TTCACACCAC 1080 

CTGTGTATTT TTTCTCTGAA AAGAAGTCAT ACCATTCACC TTCAGGGAAC CATACATCTA 1140 

CTTTTGCAGA TTGGAATGTC AAATCCATCT TTTCTACAAT GGGAGCCACC ATCAGTTCTG 1200 

TTCCAAAAAA GTATTGGTTT GGAACATTAT AGCTCTCATC ATTCTCTGGA TAGAAATAAT 1260 

AGATTGGACT GATTAATGGG GCACCTTCCT CATGTGTCTG TACATTCATG GTATATAGAT 1320 

AGGGAATCAT CTGATGTCTC AAACGAAGGT ATTTCTTCAT AATCTTAGAT GTTGTTTCTG 1380 

AAAAAAACCA AGGTTCTTTA CTATTAAAAG GACTTCTAGA ACTATGTAAT CGAGTAATCG 1440 

GACTAAAAAC ACCAAACTGT AGCCATCTAG TTTGTAGCTC TTCGTCATAA TCCCCCAACA 1500 

TATGTCCACC GATATCATGA CTCCACCAAC TATAACCGAT ATTAGATGCT GTCGCTGTAA 1560 

AATAGGGTTG AAATCTTAAG GAATTCCAAC TAATAATAGT ATCCCCTGAA AAACCAACAG 1620 

GGTAGCGGTG ACTACCAGGA CCTGCATATC TTGATAAAAT CAAACCACCT TCTGCATTTT 1680 

TACAACTATC CTGATAGTGA TAATGGTTTA AAAGCCAAAG TGGATCTAGC ATACCTTGTG 1740 

TCCCTTGTTG CCAGTCAATC CACCAAAAAT CTACTCCCTG CTTTTCTAGT TCATAATGAA 1800 

CATCTTTAAA GTAGGCTTCC CTAAAAGAGG GATTAAAAAA ATCAAAAATA GCAGGTTCTT 1860 

CTAGTTCTAC ATTTAACCCC AACCGTTTTG CGATTTGAGG ATAAGCTTCT TCATAAGCCC 1920 

GTATCCCATC AGCAGGATGG ACATTTAAGG AGAGTTTTAG CTTTCTATCA TGAAGTTGTT 1980 

GCAATAACTG TTCTGGATTT GGTATTAAGT TTCTATTCCA ACTATATCCT GTCCAGCGAC 2040 

TTCCAAAGCG AGCTGGAATG TCAGTTATAT GCCAATCCAT ATCTAACACA CCGATAGATA 2100 

ATGGAATTTT CTCTGTTTCA AATCTGTCTA TTAAATCCAA GTATTCATCC GACGTATAAG 2160 

GCCAATATCT ACTCCACCAA TTGCCTAAAG CATATCTTGG CAACAAGGGT GTTGAACCAG 2220 

TCAAATGGTA AAAATCTCTG ATTGCTCCTC TATAATCATG CCCATAGGCA AAGAAATACA 2280 
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GGTCAATTTG ATTTTCTCTC TCAATATAAC CAGATTGTTC ATCCCAAATA AATCCTTGAG 2340 

AATCATCCAA TAAGGCTATA CCATTTCGGC TAATAATTCC ATCTTCTAAC GAGATTGCTC 2400 

CATCTGCCTT ATCCAGAGTC CGAGCTGTTC CTTTTAACGT TTCAATAGAT TCACCAAAAT 2460 

ACCAGCGACT ACCATATACG GCAAAATTTC CTTTTAATTC TATAAATAAA TTTTCGGCGT 2520 

TAAATTCTCC TTTATTAAAG TGCAGATGAA AATAGTCCGT CATAATATCT AGTACGTTTG 2580 

ATGTCTCGAT ATAATCTAAC GAAATTTGGC CAAAATCTCT ATTATAGATA AGTTGTGTCG 2640 

TTCTATCCTC AAAACTTCCA GTTTGAGAGT ATTCTAACCT TACTAGCTTG TCTGTTAATA 2700 

CAGAGATTCG ATAAAACTCT CCCTTAAAAA TTTTCAATTT GTTTTCCTCC TTTTATGGTA 2760 

GCATAAAAAC AGAACGCACC ATTTTTGATG CGTTTTTCAT TATTCTGAAT GCAATGTTCT 2820 

ATCTGTTATA TCTATGACAA ATAATAGTCA ATTGAAAAAA TGCAGTGGAC AAAATATCTT 2880 

TTAAGAAACC AAGAGTTTAT TAAAGAGTTA TCACTTTTCA ACTTTTCTAA GCTTATGCAG 2940 

TTGTGAAACA AACTACTTTT AAACTATTAA CTAAGATAGG ATTGATAAAT AATTTCAAAC 3000 

TCTTACTAGC AATCATACGA TATTCAAGCT CACGTGCTTT TTTCCTTCCT GCTTATTTCT 3060 

TAGAACTGAA GAACCCGGAT CGGTATATAA ATTATCCGGA TCAACATAGT CATAAGATTC 3120 

ATAACAGTTG CGCTTCATTA AGTCATCCCC AGAGCAAGAG CTTCATCTCG TAATTTTTCA 3180 

ACATCACTAA CCGTAGGTCG CCATCCTTCA ATCATATTTG TACTTAAAGC ATACCAAACA 3240 

CTCTTAAAAA CGGATCGGTT TTCAAAAGCT ATTCCCATGA TTGTCATCTT TTCTTTATCT 3300 

ATATCTAAGG ACATATGCTA CCTCCTTTAG ATACATTATA CCATGTTTCT CTGTAGCTTT 3360 

TAAAAATTTT ATTTTGTTTG TCATATCTAA GTTTTCAGCA CGCTTATCCT ATTTTATAAG 3420 

CCTCAAACCC AAATATAAAA CGCATTCTTT TTGCTTTTTT ACTATTGTAT CGTATTCTAC 3480 

GATAACATAC TTTACTTTAT TGTTTTTTTA AATAACAGCA GTTCCCTGTT TATCAACTAT 3540 

TCGAACTACT TTCTATTTTG CTTCATACCC TACATAGCGA AAAAATATGA AAAAGCAGAG 3600 

AAGAATATCT TAAAAAGACC TCTTCACTGC TAATATTAAC ACTCATTATT TAAACTATAT 3660 

GGATTCTATC ATCGAGTATA CTTTTTTACT TATTAGATAC CTTGCTCTTC TTTCACCAAT 3720 

TTTTGATCAT ATACACGGAT GAATGGAAGA TAGACTAGGA ATGCTGCAAA TGCACATACT 3780 

AGAGCAACTA ATACAGCTCG AAGATCTGCT GTCCCTAAGA AAGCTCCAAT CCCTACTGGA 3840 

GTTGGCCATG GAACCTGTGC GATAATTGGC TTAATAAAGT TTAGAGAATT CGCTACGTAA 3900 

TAAATAGTAG CAGTAACCAT TGGTGCTAAA ATAAATGGTA TAGCCAAGGC TGGATTATAG 3960 

ATAATAGGTA ATCCAAAAAT TAATGGTTCA TTAATATTAA ATAAGGCTGG AACTACAGAT 4020 



WO 98/18931 



PCT/US97/19588 



834 

GCTCGTCCTA TTGCTTTAAG CTGTTCAGAT TTAGAGGCAA AAGCAATATA TAAACATAGT 4080 

CCTAAAGTTG CACCAGAACC ACCTGCAATT ACAAACATAT TAGAAAATTC ACCTGCAACA 4140 

GCGAAGTGCC CGCCAGCAGC ATTTTCAGCC ATGTTAGCAA GAGCAATTGG ACTAACAAAT 4200 

GCAAAAACAA TGTTCGCACC GTGGATACCT ACAATCCAAA GTAGTTGAGT CAATAGATAA 4260 

ATAATCATTA AACCAATCCA CGAATTAGTC AGATTGGATA CAAAACCAAA TGGAATTGCA 4320 

ATGACTTTAA AAATATCTGT TCCCATTGCT ACAAGAAGAC CGTTGATAAA GATAACAACA 4380 

AATGCAACAA CAAATCCCGG AACCAAAGCG GTAAATCCAC GAGAAACTCC TTCTGGAACA 4440 

GCTTCAGGCA TTTTAATAAC CCAATTATGT TTAACACACA TACGATAAAT AAGAACAGTC 4500 

ACAATTGCCA TAATGATTGC GGTAAAAATC CCTGTTGTCC CAAAACGTGC GACTACATTT 4560 

CCCATTGCCC ATCCATCTGC AATTACTGCA CCTTCTTTTA GACTTGTCAC AGTCTTCATC 4620 

ATTCCACCAT CAAAAATGAT TTGCGGTACT GTCATGACAA AAGCCATCAA GGCAAGCAAG 4680 

GCACCATTAA GAGGATTCAT ATTGAGTTCT TCTTCCTCTG CATAAATTTT TGTCAATTCA 4740 

TATGCAAGTG ATAGAACGAA ATAAAGAGAT AGAGAACCCA TAGTCGCATA GTTTGCAACC 4800 

ATGTAAAGTG ATGTGAATTT ATCAAATGAA GCAGAGAAAA TATCTGCCAC AATTGGCCAA 4860 

AATGAGAAAG CTTGTGGCAA AATACTGAAT ACCAAAAACA TTGATCCTAC AATAGTAAAT 4920 

GGTACAGCAG CCATACCTGC AGCCGTGATA GCACGTACTA CTTTAAACTG AGCAAGTTTG 4980 

CCCATTGGTC CCATAACATG GTTTTCAAGA AAACCAAACA ACCCGTTTTG TTGATCCATA 5040 

AATAGACCTC CTTAATAAAA CATAATAATT TTTACTTTCT AAAGACTAGT TTCAAATACA 5100 

AATTATACTA GATCAGGATT ATAAACTAAG TGAGTTCTTT TCCAATTGGA CAAATTGTTG 5160 

ATAAGCCTTA TCTGTTCGTT TATAAATTTT TTTAATTCTT CTAATGTCTA ACAAACTCAG 5220 

AACTAAACCT AATAGAAGAA CTACAAAAAC AAATAAACGT GCTACTTGGT TATTTTCAAA 5280 

AATCGGAAAA AGATTCTTAA ACCAACTTGT CCAAGTTAAA ACAAGTAATC CTATTGAAAT 5340 

AAGCATTTGT ATTCTAACAA ACATTAGTGT TATTCCCAAC TTTTCTTTCC TATTTCCATA 5400 

AAGTTTAAAT TGTTCAACAG TTGCTAAAAT AGAAAATACT ATGAGCATAA TGGGGAAAAT 5460 

AATAATAGGC GAGGGACTAA TAAACTGACT CAAAAGCCAA TAAATATTCC CAAAAAAGAA 5520 

GAGTGCTATT GAATAACGTA GAAGAAGATA TCGATTGAAA AAAGTATTAG TTAGAGCCAT 5580 

CTCTCGACGT TGTTGTTCAA TCTTTTGTCG TTCTTTTTTA TCCATATCAT TTCCTCCTTA 5640 

TATAACAACA CATATTTAGT TAACTTTCTT ATAAAGAGCT AACATTTCCT TTGCTACTTC 5700 

TAATAATGTC ATAGTGGTCA TTAAATGATC TTGAGCATGT ACCATGATAA TTTCAATTTT 5760* 

AATTTCCACT CCACTTGCGT ATTCTTGCAA GAGTTTGGTT TGTGCATGAT GCGCTTCAAG 5820 
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AATTATCTCA TTTGATTGAT TTAATTTACT TTCTGCATCA TCAAAACTAC CTTCTCTCAT 5880 

TTTTGCAAAT GCTTCATGTA TTTCTGACCT TGCATTTCCC GAATGCAGGA TAATTTCAAA 5940 

TGCTGCAACC TGCAGTTCCT CTTGATTCAT ATAAACCTCC TATTTTATCT TCTCAAATAT 6000 

GTTAATAAAA TCTTCAAAGT TATTGCAAGA TATTAGCTGA TTTTGCAATT CATCATTCTC 6060 

TGTCAGAGAG ACTATCTTTT TAGTCACAGT TGCCAAACCT TCGTTCCCAT ATATTGATGG 6120 

AGATAGAAGA AATACTAGCT GGACATGTGA ACTTTGATTA TCCCAGAGTA ACGAATCTTT 6180 

ACAAATTGCA ACCGAAACCT TTCCCTCTGT ACCAAAGGGC TGAATAGGAT GCGGAACTGC 6240 

AATTTTTTCA GAAAAAACAA CTGAACTTAA TTCTTCGCGC TGTTTAATTC CATAAAGTAA 6300 

AGATTGTTCA AACTCATTTG ATTCACCAAC AGATAAACTC TCAACCATCT TTTCAAGTAA 6360 

ATTTACCTTG TCTGATTCAG TACATATTAA AAAGTTTTCT TTACTAAAAT ACTGTCTAAA 6420 

GCCGTTGTTT TCAAATTTGT TAATCTTTGA TGATTGTACA TAACTAGAAA CTTGCATCTA 6480 

ATCCATAGCT TTTCTAATCA TTTCCATCTC ATCACTCTTA AGAAACACAC TAACTTTAAA 6540 

AACTGGGATT TGAAAATATA GATTTGATAA ATCAATAGCT GACACTATAA AATCTATTCC 6600 

TTTAAGTTTT TCTTGATTCA ATTCATAGTA GCCTATTACA TCAACAACTT CTACTCGCTT 6660 

CCCAAACTCC GTTTCCAAAC GATTTCTTAA CATTTGGGCT GCACCAAATC CTGTTGCACA 6720 

AATAGCAAGA ATATTAAACT TAGTACTCTC TTTGCTACGT TCCATAGCAG CTAAAAAGTG 6780 

AAGACTTACA TATGCTACTT CATCATCTGA TATTGTCCAC TCCAAGAACT TGTCCATATT 6840 

TGCAAGAATT TCTCTAGTCA TAAAGAATAT ATCACTATAA TTCTGTTTAA TTTCATCTAC 6900 

CAAAGGGTTA TTTAAGGTAA TCCGGCTTTC TAAACGTACT TGTAGTGTCA TTAGATGAGT 6960 

TATCAATCCT TCAATTAGTT GGAAATCTGA AGAAAAGTTA TACATATCAT CTAATCCTAA 7020 

ATTCTGAAAT GTTTTAAATA AAGATTTTTT TAAAACTTCT TCAGAAATAT TCTTCTGATT 7080 

TTTTTGACAT TGTTGACTCT TAGCTAACAA ATGCAAAGTA ATGTAGTCTA TTTCCTGAAC 7140 

TGGAAATTCC TGATTTGTTA CTTCTCTTAC TTTAGAAAGA ATTCTTTGGG CAACCTTTCT 7200 

CTCTATTGCA TCATCAGTCA TCTGACAGTC TATATTTTTT ATTTCAAATC CGGATTTTAA 7260 

ACGAATCACA GACAATGCTA TGTGAACTAC TAAATTCTGT AGTACAAAAT CAGATAGTTT 7320 

TAGGTTGGCC TCTTGGCATT CATCCAAAAC AATTCTAGCA AATTCTTCTA ATGGAACAGT 7380 

TTGATCAAAA AAGTTAAATT TTACATAGCA ATGTATTGTT TTAAAAAATT GATTCTCTAG 7440 

GAAATAATTT ATGATAAAAC GTCGTTTATC ACGTTCCTCG CCTGAGACAT AAACTCCTTT 7500 

ATTCGCCCTA CTCTCAATGG ACAAATTATA CTCTGATAAC ATCACTCGTA TCTTTCTGAA 7560 
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ATCATGAGAT 


AATGTTGAAC 


GACTAACGTA 


AAGTTCATCA 


GCTAAATCAT 


CAAAAAGAAC 


7620 


TGGAACTTGC 


TCAAATAATA 


ATTTATTTAA 


GATAAATACT 


AAACGATCAT 


CACCTTTTGA 


7680 


AACCGCAGTT 


TTCGTATAGT 


CTTCTTCCAG 


TTCATAAGTT 


TGTCTAAACT 


CCTGGTAAGC 


7740 


GCCTTGATTC 


TCAAAAAATA 


TTTGATACCC 


TTGACCTTGT 


TTTGAAATCA 


ACCGGACTCC 


7800 


TTGAATAATC 


ATTGTCTTCT 


CAATTAATTT 


CAGTACATTA 


CGGACAGTTC 


TATCTGAACA 


7860 


GGATAAATAT 


TCTGCCAGTT 


CTTTGCTTGT 


AACAAAACGT 


TCCTTATTTT 


TTATTAAAAA 


7920 


TTGAAGGATA 


TCTTTCTCTT 


TAATGTTTAA 


CACATTCATT 


CCCTCCTAAA 


ACGTATGTTT 


7980 


TCATATATTG 


AAGCATATTA 


TACACTTAAA 


TCAGTTTATA 


TCAAACTCAA 


AACAATTTAT 


8040 


CTTAACCTAA 


ATATTTATTG 


ACATTTCATG 


TGTTCATCAA 


ATATTCTCAA 


GAATCAAATT 


8100 


AGCCATTTTT 


TCAATTCCCA 


TTGGAATAGG 


AATATAGGCT 


TGAGGAGGTA 


TTTGTACAAC 


8160 


TGGTTTTCCT 


GCTTTAGAAC 


CAGCCTCTTC 


AAATTGCTTA 


AAGTACATTT 


TTGTTTGAGG 


8220 


ACTGACAAGA 


TACAAATCAA 


AAGCTGCTGC 


TGCGATAGCT 


TTCCCTCCTT 


CAGTAGCACT 


8280 


AATAGCATCA 


ACTACAATAT 


CTTTCCCTTT 


TCCTTTTAGA 


AACTCTGTTG 


TTTTCTGTGC 


8340 


CATAAGTGAT 


GAAGACATTC 


CTGCTGCACA 


AATAATTAAA 


GCTTTTGCCA 


TAATATTTTC 


8400 


TCCTTTTCTT 


AAATCCAATC 


AAAGCTGTGC 


TAAGTTGGCT 


TATTTGTTAT 


CTATTTTTAT 


8460 


TATAAAATAA AGCGTTTCCA 


ATGACAATTC 


CCTCATTTTC 


CTAAATGATA 


TGGAAAAAAA 


8520 


TTATTTATAC 


TTCAATTTAT 


AAAATAAAAT 


TATTCCTGAG 


AGTAGAAATG 


AAACACTATT 


8580 


TGCTAAAATC 


AAAGGCAAGT 


CTCCTATACG 


AATACCATGA 


GCAAGCCACA ATGCAATACC 


8640 


AATAACTTGC 


ATAACATACA 


TACCTAGAGC 


AATAGATCCT 


GTGTCCTTTG 


TCTTAACTAC 


8700 


ACGAAAAACT 


TGTGGTAAAA 


ATGCAAATGT 


TGTTAAAATT 


GCTGCAATAC 


TTCCAATCAT 


8760 


ATGTCACCTC 


AATATGCTAA 


ACAAACTGAG 


AATAATCTCA 


GTTTGTTTAT 


ACTATTCTAC 


8820 


TGATTCACCG 


TTAGATGAAA 






AAAGATTTTT 


TCGGGGAACG 


QQQft 


ATTATAACTT 


CCCTTCCCAT 


TATCATCTTT 


ATCTACATAA 


ATAAAGCCAT 


AACGTTTCCG 


6940 


CATTTCACCG 


GTACCAGCTG 


AAACCAAATC 


AATACATCCC 


CATGGAGTAT 


AACCCATTAA 


9000 


ATCAACACCA 


TCTTCAACTA 


CAGCCTTTTT 


CATTTCACGA 


ATATGGGCAC 


CTAGATATTC 


9060 


AATTCTATAA 


TCATCATGTA 


CCATACCATC 


TGCTGCAACT 


TGATCTATAG 


CTCCAAAACC 


9120 


ATTTTCAACA 


ATAAAGAGTG 


GTAAGTGATA 


GTGGTCTGTA 


AACCAATTTA ACGCATAACG 


9180 


CAAACCTTCT 


GGATCAATTT 


GCCACTCCCA 


TTCAGAAGCC 


TTAACATAAT 


TATTTTTCAC 


9240 


TAAATCTTCT 


GTTTCAAGAT 


AATCAAAATA 


AGGATTATTT 


TCACGATGAG 


AGTCGATAGC 


9300 


AAAGGACATA 


TAGTAACTGA 


AACCAATGTA 


ATCTACAGTC 


CCACCAAGTA AATCTTCTTT 


9360 
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ATCCTGGGCA GTAAAATCAA CTGAAATACC TTTTCGTTCC CAATACTTGA AAATATGCTC 9420 

AGGATATTTA CCTAAAACAT GCACATCAGC AAAATAATAA CGCTTCTGCA TAGCTTTCAT 9480 

TGCCATTAAG ATATCCTTAG GATTGCAAGT AACTGGATAA ATTGGACACA TCGCAATCAT 9540 

ACAACCTATT TGAAAATCTG GATTAATCTC ATGACCAATT TTTACAGCTC GTGCAGAAGC 9600 

AACTAATTCG TAATGTGCTG CTTGATACAT AATTGCTTCT CTATTATCAC CTTCCTCATA 9660 

TACAATACCT GAGTTAGTAA ATGGTGCAAA ATCTTCCTGA TAATTCGCTT GATTATTGAT 9720 

TTCATTGAAA GTCATCCAAT ATTTAACCTT ATCTTTGTAA CGTTTAAATA CGACTTCTGC 9780 

AAAACGAGCA AAGAAATCAA TCAATTTCCT ATTTTTCCAA CCACCATATT CGGTCACTAA 9840 

GTGATAAGGC ATTTCAAAAT GAGATAGAGT GATGACAGGT TCAATACCAT TCTTTAAGCA 9900 

TTCATCAAAA AGATTATCAT AAAACTGTAA TCCTTCTTCA TTCGGCTCTA ACTCATCACC 9960 

f 

TTTTGGAAAG ATACGTGTCC ATGCAATAGA GGTACGGAAG CACTTGAATC CCATTTCAGC 10020 

AAAAAGTGCT ATATCTTCTT TATAACGGTG ATAAAAATCT ATCGCCTCAT GATTTGGATA 10080 

ATATTTACCC TCTAAAACTC CCAAAGTAAT TTCACGAGCT ACTCCATGAC GACCAGCAGT 10140 

CATAACATCA GCAACACTAA TTCCCTTGCC ACCTTCTTGC CATCCACCTT CAAGTTGATG 10200 

AGCAGCAACA GCACCACCCC ATAAAAATCC ATCTTTAAAA GTAGTCATCT TTTTTCCTCC 10260 

TGACTTTGAT ACTCTTATTA TAAACCTTAA ACCAAAAGAT GAAAACGCAT TCTTTTTCCT 10320 

TATTGTTAAG GAAAGAAGTA ATTTTTAATG GAAATAGAAC AATATCTTCT TGTATTCTCG 103 80 

TAATGATATC TTTACGATTT TCAATACTTT CAAACTACAA AAACTCTCAC AATAATTCTA 10440 

ATTCCCTGTG TCTATAAACG ACTTATCGCT TTCTGGd'ATC CCAGAATCAT CTTCTATATA 10500 

ACGTTCAACT TGGATCTGCA AGTGATATTT TTTTCTTAAA TCTAAGATTT TCTGCATTGT 10560 

CTTTGATTGA TAATGTTTAT CTAAAGTTTC TTGATTTATC CACTGATCAA TAAGGAGAAT 10620 

AGTTCCCTCT TTTTCAATTG GTAAAAAATA TTCGTATTTC AAGTTACCTT TTTGATTTCT 10680 

AATTTCTTTA ACAAGGCCAC TATCAAGCAT TTCTCTTGCA AACTTTATTG CACTATCTCC 10740 

ATCACCTTTA TAATATACAT GAATAGTCAA TGTCATCTTA TATCCTCCAA AATCATCCTT 10800 
CAATTTTAAA AAAACAAGTT TAGATGAGGA TCTAAACTTG TTTTTTATGA ACTAATTATC : 10860 

TAACGTTTCG CCATTACTTT CAATCACTTC TTTATACCAA TAAAATGATT TTTTCTTATA 10920 

GCGATTTATA GTCAATTGAA ACAAGAGCAG GACAAAAGAG CCTCATAAAA GGTATTGCAA 10980 

CTTGGTAATA CCTTTTTGAG GTGCTTTTTG ATATGAGCCC ATGTTTTCTC AATAGGATTG 11040 

TACTCAGGTG AGTAGGGAGG AAGAGGTAAA AGTTTATACC CAAACTCTTC ACACAAGAGT 11100 
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TCTAGCTTCC CCATTCTATG GAATCTTGCA TTATCCATAA TAATAACCGA TGGTGTGGTT 11160 

AATGTTGGTA AGAGAAACTT CTGAAACCAA GCTTCAAAAA AGTCGCTCGT CATCGTCTCT 11220 

TCGTAAGTCA TTGGAGCGAT TAACTCACCA TTTGTTAGAC CTGCAACCAA AGAAATCCTC 11280 

TGATATCTTC TTCCAGATAC TTT 11303 



(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; double 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

CCTTAGATTT CCACTTGCCA GAGGAATTGA TTGCCCAAAC GCCCCTTGAA AAACGTGATG 60 

CCTCCAAACT CCTCATCGTC AACCGTGAGA CAGGAGAAAT GCAAGATAAA CATTTCCACT 120 

CTATTATTGA TATGCTGGAA CCTGGTGATG CCCTTGTCAT GAACGACACC CGAGTTCTCC 180 

CTGCCCGCCT CTATGGTCAA AAAGTGGAGA CAGGAGGTCA TGTGGAACTT CTCCTCCTTA 240 

AGAACACTAG TGGAGACGAG TGGGAAGTTC TGGCTAAACC TGCCAAACGC CTCAAGGTCG 300 

GTACTCGTAT CAGCTTTGGT GATGGCCGCC TCAGCGCTGT CGTTACAGAA GAATTGACCC 360 

ACGGGGGACG CATTGTCCGC TTTGAATACC AAGGAATTTT CCTAGAAGTC TTGGAAAGTC 420 

TGGGAGAAAT GCCTCTGCCA CCTTATATCC ACGAAAAATT AGATGACCGT GAACGTTATC 480 

AAACCGTCTA CGCCAAGGAA AGTGGCTCTG CTGCAGCACC GACTGCTGGT CTTCACTTCA 540 

CCAAAGAACT GCTGGCAGAA ATCCAAGCTA AGGGTGTTCA TCTAGTCTAT CTGACTCTCC 600 

ATGTCGGACT CGGAACCTTT AGACCTGTTT CTGTGGATAA TCTGGACGAA CACGAAATGC 660 

ACTCAGAGTT CTATCAACTT TCTGAGGAAG CTGCTGCCAC CCTTCGCTCT GTCAAAAAAA 720 

ATGGTGGTCG TGTCATCGCT GTCGGAACCA CTTCTATCCG CACCTTGGAA ACTATTGGTT 780 

CCAAGTTTGA TGGGCAAATC CAAGCAGATT CTGGTTGGAC CAATATCTTT ATCAAACCTG 840 

GGTATGAGTG GAAGGTCGTG GATGCCTTCT CAACCAACTT CCACCTGCCA AAATCAACTC 900 

TGGTCATGTT GGTTTCTGCC TTTGCAGGCC GTGAATTAGT CTTAGATGCC TACCACCATT 960 

CCATCCAAGA ACACTACCGC TTCTTCAGTT TTGGTGACGC CATGTTTATT TATTGAGAAA 1020 

GAATTTCTCT AAATCTTCTA ATACCAATAA ATCGCTAAGA TATTATTTCA AAGAACATCT 1080 

ACAATTGAAA CTCTAGCTAG CTGTAGAAGA GGCCTAGTAC ATTGAAATTA AAATGCTTCC 1140 

CCCTAGCTTC GAAAATATTG CCATAGATTG CGTTGACTCT CCAAATTGAT TCATCTATAT 1200 
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TTTATTTCAG 


CTTCCTATAC 


TTTCTTCGCT 


GTTTGTAAAT 


CAAAATGCAA 


GACACATGAG 


1260 


TAGCACCATA TTTGTTACTC 


TTATCTGTCC 


TCTCAAGAGA 


CTATTATGAG 


TTATTTCAGA 


1320 


ATCATTCACT 


ACTTTGACCC 


TGACTCTCCT 


TAGTCTCAAA 


ATCAAAGACT 


TATACTCTTC 


1380 


AAAAATCTCT 


TCAAACPGCG 


TCAACGTCAC 


CTTGGATTAT 


ATATGTGatC 


TGaCTTCGTC 


1440 


AGTTCP AT CYF 


ACAACCPPAA 


AGCAGTACTT 


TGAGCAACCT 


GCGACTAGTT 


TTCTAGTTTG 


1500 






ATTAAACAAA 


AAGTGAACAA 


ATCTGAATTC 


TAATGTACAG 


1560 


AAGAPTAfiRT' 


TTnTT^iPTT 


TTTTATAGTC 


GCTATAAGAT 


GACCTTATCT 


ATAGCTTTTT 


1620 


UTIT & T & ITT 
Aintninnl 1 


ITITITTPIP 


ACATACTATT 


ATCAATTTTG 


TCGCAGGGAG 


GAATCTGTTA 


1680 




1 CACCAi i AT 


CATTGACTCT 


ATAGCCATCT 


ATACTTGTAT 


TGACCGCTAA 


1740 




GTATTTACAT 


AATACCATTT 


ACCACCAACT 


TGGAACCATT 


GATTGACTTT 


1800 




TTGCTGTTGA 


GGTAGTACCA 


TGAACTATTA 


ACTTGTACCC 


AACCTGTTGC 


1860 




TCAGTAl 1AT 


AAAAATACCA 


CATACCATTT 


TCTTGTTTCC 


AGTCTGTTGT 


1920 


TGGAGCAACT 


GCTTTAGCTG 


GTTCTACTGC 


TACATCTGTT 


CCTTGGTTAG 


ATGTAACAGA 


1980 


TACAGGATAC 


GAAGGAATAG 


ATGATTGCTC 


AGGAACAACA 


ACTTTTTCAG 


GTTCTCTCGT 


2040 


CCCTCTCCTT 


ATACGTCTTT 


TTACCATCTC 


TTTAGTAATT 


TGACGAGAAG 


TAGTTTCTTC 


2100 


AATTGTTCCA 


TCACGTTCAT 


CTACAGTATA 


GATTGTAGTA 


AGAGTAATTT 


ACCAATTTCT 


2160 


CCTACTTCTT 


CTACTTCTTG 


ACTTTTATCA 


AGAGTTGGGC 


CATCGAGATA 


TTCTGTTTCG 


2220 


ATTGGAATTT 


CTTGGACAAG 


AACTTGGGGC 


TTGGTTCTTT 


TTTTAACAAC 


TCTTGTTTGA 


2280 


GAGTCTTTTT 


TTTGACTTAA 


AGTACTCTCA 


GTTACTTGTC 


CACTCTTTCC 


ATCTACATTA 


2340 


TAAGTTATCG 


TTGTAACTGT 


TTTCCCATTC 


TTTCCTAGAG 


TAATCTCTTG 


CTCCTGTCCT 


2400 


GCAGAAAGGT 


CATTGTCTGC 


TTCATATTTA 


GTAGCAAATG 


GAACAAGAAC 


TTCTTCAACC 


2460 


TTGCTTTTAG 


CTGGAACTTT 


GATAACTGTA 


TCCGTGGCTT 


CTTTTCTATC 


AACAGTAACC 


2520 


TGTTCGGTAA CATAACCAGT 


CTCTGGATTA ACATCGTAGG 


TCCTTGTCGT 


AGTTACATAG 


2580 


CCATCCTCTC 


CATCAATTGT 


AACAGGATTT 


TCACTACGGT 


CTTTTGTTTC 


ATCTTTTTCA 


2640 


TAACGAATTC 


GCGTACTTGA 


AATTTTCTTG 


GTTACTACCT 


TAGGTTTAGT 


CGCTACTTTT 


2700 


ACAATAATAT 


CCCCATTGTC 


AGCGTCATCA 


TACTCTATTC 


CCTCTTCTTT 


ATCTCTAGTA 


2760 


TCATCTCTGA 


CATATTGAAT 


CCCATCAGCA 


GCATGAACAA 


AACTTGTATT 


CAGATTCCTC 


2820 


CTAAAAATAA AGTTAGCCCG 


ATTACCGCAG AACCAAAAAT 


CTTTCCGAGT 


TTACGTATTG 


2880 


CATAGCGCTT ATTAGTATTA 


GATTTTGCCA TTACATCCTA 


CTTCTAGTAT 


AGCATCTTTT 


2940 
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CTATCAAACG TTAAACAATA TACGTTATAT ATAAAATAGA CTTAGAATGA TATATTGATT 3000 

ATTGAACTAA CACTTTAACT ATATCGTAAT CAATCTCATA TATAAAGGAT TGCAGACATC 3060 

TTATCTAAAT ACATGCGAAT ATATTTAGAT ACAAACATTC CAACTTGATA AT 3112 



(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4327 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

CCCAAAAATC TCTTCAAACC ACGTCAGCTT CGCCTTGCCG TAGTATGGTT ACTGACTTCG 60 

TCAGTTCTAT CCACAACCTC AAAACAGTGT TTTGAGCATC ATGCgGCTAG CTTCTTAGTT 120 

TGCTCTTTGA TTTTCATTGA GTATAAAAAC AGATGAGTTT CTGTTTTCTT TTTATGGACT 180 

ATAAATGTTC AGCTGAAACT ACTTTCAAGG ACATTATTAT ATAAAAGAAT TTTTTGAAAC 240 

TAAAATCTAC TATATTACAC TATATTGAAA GCGTTTTAAA AATGAGGTAT AATAAATTTA 300 

CTAACGCTTA TAAAAAGTGA TAGAATCTAT TTTTATGTAT ATTTAAAGAT AGATTGCTGT 360 

AAAAATAGTA GTAGCTATGC GAAATAACAG ATAGAGAGAA GGGATTGAAG CTTAGAAAAG 420 

GGGAATAATA TGATATTTAA GGCATTCAAG ACAAAAAAGC AGAGAAAAAG ACAAGTTGAA 480 

CTACTTTTGA CAGTTTTTTT CGACAGTTTT CTGATTGATT TATTTCTTCA CTTATTTGGG 540 

ATTGTCCCCT TTAAGCTGGA TAAGATTCTG ATTGTGAGCT TGATTATATT TCCCATTATT 600 

TCTACAAGTA TTTATGCTTA TGAAAAGCTA TTTGAAAAAG TGTTCGATAA GGATTGAGCA 660 

GGAAGTATGG TGTAAATAGC ATAGGCTGAT GTC CATC ATT TGCTTATAAA GAGATATTTT 720 

AGTTTAATTG CAGCGGTGTC CTGGTAGATA AACTAGATTG GCAGGAGTCT GATTGGAGAA 780 

AGGAGAGGGG AAAATTGGCA CCAATTTGAG ATAGTTTGTT TAGTTCATTT TTGTCATTTA 840 

AATGAACTGT AGTAAAAGAA AGTTAATAAA AGACAAACTA AGTGCATTTT CTGGAGTAAA 900 

TGTCTTATTT CAGAAATCGG GATATAGATA TAGAGAGGAT CAGTATGAAT CGGAGTGTTC 960 

AAGAACGTAA GTGTCGTTAT AGCATTAGGA AACTATCGGT AGGAGCGGTT TCTATGATTG 1020 

TAGGAGCAGT GGTATTTGGA ACGTCTCCTG TTTTAGCTCA AGAAGGGGCA AGTGAGCAAC 1080 

CTCTGGCAAA TGAAACTCAA CTTTCGGGGG AGAGCTCAAC CCTAACTGAT ACAGAAAAGA 1140 

GCCAGCCTTC TTCAGAGACT GAACTTTCTG GCAATAAGCA AGAACAAGAA AGGAAAGATA 1200 

AGCAAGAAGA AAAAATTCCA AGAGATTACT ATGCACGAGA TTTGGAAAAT GTCGAAACAG 1260 
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TGATAGAAAA AGAAGATGTT GAAACCAATG CTTCAAATGG TCAGAGAGTT GATTTATCAA 1320 

GTGAACTAGA TAAACTAAAG AAACTTGAAA ACGCAACAGT TCACATGGAG TTTAAGCCAG 1380 

ATGCCAAGGC CCCAGCATTC TATAATCTCT TTTCTGTGTC AAGTGCTACT AAAAAAGATG 1440 

AGTACTTCAC TATGGCAGTT TACAATAATA CTGCTACTCT AGAGGGGCGT GGTTCGGATG 1500 

GGAAACAGTT TTACAATAAT TACAACGATG CACCCTTAAA AGTTAAACCA GGTCAGTGGA 1560 

ATTCTGTGAC TTTCACAGTT GAAAAACCGA CAGCAGAACT ACCTAAAGGC CGAGTGCGCC 1620 

TCTACGTAAA CGGGGTATTA TCTCGAACAA GTCTGAGATC TGGCAATTTC ATTAAAGATA 1680 

TGCCAGATGT AACGCATGTG CAAATCGGAG CAACCAAGCG TGCCAACAAT ACGGTTTGGG 1740 

GGTCAAATCT ACAGATTCGG AATCTCACTG TGTATAATCG TGCTTTAACA CCAGAAGAGG 1800 

TACAAAAACG TAGTCAACTT TTTAAACGCT CAGATTTAGA AAAAAAACTA CCTGAAGGAG 1860 

CGGCTTTAAC AGAGAAAACG GACATATTCG AAAGCGGGCG TAACGGTAAC CCAAATAAAG 1920 

ATGGAATCAA GAGTTATCGT ATTCCAGCAC TTCTCAAGAC AGATAAAGGA ACTTTGATCG 1980 

CAGGTGCAGA TGAACGCCGT CTCCATTCGA GTGACTGGGG TGATATCGGT ATGGTCATCA 2040 

GACGTAGTGA AGATAATGGT AAAACTTGGG GTGACCGAGT AACCATTACC AACTTACGTG 2100 

ACAATCCAAA AGCTTCTGAC CCATCGATCG GTTCACCAGT GAATATCGAT ATGGTGTTGG 2160 

TTCAAGATCC TGAAACCAAA CGAATCTTTT CTATCTATGA CATGTTCCCA GAAGGGAAGG 2220 

GAATCTTTGG AATGTCTTCA CAAAAAGAAG AAGCCTACAA AAAAATCGAT GGAAAAACCT 2280 

ATCAAATCCT CTACCGTGAA GGAGAAAAGG GAGCTTATAC CATTCGAGAA AATGGTACTG 2340 

TCTATACACC AGATGGTAAG GCGACAGACT ATCGCGTTGT TGTAGATCCT GTTAAACCAG 2400 

CCTATAGCGA CAAGGGTGAT CTATACAAGG GTGACCAATT ACTAGGAAAT ATCTACTTCA 2460 

CAACAAACAA AACTTCTCCA TTTAGAATTG CCAAGGATAG CTATCTATGG ATGTCCTACA 2520 

GTGATGACGA CGGGAAGACA TGGTCAGCTC CTCAAGATAT TACTCCGATG GTCAAAGCCG 2580 

ATTGGATGAA ATTCTTGGGT GTAGGTCCTG GAACAGGAAT TGTACTTCGG AATGGGCCTC 2640 

ACAAGGGACG GATTTTGATA CCGGTTTATA CGACTAATAA TGTATCTCAC TTAGATGGCT 2700 

CGCAATCTTC TCGTGTCATC TATTCAGATG ATCATGGAAA AACTTGGCAT GCTGGAGAAG 2760 

CGGTCAACGA TAACCGTCAG GTAGACGGTC AAAAGATCCA CTCTTCTACG ATGAACAATA 2 820 

GACGTGCGCA AAATACAGAA TCAACGGTGG TACAACTAAA CAATGGAGAT GTTAAACTCT 2880 

TTATGCGTGG TTTGACTGGA GATCTTCAGG TTGCTACAAG TAAAGACGGA GGAGTGACTT 2940 

GGGAGAAGGA TATCAAACGT TATCCACAGG TTAAAGATGT CTATGTTCAA ATGTCTGCTA 3000 
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TCCATACGAT GCACGAAGGA AAAGAATACA TCATCCTCAG TAATGCAGGT GGACCGAAAC 3060 

GTGAAAATGG GATGGTCCAC TTGGCACGTG TCGAAGAAAA TGGTGAGTTG ACTTGGCTCA 3120 

AACACAATCC AATTCAAAAA GGAGAGTTTG CCTATAATTC GCTCCAAGAA TTAGGAAATG 3180 

GGGAGTATGG CATCTTGTAT GAACATACTG AAAAAGGACA AAATGCCTAT ACCCTATCAT 3240 

TTAGAAAATT TAATTGGGAA TTTTTGAGCA AAAATCTGAT TTCTCCTACC GAAGCGAACT 3300 

AGAGAGATGG GCAAAGGAGA GATGGGCAAA GGAGTTATTG GCTTGGAGTT CGACTCAGAA 33 60 

GTATTGGTCA ACAAGGCTCC AACCCTTCAA TTGGCAAATG GTAAAACAGC GACTTTCCTA 3420 

ACCCAGTATG ATAGCAAGAC CTTGTTGTTT GCAGTAGATA AGGAAGATAT CGGACAGGAA 3480 

ATTATTGGTA TAGCTAAAGG AAGCATCGAA AGTATGCATA ATCTTCCTGT AAATCTAGCA 3540 

GGTGCCAGAG TTCCTGGCGG AGTAAATGGT AGCAAAGCAG CGGTGCATGA AGTTCCAGAA 3600 

TTTACAGGGG GAGTTAATGG TACAGAGCCA GCTGTTCATG AAATCGCAGA GTATAAGGGA 3660 

TCTGATTCGC TTGTAACTCT TACTACAAAA AAAGATTATA CTTACAAAGC TCCTCTTGCT 3720 

CAGCAGGCAC TTCCTGAAAC AGGAAACAAG GAGAGTGACC TCCTAGCTTC ACTAGGACTA 3780 

ACAGCTTTCT TCCTTGGTCT GTTTACGCTA GGGAAAAAGA GAGAACAATA AGAGAAGAAT 3840 

TCTAAACATT TGATTTTGTA AAAATGGCTC TTTGTCAACT GTAGTGGGTT GAAGTCAGCT .3900 

AAGCTCGAGA AAGGACAAAT TTTGTCCTTT CTTTTTTGAT ATTCAGAGCG ATAAAAATCC 3960 

GTTTTTTGAA GTTTTCAAAG TTCCGAAAAC CAAAGGCATT GCGCTTGATA AGTTTGATGA 4020 

GATTATTGGT CGCTTCCAAT TTGGCGTTAG AATAGTGTAG TTGAAGGGCG TTGACGATTT 4080 

TCTCTTTGTC CTTTAGAAAG GTTTTAAAGA CAGTCTGAAA AAGAGGATGA ACCTGCTTTA 4140 

GATTGTCCTC AATGAGTCCG AAAAATTTCT CCGGTTCCTT ATTCTGAAAG TGAAACAGCA 4200 

AGAGTTGATA GAGCTGATAG TGATGTTTCA AGTCTTGTGA ATAGCTCAAA AGCTTGTTTA 4260 

AAATCTCTTT ATTGGTTAAA TGCATACGAA AAGTAGGGCG ATAAAAATGT TTATCGCTGA 4320 

GTTTACG 4327 
(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

CTCTGGCCCT GCCACTCCAA CGTTTTGTCA GGGTGCTTTT TTCATAAAGG AGTTCTTATG 60 
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TTAGATATCA 


AACGTATTCG 


TACAGATTTT 


GAAGCTGTCG 


CAGAAAAATT 


AGCTACACGT 


120 


GGTGTAGATG 


CTGCTGTCTT 


GAATGAAATG 


AAAGAAATCG 


ATGCTAAACG 


TCGTAACATC 


180 


TTGGTCAAGG 


TTGAAACTCT 


CAAAGCAGAA 


CGTAACACAG 


TTTCTGCTGA 


GATTGCCCAA 


240 


GCTAAGCGCA 


ACAAGGAAAA 


TACAGATGAC 


AAGATTGCTG 


CCATGCAAAA 


TCTATCTGCT 


300 


GAGGTTAAAG 


CCTTGGATGC 


TGAATTGGCA 


GAAATCGATG 


CTAAATTGAC 


AGAATTTACA 


360 


ACGACTCTTC 


CAAATATCCC 


AGCTGACAGC 


GTTCCTGTTG 


GGGCTGACGA 


AGACGACAAT 


420 


GTGGAAGTTC 


GCCGTTGGGG 


TACTCCACGC 


GAGTTTGACT 


TCGAACCTAA 


AGCTCACTGG 


480 


GATCTCGGTG 


AAGACCTTGG 


TATCCTTGAC 


TGGGAACGCG 


GTGGTAAGGT 


AACAGGCGCT 


540 


CGCTTCCTCT 


TCTATAAAGG 


CCTCGGTGCT 


CGTTTGGAAC 


GTGCTATCTA 


CAACTTTATG 


600 


TTGGATGAAC 


ATGGAAAAGA 


AGGCTATACT 


GAAGTCATCA 


CACCTTACAT 


AGTCAACCAT 


660 


GATTCTATGT 


TTGGTACTGG 


TCAGTATCCA 


AAATTTAAGG 


AAGATACTTT 


TGAACTCAGC 


720 


GATACCAACT 


TTGTCTTGAT 


TCCAACTGCT 


GAAGTTCCTC 


TGACAAACTA 


CTACCGTGAT 


780 


GAAATCTTAG 


ACGGCAAAGA 


TCTTCCAATC TACTTCACTG 


CCATGAGTCC 


GTCATTCCGT 


840 


TCTGAGGCTG 


GTTCTGCCGG 


TCGTGATACG 


CGTGGCTTGA 


TCCGTTTGCA 


CCAATTCCAC 


900 


AAGGTTGAAA 


TGGTCAAATT 


TGCCAAACCA 


GAAGAATCTT 


ACGAAGAATT 


GGAAAAAATG 


960 


ACAGCCAACG 


CTGAAAACAT 


TCTTCAAAAA 


CTCAACCTTC 


CATACCGTGT 


CGTTGCTCTC 


1020 


TCTACTGGAG 


ATATGGGCTT 


CTCAGCTGCG 


AAGACTTACG 


ACTTGGAAGT 


GTGGATTCCA 


1080 


GCACAAAACA 


ATTACCGTGA 


AATCTCAAGC 


TGTTCAAACA 


CAGAAGATTT 


CCAAGCCCGT 


1140 


CGTGCCCAAA 


TCCGTTACCG 


TGATGAAGCA 


GATGGCAAGG 


TGAAACTCCT 


TCATACCTTG 


1200 


AACGGTTCTG 


GACTTGCAGT 


TGGACGTACA 


GTGGCTGCAA 


TTCTTGAAAA 


TTACCAAAAT 


1260 


GAAGATGGTT 


CTGTGACCAT 


CCCAGAAGCA 


CTTCGTCCAT 


ACATGGGTGG 


AGCTGAAGTC 


1320 


ATCAAACCAT 


AAAAAATAAG 


GTTTAGCTAT 


TTCTAGCTAG 


ACCTTTTTTC 


GTAACCAAAT 


1380 


CAGATAAGCA 


CCTAGTACAA 


AGAATAAAAT 


AGTTAGGCAT 


ATAATGGTTT 


CAGCCAATAC 


1440 


CAGGTAATCC 


AGAAATGGAA 


GTTTCAAAAT 


TCCCTGAGCC 


ATCTTGAGCG 


AGGTCGCTGT 


1500 


GATAATGGTT 


GGGAAGGTGA 


GGGCTGAGAA 


GGCTGGTTGA 


AAACCTTGTT 


TTAAAATGTT 


1560 


GGGCAGACGA 


GTTAAAACAA 


AGAAAAAGAA 


GGATTGAGAA 


GCCAAAATCA 


TGACAATCAA 


1620 


GACCCAAGTC 


GGCAGGCTGG 


TTCCTCCTAC 


TCGAACTAGA 


GAAGCCAAGA 


GTAGAGAGAA 


1680 


AGGAGCACAG 


TAGATTCCTT 


CTTGTCCAAG 


CAAGGCTAGT 


GGGAGTGGAT 


GTTTCTTTAA 


1740 


ATCGCTATAA 


ATAAGGGGAT 


AGAGATAGAA 


GGTCAAGAGA 


AAACCAAAAC 


TCAAGGTCGC 


1800 
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ATAGGCAATT TCGATAATAC CTACCAGAGG ATAGGTCAAG GCAGCCACTG CTATCCCCAC 1860 

ATAGAGAACC GTCCAGCTTG GAGTGGCATG AACCCTCCGC CCTGGACAAG CAAACTTGAT 1920 

GGTAAAACCA GCAATCAAGG TCAAATCCAA GAGAAATGAA AACCACCAAA TCCCTTGTGC 19 80 

TACCAAAGGA AGATAAGAGA ATACGCGAAA GACATAGGTC GATAAAATCA TCCCAGCCAT 2040 

AGGAAAGGTT GCCATTCCTG ACAAAAGAGG GGGCTTGGTC AATTCTTGCT TGGTTTCTTT 2100 

CCAATTAAAG AGATGCAGAA TTAGAAAGTA AATCCATAAA ACCAAACCAA TCAGACTAAA 2160 

AAGATGGGAT AGAACCGGCA ACGTATCTAA AATAAGATTT CCAGCTCCTG CCAAACCTAG 2220 

CAAACAACCT GAAAATACTA AGGGGAGTTT TTTCATCCTA ACCTCCAATA ATCATGTTAG 2280 

TTTCAGTATA ACATAAAAGC GCTTAAATGA GGATTTAAAA AAACGAGTCC GCTTATTTCA 2340 

GACTTCATTT TACTCAGATA TGAATTAGGC ATAAGGTTGC AATTCTGGAT TAATTGGTGT 2400 

ATTAGCTAAG TTGTTGGCAT AGTTACAGAG GATTGCTAGG CTGACACCAA AAACCACATC 24 60 

CAAGGCATTT TGTTGAGTGT AGCCAGCTTC TAAAAACTCA GACAAGGCTT CATCTCCTAC 2520 

ACGACCCTTG GTATTGATAA CTGCCAAGGT AAACTTAGCT AGGGTATCCA ATTTAGGATC 2580 

TGTTTCAATT GGAGTACGAT TGCGAAGAGC TTGAATCAAG TCATCATTCA TCTGGATTTG 2640 

TTTGATGGAA AAGGCTGTGT GACCTGCGAC ACAGAAGGCA CAACCATTGG TCACGGCTGC 2700 

CGTGATTTGC ACCACTTCAC GCTCAACGGG TGTCAGGCTG TTGCGACGGT GGATAGATGA 2760 

GACAATTTGG TAGGCTTCTA AAACAGTCGG GGCATTGGCC AAGAGACCGA TTAGGTTGGG 2820 

AATATAGCCA TTGTTGTCTT TTTCTACTGT TTCAAGAATT TCTTTCACTT CTGCTGGTGC 2880 

TGACTCTACT GTATGGATAG TAAATGTTGT CATAAGATAC CTCTTTTCTT ATTATTGACA 2940 

CTAATATTAT TGGAAAATCT TATAAAATCC TGATTCCTAA GTTTATCTAA GATAAAGCTT 3000 

TATTCTCTCA TAAGATTTTC GTTGTTATAT TAGTTTATCA CACTTCCAAT CACTTGTATA 3060 

ATATATATTA TATATCAGGC TGATAAAAAT TATTTATAGG CAAAAAAATC ACACGAGCTG 3120 

TGTGATTCCA TTATTTGTCA AAATACTTTT TAGTTTCAGC AATAACGACT GGCGACAAGA 3180 

CCAAGAGGGC AATCAAGTTT GGCAGAGCCA TCAAGGCGTT AACGATATCT GCGATAATCC 3240 

AGACCATATC CAACTCGATA AATCCTCCTA ACAAGACCAT GAGCACAAAA ACCACACGGT 3300 

AGAGCCAGAT AAAGCGAACC CCAAAGAGGA ACTCAAAACA GCGTTCTCCG TAATAGTTCC 3360 

AACCTAGAAT CGTTGTAAAG GCAAAAAGTA CAAGGAAGAT GGTCAAGAGA GCAGGCCCAA 3420 

AGTGTGAAAA GTTTGTTGAG AAAGCTGACT GAGTCAAGGC AACCCCATTC AAGTCACCGC 3480 

TCCAAACTCC AGTTACCAAG ATGGTCAAAC CAGTTAGAGT A 3521 
(2) INFORMATION FOR SEQ ID NO: 119: 



WO 98/18931 



PCT/US97/19588 



645 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1968 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

AACCTGGGCA AGCAAGCTAA AAGCAATGGG ACCTGGAATC CTAATGGCAA CTGCCGCTGT 60 

TGGAGGTTCC CACATTGTAT CCTCAACTCA AGCTGGCGGT TCTTACGGTT GGTCTCTACT 120 

TCTCTTGGTC ATCTTAGCCA ATGTCTTTAA ATATCCATTT TTCCGTTTTG GTGCTGAATA 180 

CACAGCTGAT ACTGGAAAGA CTTTGGTTGA AGGTTATGCC GAAAAAGGAA AACTCTATCT 240 

CTGGATTTTC TTTATCCTCA ATGTCTTTTC GGCTATGGTC AACACGGCTG GTGTTGCCAT -300 

TCTGTGCTCA GCTATCATCG CCAGTGCCTT CCCAATGATT GGACTTAGCA TTACTCAGTG 360 

GTCCCTCATT CTCGTTGCAA TCATTTGGGC TATGCTACTC TTTGGAGGCT ACAAACTTTT 420 

AGACGGCATG GTCAAATGGA TTATGTCTGC CTTAACCATT GCGACTGTTC TTGCAGTTAT 480 

CATTGCGGCG GTCAAGCATC CAGAATACAG TTCTGATTTT GTCGAGAAGA CACCTTGGCA 540 

AATGGCAGCT CTGCCCTTCA TCGTCTCCCT CCTAGGATGG ATGCCGGCTC CTATTGAAAT 600 

TTCAGCCATC AATTCACTTT GGTCAGCTGA AAAGAGAAAG ACCGTCAACT TTAACACAGA 660 

AGACGCTCTG TTTGACTTTA ACACTGGTTA TATTGGAACA GCTATCCTAG CCGTCTTCTT 720 

TGTGGCACTG GGAGCACTGA TTCAGTATCC TACAGGGCAG GCGGTTGAAG CTGCTTCAGC 780 

CAAATACATC TCTCAATTCG TGGGCATGTA TGCCTCTGTT CTTGGCGAAT GGTCCCGTTA 840 

CTTGATTACC TTTATTGCCT TCCTCTGTAT CTTTGGAACA GTTATAACTG TTATCGATGG 900 

CTATTCTCGC GTTAATCAGG AATCTCTCCG ACTGCTAATC AGTCAAAAAG AGGACAATCG 960 

TAAATCTTTG AACATCTGGA TGACCATCAC TGCTATCATC GGTATCGTCA TTATCAAGTT 1020 

CTTCGCTGGT CAGGTTTCAA CCATGCTCCG CTTTGCCATG ATTGGCTCTT TCCTGACAAC 1080 

ACCTTTCTTT GCTCTTTTGA ATTACGCCTT GGTAACGCGT GAAAACAAAA ATCTTCCTTC 1140 

TTGGCTCAAA CACCTTGCCA TTGCGGGATT GATTTTCCTC TTTGCTTCGC CATCTTCTTT 1200 

ATCTACGCAC TCGCAATCGG AAAAGCAGGG TAAGGGACAA GCGCGAGATG AAGATAAGGT 1260 

TTCATTTCAA GAGAAAATTC AGCAAATATT TCTATGATAA AAAGCATAAG AACAAGGTTT 1320 

TGAAGACCTG AACTTATGCT TTTTTACGTT CTTAAAGACT GTTTATACTC AAAAAACAGT 1380 

TGAACAACTT CAACCACCTC TTATAAGAAC TTTATACTAT TCGAGAATCT CTTCAAACCA 1440 
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CGTCAGCTCT ATCTGCAACC TCAAAGCTGT GCTTTGAGCA ACCTGCGACT AGCTTCCTAG 1500 

TTTGCTCTTT GATTTTCATT GAGTATTAAT TCTCCTTTTC CAACTCATAC AAATCTGCGA 1560 

TAATAGCTGC GACATGTTTG ATATCTTCCA GCATGCCTCG CATTTCAAAG TCAGCCAATA 1620 

CAGGGAAGCC AAAGCGTTGA CTGTATTGCT TGGCTGTTAG GCAGTATTGG TTATTAAAGT 1680 

TACGATTTCC TGACCCAACC ACACCAAAAC ACTTACTAGC ATTGTTACCA TAGGCAATAA 1740 

AATCTCCCAC CGGTGTCGTC AAAATCTCAA CATCTCCGTT ATCCACGCCA TTCCCACCTT 1800 

CGAGATAGGT CGGCAAAAAA GCGACATAGG GATGGTCCAT TTCATAGAAA TTTTTGCCTT 1860 

CCTTGACCAA ATCCTTGATA TGAATCTTTT GAACCTCAAT CCCTTTGTAC TGGGACAAGA 1920 

GATAGTCTTT CAAGCGCGTC ACAAAACTTT CAGTGTTGCC ACTCAAGG 1968 



(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7172 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

CCGCATTTTT TATCACTAGA CTCGAGACAT CTTTTGAGTG GCTCTTGCTC TCTGGTTTAA 60 

TTTTCTTCCT TGCTCAAGGA CTCCTGCTAT TTCTCTTGGT CGTCCGACTC AAACATCAAT 120 

TCGCTGAGAT TTATCCTCAA ATCAATAAAA AGATTCGCTT CTACTATTTA GGGGTTCTCA 180 

CCATTGATTT TCTATTTTTT GTTCTCTTAG CCTTCATTAG TTCTCAGCGT TTTTCATCTC 240 

TTATGCCAAT CATCACTGCT TGCCATTCTA CTTTTTATTA TATGACAGCT GACTACCTAA 300 

GAGAAAACTA TCCAGACTTT TACGACAAAC ACATCTCTTT ATGGGAGTGT CTCTAAAGAA 360 

AAGGAGGTTT TAGCATGAAA AAAATCATCT TCATCAAAAC CATTCAACTC CTTGTCATTG 420 

ATGGAATCAT GCTGGCATTT TTGACATTTA AAAGGGGGCT TACTTGGGAC TGGATTTTGA 480 

TTTATAGCGG TTGGCTCATT TTCTTTCATC CTGTGCTATT GACCTATCTT TCAAACCAAC 54 0 

TTTGTGACCA CTTTAGTTAA CTCTATTCCC AGATTAGACC GAGATTCTGG CGTTTTGCTT 600 

TACAAATTCT CCTATGGGAT AGCCTGATGA TTCTCTCCTT GGTGTCTTTA AGTGATATTC 660 

CACTTTTCCT TCAGGGAACT CTCCTCATCC TAGGACATCT CATCCCTTCC TATCGCATCT 720 

GCCAAAGCCT GAAAAGAGAC TTCCCCCAAG CATATCAAGA ACCGATTTCT TTTTGGAGTA 780 

TTTTATGATA GATGAGAAAG ACCAAGCCGA CTGGGCTTGG TCTTTCTTAT CTCTTTTTAG 840 

TATCTAGGAT AATGGTAACA GGTCCATTAT TAACCAGCTC AACCTGCATA TCTGCTCCAA 900 
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AGATGCCTGT 


CTGAACGGGC 


ACTTCTTGCG 


CTAATTTTTG 


ATTGAAAGCA 


TCATAGAAGT 


960 


CTGATGCCAT 


ATCAGGTTTA 


GCTGCCCCTG 


TAAAGgCTGG ACGATTGCCT 


CTCTTAGTAT 


1020 


CCGCAAAGAG 


GGTAAACTGA 


GAAATAGAGA 


GGATTTCTCC 


TTCAATATCT 


TTGACAGACA 


1080 


GGTTCATCTT 


GCCTTCTGCG 


TCTGAAAAAA 


TCCGCATATT 


GACCAGTTTT 


CTCACAGCAT 


1140 


AGTCCAAATC 


TTCCTCTTGG 


TCCTCTGGTC 


CAACACCAAC 


CAGCAATAAA 


AGTCCCTGAT 


1200 


TGATTTTTCC 


CTGAATCTGG 


CCTTCTATAC 


TCACTTGGGC 


TTTTTTAACC 


CGTTGGATAA 


1260 


TGATTTTCAT 


AATAGCCTTT 


CTAGTAAGAG 


CTAGGACAAC 


TAGCCGTTGG 


TCCGTTTGAC 


1320 


AGAGTAAACT 


TCTGGCACAC 


TCTTAATTTT 


ATCGACAACC 


GTGGTCAGTG 


TAGAGAGGTT 


1380 


GGCAATACCG 


AAGgACACAT GGATATTAGC 


AAACTTCATA 


TCCTTGGTTG 


GTTGGGCATT 


1440 


GACCGTTGAA 


ATATTCTTGG 


TTGTATTTGA 


AAGAACTTGC 


AGTACATCGT 


TCAACAGTCC 


1500 


TGTACGGTTG 


AGACCGTAGA 


TATCGATATG 


GGCCATATAC 


TCCTTATTTG 


AGCTAGGGTA 


1560 


CTGGTCTTCC 


CATTCCACAT 


CAAGGAGACG 


. TTGCTCGTAG 


TTTTCTTGGG 


CACGCAGGTT 


1620 


CATACAGTCC 


ACACGGTGAA 


TAGCCACACC 


ACGACCCTTG 


GTAATGTAGC 


CAACAATATC 


1680 


GTCACCAGGC 


ACGGGGTTAC 


AACACTTAGC 


AATCCGCACT 


AGGAGACCAG 


AAGCACCTTC 


1740 


AATAACCACT 


CCCCCCTCAT 


GCTTGACCTT 


GAGGGTTTCT 


TTATTTTCAA 


CCTTGACCTC 


1800 


GCCACCTTTG 


ACAAGCTCCT 


CTGCCTCAGC 


TTTGGCCTTG 


GCACGCTCTT 


CCTCACGGCG 


1860 


TTCCTTTTCA 


GTCAGACGGT 


TAAAGACGGT 


AATCGCACCG 


ATTTCCCCAA 


AACCAATGGC 


1920 


CGCAAAGAGG 


GAGTCTTCTG 


TCTTGTAACT 


GGTCTTTTGC 


AGAACTTGAT 


CCATGTGGCG 


1980 


CTTGTCCATA 


AATTTATTTG 


CCACATAGCC 


ATTTTCTTGG 


AACTGAGCCA 


TCAGCATCTC 


2040 


ACGACCCTTG 


TTGACAGACA ATTCCTTATC 


TTGGTTTTTA 


AAGAACTGGC 


GAATCTTATT 


2100 


GCGCGCCTTG 


CTAGTCTTGA 


CCATATTGAG 


CCAGTCACGG 


CTAGGTCCAA AGGAGTTCGG 


2160 


GTTGGCGATA ATTTCAACCT GATCCCCTGT 


CTTTAACTTG 


GTTGTCAGTG 


GAACCATGCG 


2220 


GCCATTGACC 


TTGGCACCAG 


TTGCTTTTTC 


ACCGACCTTG 


GTATGGATTT 


CGTAGGCAAA 


2280 


ATCAATCGGT 


CCTGAATCTT 


TGGGAAGGGA 


ACGGACAGCT 


CCATCTGGGG 


TAAAAACGTA 


2340 


AATCTCCTCA 


GCCAAATAGT 


TTTCCTTAAC 


AGAGTCCACA 


AATTCCTTAG 


CATCATCAGC 


2400 


CTGGTCTTGG 


AGCTCCATCA 


TCTCCTTGAT 


CCAGTTCATT 


CCAATAGCTG 


ATTCCTTGCT 


2460 


GTTAACTTGC 


CCCTTTATAC 


CTTTCTTATA 


AGCCCAGTGA 


GCCGCAACCC 


CGTACTCAGC 


2520 


CACCTCGTGC 


ATTTCCTTGG 


TTCGAATCTG 


GAATTCAATC 


GGCCCTTTTG 


GTCCATAAAC 


2580 


AGTCGTATGG 


ATAGACTGAT 


AACCATTGGC 


CTTGCGGTTG 


GCGATATAGT 


CTTTGAAGCG 


2640 
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ACCTGGCATC GGTTTCCAAA ATTCATGCAC GTAACCAAGC ATGGCATAAA CATCACTTTG 2700 

GGTATCTAAA ATACAACGAA TAGCAATCAG ATCATAGATT TCCTCAAACC GTTTTCTCTT 2760 

GTCCTGCATT TTGCGGAAAA TTGAGTAAAT ATGCTTGGGA CGACCATAAA TCTTCCCTTT 2820 

CAAGTGACGT TCTGTCGTAT ACTCCTCTAA TTTTGTGACT ACCTCATCCA CCAAGGCCTC 2880 

ACGCTCCCTG CGCTTTTCCT TCATCATATG GGTAATCTTG TAAAACTCCG TTGGATTGAG 2940 

ATAACGGAAA GACAAGTCTT CTAATTCCCA TTTGACACTG GAAATCCCCA AACGATGGGC 3000 

AAGCGGGGCA TAGATTTCCA TGGTTTCTTT GGAAATACGC TCCTGCTTGT CTTTTCGAAG 3060 

ATGTTTCAGG GTCCGCATAT TGTGCAAGCG GTCAGACAGT TTGACCAAAA TAACGCGGAT 3120 

GTCCTCAGAC ATGGCCATGA GCATCTTGCG ATGATTTTCC GCTAATTGCT CCTCGATCGA 3180 

TTTGTACTCG ACCTTGCCAA GCTTGGTAAC TCCGTCAACA ATCATCCGCA CATCAGGACC 3240 

AAACTCTCTT TCCAAATCGT CCAAAGTCGC ATCTGTATCT TCCACCACAT CATGCAAGAA 3300 

TCCACAAGCT ACTGTTACAG CATCCAGCTT TAGCTTAGCT AAAATACCTG CCACTTGGAT 3360 

AGGGTGAATG ATATAAGGCT CGCCTGATTT GCGATATTGA CCACTGTGGC ATTCAACAGC 3420 

ATAGACCAAG GCCTTATGGA CAAAATGAAC ATCCTCTTCC GTTAAATATT CTTTGGTTAA 3480 

AGCGACAACT TCTTCGCCTG TTAAATTCAC TTCTTTCGGC ATCTCTACTC TCCAATTCTT 3540 

CCTACCATTT TATCACTTTT TTAAGAATAT GAAAACTAGA TTGGAACAGA ATAAGAAAAA 3600 

AATAATTCAA AATTGCTTGA TAATTCTGAA TTATTGGTCC GTAATATACT ACGAAGTTAG 3660 

ATTTTAAACT TAGGTGATAG AAGGAGAGAT AGAAGAACGG AAACCATATT GTAACCCAAA 3720 

GACTTTCTGA CTTCCCCAAT TCCATTGAAG ATACGAAAGA TAAACGGTGG AACTCGTATC 3780 

ACATACACTG GTACCTTGAC TGGATTTTGG AATTAATACT AAATGAAAAT CAAAGAGCAA 3840 

ACTAGGAAAC TAGCCGCAGG TTACTCAAAG CACCGCTTTG AGGTTGCAGA TAAAGTTGAC 3900 

GCGGTTTGAA GAGATTTTTG AAGAGTATAA AAATCCTCAA GATACTTTCT TCTATCCTTT 3960 

AGTTTATAAG GAGAATACCT ATGAAAAAAA CTGCTATTTC TATCTTTGCT CTCCTAATGT 4020 

TAGGAGTTTG CTGCCTGTTC CTATTCAGCC AGCAAAGCTA TAAAAAACAG TCGTTCAATA 4080 

CTATGCTAAC GACCAGAACC TGCCCAGTAG GATAACTTAT AGTGAATATA GCGACAAATG 4140 

AGAAGCCAAC TACGGTAGCA CTCTAAACAT CACGTCTATC AAACAAGCTA ATGACGGAGT 4200 

TTATGCAACC TATGAAGGGC AATTGACACC TTTCCAATAT TGATAAATTG ATAACCAGCC 4260 

TGTCTTCATC TAGTCATGCT GGTTTTTAAG TTCATTTTAA ATCCTTACCT ATTCTCCCTA 4320 

ACTGTGCTAT ACTTAATTTA TACTCAATGA AAATCAAAGA GCAAACTAGA AAGCTAGCCG 4380 

CAGGCTGTTC AAAGCACTGC TTTGAGGTTG CAGATAAAGT TGACGCGGTT TGAAGAGATT 4440 
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TTCGAAGAGT ATTAGTACAT TCTTTGAGAT TGGAGCTAGT ATGAAAATCC ATAAAACCGT 4500 

GAATCCTGTT GCCTATGAAA ATACCTATTA TCTAGAAGGC GAAAAGCACC TCATCGTCGT 4560 

CGATCCTGGT AGTCATTGGG AAGCCATTCG TCAGACAATC GAGAAGATCA ACAAACCGAT 4620 

CTGTGCTATT CTCTTGACCC ACGCCCATTA TGACCATATC ATGAGTCTGG ACTTGGTTCG 4680 

CGAGACGTTT GGCAATCCTC CTGTCTATAT CGCAGAGAGC GAAGCCAGCT GGCTCTACAC 4740 

TCCTGTCGAT AATCTCTCCG GTCTCCCTCG CCACGATGAT ATGGCAGATG TGGTCACAAA 4800 

ACCTGCAGAA CACACCTTTG TCTTTCACGA AGAATACCAA CTAGAGGAAT TTCGTTTTAA 4860 

'GGTTCTACCG ACCCCAGGGC ACTCTATCGG TGGTGTTTCC CTAGTCTTTC CTGATGCTCA 4920 

TCTAGTCTTG ACGGGAGATG CTCTATTCCG CGAAACTATC GGACGGACCG ACCTTCCGAC 4980 

TGGTAGCATG GAGCAACTCC TTCATAGTAT CCAGACCCAA CTCTTCACCC TACCAAACTA 5040 

CGATGTCTAT CCAGGACATG GTCCAGCTAC TACTATCGCT CACGAAAAGG CCTTCAATCC 5100 

CTTTTTCTAG CAAGATGATG ACAATCGAAA TTTAAGTAAA CTATCCAGCA AATCTTTCTA 5160 

TT AC AAAAGG CATCCTATCA AGGTTTTCAC ACATGATTGG ATGCCTTTTT TCTGATGACT 5220 

AGATTTTTTG CATTACCAAA TAATCACGCG CTCCTCTGGT GAACGCCACA TTCCGTCTCC 5280 

TTCTTTGACA TCATAGGTTG TAAAGAAATC GTCGAAGTTT GGTACTTGCA CATTGACACG 5340 

GAGTTTGGCT GGTGCGTGCA CATCGACGCT AGCCAAAAGT TTCATAAATT CTGGTCGACC 5400 

TTTCATGCGC CAGATGCGAC CGAAGTTGTA GAAGAACTCT TCTGCTGAGA AGTCTGCTTC 5460 

TCTCTT AG CT GCTTCAAGCG CTGCTGCGAT TCCTCCCAAG TCAGCCACGT TTTCTGATAC 5520 

AGTCAATTTA CCGTTAATGG TTGCTCCATA AGAATCCTGT CCATCAAATT GGTCAATGAC 5580 

TTTTTGTGTT TTCTCCTTGA AGGCAGCATA GTCGCTCTCT GTCCACCAAT CCTTGAGGCT 5640 

ACCATTTTCG TCAAAGGAAG CCCCGTTAGT ATCAAAGGCG TGGGAAATTT CATGGGCAAT 5700 

CACTGCCCCA ATACCACCGT AGTTAGCAGA AGATGACTGA TGCAAGTCAT AGAAAGGCGC 5760 

CTGTAAAATG GCCGCTGGAA AGACAATCAG GTTCTTCTGA GGATTGTAGT AGGCATTGAC 5820 

CATATGAGCA GGCATGCCCC ATTCCTTATA ATCTACAGGC TGGTTCCACT TACTCCAACT 5880 

GTGCTTGATT TCCACACGCG CAAAGGCTAG AGCATTCTCA AAAAGACTGG CAGTTTCATT 5940 

CACTACCTTA TCCTTGTAAC GTGCAGGCAA TTCTTCTGGA TAGCCAATAT AAGGTTTGAT 6000 

CACATTGAGC TTCACGATAG CCTGTTTACA GGTTTCTGGA GTGAGCCAGT CATTCTTAAG 6060 

CAGACGCTCC TTATAAACAT CAATCATGGT TGCCACTTTT TTCTCCACAT CCGCCTTGGC 6120 

TTCTGGAGAG AACTTCTCAC GGGCGTACCA AAGACCCAGG GCTTGCTTGA AAGGTTCTTG 6180 
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TGCTAGATGA TAAGCTGCTT TGACCTTATC TTTTGCCTCT GGAACTCCAG AAAGGGCACG 6240 

GCTGTAGGCA CCAGACAAAA CACGGATATC CTCTGTTAAA TAGCTGGTTG AAAGATTGAC 6300 

AACACTCAAA ATCAAGGTTG CTTTAAGGAG AGACCAGGCT TCCTCACTGT AGAATTGCTC 6360 

TGCTGCTTGC CAGAAACGTT CCTCGTCTAC AATAACCTTG TCTGGTAATT GCCCAATAAC 6420 

TGCTTTGAAG AAGTCATCCA AAGGTAGGGC AGGCGCGAAT TTCTTGAAAT CTTCGTAAGA 6480 

ATATGGATGA TAGAGTTTAG CATATTCTGA ACTTTCTTCA TTAGAGAGCA CCACTGCCGC 6540 

AACTCGGCGG TCCAATTCAA GTCTTTTTTC TAGCAAGTCT TCAATTTCTT CATC AG AG AA 6600 

ATCATAAGCC TTGAGGAGAT TTGCGCTGCT TTCTTTCCAA AGAGTCAAGA GCTCTTCGCG 6660 

CTGAGGATGT TCTTCTGCAT AGTAGGTCGT, ATCTGGCAAG ATTGTGCTTG GAGCGCTAGC 6720 

CCATAGAACA TTGATTCTAG CATCCATAAA GTCTGGCGAT ACACCAAAAG GAAGGAAGTT 6780 

TGGTTTTCCT GCAAGCTCAA ACTCTGCTAG TTTAGCTGTA AAATCCGCAA AAGTCTCCAA 6840 

TTCTTGGAAT TCTTTAAGGA GTGGTAAGAC AGGTGTGATA CCGTCAGCTT CTCTCTTGTC 6900 

AAAATCACGA ACTAGGCGGT GGTATTTGAC AAAGTTTTCC AAGATAGCAT CCTCAGGCAC 6960 

TTCTTCACCT GCTAACCACT TGTCTGTTGT CGCCAGCATC AGGTCTTCAA TTTCCTGGTC 7020 

TAAATCAACA AAACCTCCTG TTTGAGACTT ATCTGCTGGG ATTTCAGCTG TCTGTTGCCA 7080 

TTCTCCATTG ATAGCATCAT AAAAATCATC TTGATAACGT GTCATCTTGT TCTCGCTTTC 7140 

ATTTGTATTT GCATTTATCT TAACAAAAAT CG 7172 
(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

CGGGAAGTTA TGCGATCTAG ACTTCGTTCC TGTACAGCTA CTTTCTCAGG TGGTCTTGTT 60 

GTTTGTATGA GTTTGTTTAG AGAGGATCTT TCTATGTCTT TCTTTCTTAT TTTTGTTTTA 120 

TATGCTTTTC TGATTTCTTA TCTAATTTAT GGTTATTTCA GACTAAAAAG GAAATACCGA 180 

GTAGATGAAT AGCAAGGTTC TAGGTCTTCA GATTGATTTT TAGCACTCTT GATAAAAGAG 240 

TGCTAATTTT TTGAGTTTTT GTCTTGACAT TCTCTTCTAA GGGTGTATAA TAGAATCATG 300 

AGTTAGCACT TGGATGCATT GAGTGCTAAT TGATCAGACA GAGAGGAGTG ATGAGATGGT 360 

TACAGAGCGT CAGCAGGATA TTTTAAATCT GATTATTGAC ATCTTTACCA AAACGCACGA 420 
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ACCTGTCGGA TCAAAAGCCT TGCAAGAGTC TATTAACTCT AGCAGTGCAA CCATTCGTAA 480 

TGACATGGCG GAACTAGAAA AACAAGGGTT GCTTGAGAAG GCTCATACTT CAAGTGGTCG 540 

GATGCCAAGT GTTGCTGGTT TTCAGTACTA TGTGAAACAC TCACTGGATT TTGACCGGCT 600 

GGCTGAAAAT GAGGTATATG AGATTGTCAA AGCCTTTGAT CAGGAATTCT TCAAATTGGA 660 

GGATATTCTG CAAGAGGCTG CTAACTTACT AACAGACCTG AGTGGCTGTA CGGTAGTGGC 720 

ACTGGATGTT GAGCCGAGCA GGCAACGTTT GACAGCCTTT GATATCGTTG TTTTGGGGCA 780 

ACATACAGCC TTGGCGGTAT TTACCCTAGA CGAGTCGCGA ACGGTTACTA GTCAGTTTCT 840 

GATTCCAAGG AACTTCTTGC AGGAGGATTT GCTGAAACTG AAGAGCATCA TTCAGGAACG 900 

TTTCCTCGGT CACACCGTTT TAGATATTCA CTACAAGATT CGGACGGAGA TTCCGCAGAT 960 

TATCCAGCGT TACTTTACAA CAACGGATAA TGTCATCGAT CTCTTTGAAC ACATCTTTAA 1020 

GGAAATGTTC AACGAAAACA TTGTGATGGC GGGCAAGGTC CATCTCTTGA ATTTTGCCAA 1080 

TCTAGCAGCC TATCAGTTCT TTGACCAACC GCAAAAGGTG GCCTTGGAGA TTCGTGAGGG 1140 

GTTGCGTGAG GATCAGATGC AAAATGTTCG TGTTGCAGAC GGTCAAGAGT CCTGTTTAGC 1200 

TGACCTAGCG GTAATCAGTA GTAAGTTCCT CATTCCTTAT CGGGGAGTTG GAATTCTAGC 1260 

CATTATCGGT CCAGTTAATC TGGATTACCA ACAGCTAATC AATCAAGTCA ATGTGGTCAA 1320 

CCGTGTTTTG ACCATGAAGT TGACAGATTT TTACCGCTAC CTCAGCAGTA ATCATTACGA 13 BO 

AGTACATTAA GATTGAAATC ATTAAAGGAG GCGAACATGG CCCAAGATAT AAAAAATGAA 1440 

GAAGTAGAAG AAGTTCAAGA AGAGGAAGTT GTGAAAACAG CTGAAGAAAC AACTCCTGAA 1500 

AAGTCTGAGT TGGACTTGGC AAATGAACGT GCAGATGAGT TCGAAAACAA ATATCTTCGC 1560 

GCTCATGCAG AAATGCAAAA TATCCAACGC CGTGCCAATG AAGAACGTCA AAACTTGCAA 1620 

CGTTATCGTA GCCAGGACTT GGCAAAAGCA ATCTTACCAT CTCTTGACAA CCTTGAGCGT 1680 

GCACTTGCAG TTGAAGGTTT GACAGATGAT GTGAAGAAGG GCTTGGGGAT GGTGCAAGAA 1740 

AGCTTGATTC ACGCTTTGAA AGAAGAAGGA ATTGAAGAAA TCGCAGCAGA TGGCGAATTT 1800 

GACCATAACT ACCATATGGC CATCCAAACT CTCCCAGCAG ACGATGAACA CCCAGTAGAT 1860 

ACCATCGCTC AAGTCTTTCA AAAAGGCTAC AAACTCCATG ACCGCATCCT ACGCCCAGCA 1920 

ATGGTAGTGG TGTATAACTA AGATATAAAG CCCGTAAAAA GCTCGCAGTA AAAATAGGAG 1980 

ATTGACGAAG TGTTCGATGA ACACAAGAAA ATCTATCTTT TTTACTCAGA GCTTAGGGCG 2040 

TGTTCGATTC GGCAATTCTG ACGGTAGCTA AAGCAACTCG TCAGAAAACG GCAATCGCTA 2100 

TGGCGTTTGC CTAGCTTCCT TACTAACTCG TCGTCGAAAT AAAATCGATT TCGACTCCTC 2160 
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GTGTCGCAAT TTACATAATA GAAAACTTGT CCGAAACGAC AATAAACTAT GAAGAAAGAT 2220 

AAAATATGTT TGGCTTTGTA ATAGTGAGCG AAGCGAACCA AACACGATAC TCTTCGCCGT ,2280 

GGCGCTATTT GCGCAAATTT TGAGACCTTA GGCTCAAAGT TTAGTCAAAG AGATTGACGA 2340 

AGTCAAGCTC TGACGGCGTC GCCACTGTCG CCACTTAAGA AGAGTATCAA AAAGAAAAAT 2400 

AGAAAATTAA CTAACAAGGA GAAAAACACA TGTCTAAAAT TATCGGTATT GACTTAGGTA 2460 

CAACAAACTC AGCAGTTGCA GTTCTTGAAG GAACTGAAAG CAAAATCATC GCAAACCCAG 2520 

AAGGAAACCG CACAACTCCA TCTGTAGTCT CATTCAAAAA CGGAGAAATC ATCGTTGGTG 2580 

ATGCTGCAAA ACGTCAAGCA GTTACAAACC CAGATACAGT TATCTCTATC AAATCTAAGA 2640 

TGGGAACTTC TGAAAAAGTT TCTGCAAATG GAAAAGAATA CACTCCACAA GAAATCTCAG 2700 

CTATGATCCT TCAATACTTG AAAGGCTACG CTGAAGACTA CCTTGGTGAG AAAGTAACCA 2760 

AAGCTGTTAT CACAGTTCCG GCTTACTTCA ACGACGCTCA ACGTCAAGCA ACAAAAGACG 2820 

CTGGTAAAAT TGCTGGTCTT GAAGTAGAAC GTATTGTTAA CGAACCAACT GCAGCAGCTC 2880 

TTGCTTATGG TTTGGACAAG ACTGACAAAG AAGAAAAAAT CTTGGTATTT GACCTTGGTG 2 940 

GTGGTACATT CGACGTCTCT ATCCTTGAAT TGGGTGACGG TGTCTTCGAC GTATTGTCAA 3000 

CTGCAGGGGA CAACAAACTT GGTGGTGACG ACTTTGACCA AAAAATCATT GACCACTTGG 3060 

TAGCAGAATT CAAGAAAGAA AACGGTATCG ACTTGTCTAC TGACAAGATG GCAATGCAAC 3120 

GTTTGAAAGA TGCGGCTGAA AAAGCGAAGA AAGACCTTTC TGGTGTAACT TCAACACAAA 3180 

TCAGCTTGCC ATTTATCACT" GCAGGTGAGG CTGGACCTCT TCACTTGGAA ATGACTTTGA 3240 

CTCGTGCGAA ATTTGACGAT TTGACTCGTG ACCTTGTTGA ACGTACAAAA GTTCCAGTTC 3300 

GTCAAGCCCT TTCAGATGCA GGTTTGAGCT TGTCAGAAAT CGACGAAGTT ATCCTTGTTG 3360 

GTGGTTCAAC TCGTATCCCT GCCGTTGTTG AAGCTGTTAA AGCTGAAACT GGTAAAGAAC 3420 

CAAACAAATC AGTAAACCCT GATGAAGTAG TTGCTATGGG TGCGGCTATC CAAGGTGGTG 3480 

TGATTACTGG TGATGTCAAG GACGTTGTCC TTCTTGATGT AACGCCATTG TCACTTGGTA 3540 

TCGAAACAAT GGGTGGAGTA TTTACAAAAC TTATCGATCG CAACACTACA ATCCCAACAT 3600 

CTAAATCACA AGTCTTCTCA ACAGCAGCAG ACAACCAACC AGCCGTTGAT ATCCACGTTC 3660 

TTCAAGGTGA ACGCCCAATG GCAGCAGATA ACAAGACTCT TGGACGCTTC CAATTGACTG 3720 

ATATCCCAGC TGCACCTCGT GGAATTCCTC AAATCGAAGT AACATTTGAC ATCGACAAGA 3780 

ACGGTATCGT GTCTGTTAAG GCCAAAGACC TTGGAACTCA AAAAGAACAA ACTATTGTCA 3840 

TCCAATCGAA CTCAGGTTTG ACTGACGAAG AAATCGACCG CATGATGAAA GATGCAGAAG 3900 

CAAACGCTGA AGCCGATAAG AAACGTAAAG AAGAAGTAGA CCTTCGTAAT GAAGTAGACC 3960 
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AAGCAATCTT TGCGACTGAA AAGACAATCA AGGAAACTGA AGGTAAAGGC TTCGACGCAG 4020 

AACGTGACGC TGCCCAAGCT GCCCTTGATG ACCTTAAGAA AGCTCAAGAA GACAACAACT 4080 

TGGACGACAT GAAAACAAAA CTTGAAGCAT TGAACGAAAA AGCTCAAGGA CTTGCTGTTA 4140 

AACTCTACGA ACAAGCCGCA GCAGCGCAAC AAGCTCAAGA AGGAGCAGAA GGCGCACAAG 4200 

CAACAGGGAA CGCAGGCGAT GACGTCGTAG ACGGAGAGTT TACGGAAAAG TAAGATGAGT 4260 

GTATTGGATG AAGAGTATCT AAAAAATACA CGAAAAGTTT ATAATGATTT TTGTAATCAA 4320 

GCTGATAACT ATAGAACATC AAAAGATTTT ATTGATAATA TTCCAATAGA ATATTTAGCT 4380 

AGATATAGAG AATTATATTA GCTGAACATG ATAGTTGTAT CAAAAATGAT GAAGCGGTAA 4440 

GGAATTTTGT TACCTCAGTA TTGTTGTCTG CATTTGTATC GGCGATGGTA CCGTATCTGA 4500 

CGAACGTTCA GCTTATAT 4518 



(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 



TGCTATTTTC 


GATTCCCTTG 


GGCGTTTTGA 


TTGCCTTTGC 


CTTGCAAGTC 


CATTGGAAGC 


60 


CCCTCCATTA 


TCTGATTAAC 


ATTTACATCT 


GGGTTATGCG 


AGGAACCCCC 


TTACTCTTGC 


120 


AACTGATTTT 


TATCTATTAT 


GTGCTCCCAA 


GTATTGGGAT 


TCGTTTAGAC 


CGCCTTCCTG 


180 


CAGCTATTAT 


TGCCTTTGTT 


CTCAACTATG 


CAGCTTACTT 


TGCAGAAATT 


TTCCGTGGGG 


240 


GAATTGACAC 


TATTCCAAGA 


GGACAGTATG 


AGGCCGCCAA 


GGTCTTGAAG 


TTTAGCCCTT 


300 


TTGACAGAGT 


GCGCTATATT 


ATCTTGCCCC 


AAGTGACCAA 


GATCGTTCTT 


CCTAGTGTCT 


360 


TTAATGAAGT 


TATGAGTTTG 


GTCAAGGATA 


CTTCTTTGGT 


CTATGCTCTC 


GGAATTTCAG 


420 


ACCTTATCTT 


GGCTAGTCGA 


ACAGCTGCTA 


ACCGCGATGC 


TAGTCTAGTT 


CCTATGTTCT 


480 


TGGCAGGAGC 


CATTTATTTG 


ATTTTGATTG 


GGATTGTGAC 


AATTATTTCC 


AAAAAAGTTG 


540 


AGAAGAAGTA 


TAGTTATTAT 


AGATAGGAGG 


CTGCCATGTT 


AGAATTACGA 


AATATCAATA 


600 


AAGTCTTTGG 


AGACAAACAA 


ATCCTGTCTA 


ATTTCAGTCT 


AAGTATTCCT 


GAAAAGCAAA 


660 


TCCTGGCTAT 


CGTTGGACCT 


TCTGGTGGAG 


GTAAGACAAC 


TCTTTTACGT 


ATGCTTGCAG 


720 


GTCTTGAAAC 


CATTGATTCA 


GGGCAAATCT 


TTTATAATGG 


ACAACCTTTA 


GAGCTGGATG 


780 
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AATTGCAGAA GCGCAATCTA CTGGGATTTG TCTTCCAAGA TTTTCAACTA TTTCCTCATC 840 

TATCAGTTCT GGAAAATTTG ACTTTATCGC CTGTGAAGAC CATGGGAATG AAGCAGGAAG 900 

AGGCTGAGAA GAAGGCGAGT GGACTCTTGG AACAGTTAGG ACTAGGAGGA CACGCAGAGG 960 

CCTATCCTTT CTCACTATCT GGTGGGCAAA AGCAGCGGGT GGCTTTGGCG CGTGCTATGA 1020 

TGATTGACCC AGAAATCATT GGCTACGATG AACCAACTTC TGCCCTGGAT CCAGAATTAC 1080 

GTTTGGAAGT GGAGAAGCTA ATCTTGCAAA ATAGGGAACT TGGGATGACC CAGATTGTGG 1140 

TTACCCATGA TTTGCAGTTT GCTGAAAATA TCGCAGATGT ATTATTGAAA GTAGAACCTA 1200 

AATAGGAGGA AAAATGGATG AAAAAATGGA TGCTTGTATT AGTCAGTCTG ATGACTGCTT 12 60 

TGTTCTTAGT AGCTTGTGGG AAAAATTCTA GCGAAACTAG TGGAGATAAT TGGTCAAAGT 1320 

ACCAGTCTAA CAAGTCTATT ACTATTGGAT TTGATAGTAC TTTTGTTCCA ATGGGATTTG 1380 

CTCAGAAAGA TGGTTCTTAT GCAGGATTTG ATATTGATTT AGCTACAGCT GTTTTTGAAA 1440 

AATACGGAAT CACGGTAAAT TGGCAACCGA TTGATTGGGA TTTGAAAGAA GCTGAATTGA 1500 

CAAAAGGAAC GATTGATCTG ATTTGGAATG GCTATTCCGC TACAGACGAA CGCCGTGAAA 1560 

AGGTGGCTTT CAGTAACTCA TATATGAAGA ATGAGCAGGT ATTGGTTACG AAGAAATCAT 1620 

CTGGTATCAC GACTGCAAAG GATATGACTG GAAAGACATT AGGAGCTCAA GCTGGTTCAT 1680 

CTGGTTATGC GGACTTTGAA GCAAATCCAG AAATTTTGAA GAATATTGTC GCTAATAAGG 1740 

AAGCGAATCA ATACCAAACC TTTAATGAAG CCTTGATTGA TTTGAAAAAC GATCGAATTG 1800 

ATGGTCTATT GATTGACCGT GTCTATGCAA ACTATTATTT AGAAGCAGAA GGTGTTTTAA 1860 

ACGATTATAA TGTCTTTACA GTTGGACTAG AAACAGAAGC TTTTGCGGTT GGAGCCCGTA 1920 

AGGAAGATAC AAACTTGGTT AAGAAGATAA ATGAAGCTTT TTCTAGTCTT TACAAGGACG 1980 

GCAAGTTCCA AGAAATCAGC CAAAAATGGT TTGGAGAAGA TGTAGCAACC AAAGAAGTAA 2040 

AAGAAGGACA GTAAGATAAA ATAGTGGCTG AAACTGCGTT TTGATTAGCA AAACGTAGTT 2100 

TTTTTTGTAA TCTAGGAAAA CGATAATAGC GATTGAATAT GGATAATTGA ATATGGAATA 2160 

GCCCACTGTG ATTTCTAAAA CATTGTTAAA AATTGATTTG ACTTCCAAAA TTAAAATGTT 2220 

CTGTAATGAA ATACTGATGT AACTGTTTTA GGAACAATAA AACGCATAAT ATCAAGGTTT 2280 

TTGCACCTTA CATTATGCGT TTTTGTGATT TTAAGACTTG TTAGCTGATT TTTTACAATC 2340 

CTGCGAAATC TTTGATTTCT TGTGCTGACA TTGAAGAGTC GCAACGGACG TTGATTTGTC 2400 

CATCTGTAAT ATGAACAAAA CCTGGTACAG TTGGGATTCC ATAGCGTGAG CGGAATGCTT 2460 

GCAAATCATT GAGTTGGCTT GGTTCTTCAC TATTGATGAA GTAAATGTGA GCTTTGGTTT 2520 

CAGCTACGAC ACCTGACAAT GTACCTGCAA ATTTACGGCA GTAAGGGCAA GTTTTGCGAC 2580 
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CGATAAAGAA 


GGTTGCAGTT 


TCTTTTTTAT 


CAAGAGCTTC 


TTGCGCACGC 


ACAACTGTAG 


2640 


TGACTTCAAG 


GTCTTTGATG 


TTATCTAAAA 


ATTGTTCCAT 


GAGATTACCT 


CGCTTTCATT 


2700 


GATAAGTCTA 


GTATGCCATA 


AAGTTTCTAA 


AATTGCTTAG 


ATTTGATACG 


AAAAAAGATG 


2760 


AGGTTGGTTG 


GTCTCATCTT 


TTATAGGTCT 


TTATTTTACA 


AATGCATTGA 


TTTCTGCTTC 


2820 


GATGTTAGCA 


ATCTTAGCTT 


GTGATTCTTC 


GTTGGTTTCC 


CCTACAACTG 


CAATGTAGAA 


2880 


CTTGATTTTT 


GGTTCTGTAC 


CTGAAGGGCG 


AACGGCAATC 


CATGAACCGT 


CAGCAAGTGT 


2940 


GTATTTCAAC 


ACATCACTTG 


GAGGAGTTGT 


CAAGTTTGTA 


ACAGTACCGT 


CAGCAACAGT 


30OO 


AGCAGTTTGT 


GCCTTGAAGT 


CTTCTACGAC 


AGTGATAGCT 


GTTGCGTTCC 


ATTCTGTTGG 


3060 


AGCATTGTTG 


CGGAATTTAG 


CCATAATCGC 


TTTGATTTGT 


TCAGCACCAT 


CGACACCTGA 


3120 


AAGAGTAACA 


GAGATTGTTT 


TTTCTGCGTA GTAGCCATAT 


TCTTTATAGA 


TTTCTTCGAT 


3X80 


ACCGTCAGCA 


AGTGTCAAAC 


CACGAGAACG 


GTAGTAGGCA 


GCAAGTTCAG 


CAACTACAAG 


3240 


AACGGCTTGG 


ATGGCATCTT 


TATCACGTAC 


AAATGGTTTA 


ATCAAGTAAC 


CGAAGCTTTC 


3300 


TTCAAATCCC 


ATCATGTAAG 


TGTGGTTGTG 


TTTTTCTTCG 


AATTCTTGGA 


TTTTTTCAGC 


3360 


GATAAATTTG 


AAACCTGTCA 


AGACGTTGAA 


CATAGTTGCG 


CCGTAGCTTT 


CAGCAATCTT 


3420 


CGTTACCAAG 


TCAGTTGAAA 


CGATAGATTT 


GCAGAGAGCG 


GCATTTTCAG 


GAAGAGTTCC 


3480 


AGCGTTTTTG 


TGAGCTTCCA 


AGATGTATTT 


AGCCATGATA 


GCACCGATTT 


GGTTACCTGA 


3540 


AAGGTTGAGG 


TAGCTACCAT 


CTTTTTGAAG 


AACTTCAACA 


CCAACACGGT 


CAGCGTCTGG 


3600 


GTCAGTTGCG 


ACAAGAACAT 


CTGCACCAAC 


TTGACGACCA 


AGTTCTTCAG 


CAAGGGCAAA 


3660 


GGCTGCTTGG 


CTTTCTGGGT 


TTGGAGATGT 


TACAGTTGAA 


AAGTCTGGGT 


CAGCAGTTGC 


3720 


TTGCGCTTCA 


ACAACTTGAA 


CAGAGTCAAA 


TCCTGCTTGG 


GCAAGAGCAC 


GACGAGCCAA 


3780 


CATTTCACCA 


GTACCATGAA 


GTGGTGTGTA 


GACAATCTTC 


ATGTCTTTAC 


CAAATTCTTC 


3840 


AATCAAGGCT 


GGGTTGATGT 


TTATGTCCTT 


AACCTCTTTA 


AGGTATTCTA 


TGTCAACAGC 


3900 


TTCGCCGATA 


ACTTCAATCA 


AGCCAGAAGC 


TTTTTCAGTT 


TCCACATCAG 


CAACTTCAAC 


3960 


TGCAAATGGG 


TTTTCGATTG 


CACGGATATA 


AGTAGTCAAA 


GCGTCCGCAT 


CGTGTGGAGG 


4020 


CATTTGTCCA 


CCGTCTTCAC 


CGTAAACCTT 


GTAACCGTTA 


AATGGAGCAG 


GGTTGTGGCT 


4080 


GGCTGTGACC 


ATGATACCTG 


CGAAACAGTT 


GAGATGACGA 


ACTGCAAATG 


ATAGTTCTGG 


4140 


AGTCGGACGA 


AGGCTTTCAA 


ATACGTAAGA 


TTTGATGCCG 


TGTTTAGCAA 


GAACTGCCGC 


4200 


AGATTCAAAG 


GCAAACTCAG 


GTGAGAAGTG 


ACGGCTATCG 


TAGGCAATTG 


CTACACCGCG 


4260 


TTCTTTCTCG 


TTTCCACCTT 


TTGACTCAAT 


CAAACGAGCC 


AATCCTTCAG 


TAGCTTGGCG 


4320 
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AACAACGTAG 


ATGTTGATAC 


GGTTTGTACC 


AGCACCAACC 


AAGCCACGCA 


TACCTGCAGT 


4380 


ACCAAATTCA 


AGATTTGTAT 


AGAAGGCATC 


TTCCTTAGTT 


TTTTCGTCCA 


TATTTTCCAA 


4440 


ATCTTGACGA 


AGGTAGTCAC 


GAAGCTCCAC 


AAAATCAACC 


CATTTCTGGT 


AATTTTCTTG 


4500 


GTAAGACATT 


CAAATTCTCC 


TTTATTTTTA 


AAACATTTAA 


TCAGTTTAAT 


TATATCATTT 


4560 


TTTTTAGTTT 


TAGTAAAACC 


TTATCTGCTT 


CGAACATCTC 


TTCAAACCAG 


GTCAGATTGA 


4620 


ATTTTGGGGT 


TATATGATGT 


TGAGGCTAGG 


AAAAATTCAA 


TTTCAGTAAA 


AAAAGTAAGT 


4680 


CTTCTCATAA 


CAAAACATTG 


ATATAGTTAC 


TTAGTTTTAA 


ACAAGCATAT 


TATAATAAAG 


4740 


CTATGGCATA 


TAGTACTGAT 


TTTAAACAGC 


GAGCATTAGA 


TTACATCAAA 


GAGGGGCACA 


4800 


GCCATGTCGA 


GGCAGCCAAG 


TTTTTTGGTG 


TTGGCGTCAG 


AACTCTCTTC 


ACGTGGGAAA 


4860 


AGAAAGACGT 


GAACAAGAAC 


ACATAGAGAG 


GAAAAAGCGA 


GTCGTCAAAA 


ACCGAAAGAT 


4920 


TCCTTTAGAG 


GAATTGAAAG 


CCTTTGTAGA 


GGCTCATCCA 


GATGCTTTTT 


TACGGGAAAT 


4980 


TGCGGCACAT 


TTTGATTGTG 


CTGTTCCTTC 


AGTATGGGCA 


GCTTTAAAGC 


AGATTAAGGT 


5040 


CACTTTAAAA 


AAAGATGACG 


AGCTTTAAGG 


AACAAGACCC 


AGAAAAGTAG 


CCTTATTTCT 


5100 


TAAGAATTTT 


AATAGTTTAA 


AGCACCTAGC 


ACCTGTTTAT 


ATTGATGAAA 


CAGGAATCGA 


5160 


CCGCTATCTC 


TATCGTCCTT 


ATGCAGGGGC 


TCCTAGAGGG 


GAGAAAGTCT 


ATGAAAAGAT 


5220 


TAGCGGACGT 


CGTTTTGAGC 


GAACTTCAAT 


TGTTGCAGGA 


CAAGTAGACG 


GAGAGTTTAT 


5280 


AGCTCCCATG 


ATTTACAAGA 


AAAGCATGAC 


AAGCGATTTC 


TTTGTGGAGT 


GGTTCAAAAC 


5340 


GCAACTCCTA 


CCTGCTTTGA 


AGACACCTCA 


TGTTATTGTC 


ATGGGCAATG 


CTGGTTTTCA 


5400 


TCCCAAGAAC 


ATTTTGGATG 


AACTCTGCAT 


CCAAGATAAA 


CACTTTTTCT 


TACCTCTACC 


5460 


ACCTTATTCA 


CCGGATTTGA 


ATCCTATTGA 


GCAAGCTTGG 


GCTATCTTGA 


AAAAGAAAGT 


5520 


GACGGATGTA 


TTAAGGGAAG 


TTCCAACTAT 


TTTTGAATGT 


TTGGAATGCT 


TTTTTAAAAC 


5580 


TAGATGACTA 


TAACGGTTCT 


AAAGGAACCT 


ATCGAGTAGT 


CATTAAAACT 


AAGGATACTG 


5640 


CTGGTTAAGA 


GAAGACGGTA 


TACAATCAAA 


CCATTCACCG 


TGTAGCCGAA 


ATCGTTCAGA 


5700 


ATGAAGACTT 


GTATCAGAAT 


GAAGACTTGT 


ATAAGAAAGG 


TTTGAATGTT 


GAACTTGCGC 


5760 


ACCAACAAAT 


TAAGGGATTT 


TTTGAAGCAG 


AGTTTAAAAA 


TCGTATTAAT 


GGAGTTCTTA 


5820 


ATACTAAAAT 


AAAAAATAGT 


ACATTAAATC 


GTGTAAATAA 


AAAAACTATA 


CACCAGAGCA 


5880 


ACAAAAACTC 


CATGATCAAT 


TTGAAGCAGA 


AGCAACGGAA 


GATGCTAAAA 


AACAAGGCGA 


5940 


TATTGTGTTG 


AATGTTGACC 


AGGATTTCAT 


GAGCATATCT 


AAGTCTAATA 


AAAGTGGTTC 


6000 


AGACTGGAAG 


AAAACTTTCA 


CAGTGAGGAT 


AACCAATAGG CTAGCAAATG ACTTGAATAA 


6060 


TGTCTTGAAA 


CAGGTTGATA 


AAGATACTCC 


TAATACCCCA 


ACTTGGCTAA 


ACTCAGCTGC 


6120 
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rrvri /-%rp » » » r*r*m 
lltl AAAbLI 


AAAGATGATG 


ACAGAGTATA 


TAAACTACTG 


AAGACTCTTA 


TACCAGGAGA 


6180 


AAA! 1ALC1A 


TCATGTTAAG 


GATAATCAGC 


TAGAAGTAGA 


AACAGATAAA 


TACACATATA 


6240 


CTGCCGCTAG 


AAATGGTAGT 


AAGGAAGTTG 


GTATTCAAGA GTCAGATATA GCAGCAACTC 


6300 


TAAGTGCCGA 


TGAATATAAT 


TCTAATCGCC 


AAACTTTTfiA 
fww, i i i i vjn 


unununn 1 


AA&TKPAAKA 
AAA1ALAAAA 


6360 


GCAAATGCCC 


TTAATAATGG 


TTGGGCTAGA 


1 V- 1 VTV> 1 


AAVjAVj 1 i Lnn 


AAAu l 1 v» Itt. 


6420 


CACTTTGTAG 


GGGTAGACAA 


AGGGATTGTG 






1AAAAAAL1A 


6480 


TCTGATAAGA 


TTAGGAAAGA AGTGGGCTCT 


RGAfiATAfifA 


AnL 1 AtsuAAA 


AGGCGGCTAT 


6540 


TTCTCTACTG 


GGGATGTTCT 


ATTAGGAAAA 


GATGTTGTTT 


CTTATACCGT 


ACAAGTATTT 


6600 


TCAGAGAATA 


ATGAAAGAGT 


AGGAGTAAAC 


Al. 1 LHAAu 1 l_ 




GTATAATCTC 


6660 


CCAATTCTAG 


CTGACTTTTC 


AGTCATCCAA 


OA 1 AV, lo 1 ajV» 


AACCATCACG 


AAC CGTTG TT 


6720 


GAAAAAATCA 


TTCCAAAACT 


AAATATTCCC 


U/UiunAunuA 


AA*jUL»AAAA 1 


AACCGAAGAA 


6780 


ATCAAGAAAA 


AGAAAAAAAC 


CTCAGAATTG 


ul. AVjAAL X AA 


1L1 t. AlaAAAA 


TGTGAAAGTT 


6840 


CGCTATGTTG 


ATGAACAAGG 


GCGTTTGCTA 




A ion I AL 1 uu 


AATTGGAGAA 


6900 


AAAGAAAGTG 


ACGGAACCTA 


CATT AC CAAT 




TPATT/'P'PIP 
1 uA 1 lub i AL 


l_ALK_ J. A 1 AA I 


6960 


GTCACAGATA 


AAAAACTCAG 


TAGCATGACT 


ArTArrnAm 


u/vnnn J rl 1 1 n 


1 Al_ 1111 AAA 


7020 


GAAGCAGATA 


CAAATTCTGC 


AAGTTTAACT 


GGGAATATTY3 


TAAnrnA Ann 


Tin* ar - " AfT/-" 

1 nbAALAu 1 \j 


7080 


ACCTTAGTTT 


ATAGAGAAAG 


CGAAGCGCCA 


ACCACTGCTA 


C Af?P A A P AfV 


p» ATTAPTAT 
V— AM 1 1 AL 1 A i 


7140 


AAAGAAGGTA 


GGCAAGAGAA 


GTTGGTAGAG 


TCTGTTATAA 


AAGCTGATTT 


AGCGATAGGT 


7200 


TCTGAGTATA 


CCACAGAATC 


AAAAACTATT 


GAAGGGAAAA 


CAACAACTGA 


GGACAAAGAA 


7260 


GACCGAGTTA 


TCACAAGGAA 


AACAACATAC 


ACCTTGGTAG 


CAACTCCTGA 


AAATGCGTAC 


7320 


CAGAAGACGG 


TGCAACAGTT 


GACTATTACT 


ACCGTGAGAA 


TGTTGAGGAA 


ACAGTGGTTC 


7380 


CCAAAACAGC 


AACCTCTACT 


GAGACGAAGA 


CTATAACGCG 


TATCATTCAT 


TACGTTGATA 


7440 


AAGTTACGAA CCAAAATGTA AAAGAAGATG 


TTGTTCAACC 


TGTAACCTTA 


AGCCGTACAA 


7500 


AAACTGAGAA 


CAAGGTCACG 


GGAGTTGTAA 


CCTACGGTGA 


ATGGACAACA 


GGAAACTGGG 


7560 


ACGAGGTTAT 


ATCTGGTAAG 


ATTGACAAGT 


ACAAAGATCC 


AGATATTCCA 


ACAGTTGAAT 


7620 


CACAAGAAGT 


TACGTCAGAC 


TCTAGTGATA 


AAGAAATAAC 


GGTAAGGTAT 


GACCGTTTAT 


7680 


CAACACCAGA 


AAAACCAATC 


CCACAACCAA 


ATCCAGAGCA 


TCCAAGTGTT 


CCGACACCAA 


7740 


ACCCAGAACT 


ACCAAATCAA 


GAGACTCCAA 


CACCAGATAA 


ACCAACTCCA 


GAACCAGGTA 


7800 


CTCCAAAAAC 


TGAAACTCCA 


GTGAATCCAG 


ACCCAGAAGT 


TCCGACTTAT 


GAGACAGGTA 


7860 
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AGAGAGAGGA ATTGCCAAAC ACAGGTACAG AAGCTAATGC TACCTTGGCT AGTGCTGGTA 7920 

TCATGACCTT GTTAGCTGGT CTAGGATTAG GATTTTTCAA GAAAAAAGAA GATGAAAAAT 7980 

AATAGATTTT AGAATCTAGG AACCAGGAAA AGCTCACAGA TGTGGGCTTT TTTCCTGGTT 8040 

TTGAGAACGA GGTCTTTCGT AAAGAATAAA AACGCTTACA AGTCTGTTGA ACTGGGAAAC 8100 

TATGAATCCT ATTTTTTTAA AAATATTTCC AGAAATCAGT TGCGG 8145 

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A} LENGTH: 8697 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 



CGGTACCGGG 


AACGATACTT 


AGTCTAATTT 


TGCACCTTTT 


CCATGTATGG 


TAAAGGTTTT 


60 


TCTTTTTTTA 


AAAAGGAAAA 


CGAGAAGAGG 


AGGTTCTTAT 


GAAAGCAAGC 


ATTGCCTTGC 


120 


AAGTTTTACC 


CCTAGTACAG 


GGGATTGATC 


GGATAGCTGT 


TATTGATCAG 


GTCATTGCTT 


180 


ATCTGCAwAC 


TCAAGAAGTG 


ACGATGGTAG 


TGACACCATT 


TGAAACGGTC 


TTGGAAGGGG 


240 


AGTTTGATGA 


GCTTATGCGC 


ATTCTAAAAG 


AAGCGCTGGA 


AGTGGCAGGG 


CAGGAGGCAG 


.300 


ACAATGTCTT 


TGCCAATGTC 


AAAATAAATG 


TAGGAGAGAT 


TTTAAGTATT 


GATGAGAAAC 


360 


TTGAGAAGTA 


TACTGAGACG 


ACACATTAGT 


CTATTGGGCT 


TTCTCGGAGT 


ATTGTCAATC 


420 


TGGCAGTTAG 


CAGGTTTTCT 


TAAACTTCTC 


CCCAAGTTTA 


TCCTGCCGAC 


ACCTCTTGAA 


480 


ATTCTCCAGC 


CCTTTGTTCG 


TGACAGAGAA 


TTTCTCTGGC 


ACCATAGCTG 


GGCGACCTTG 


540 


AGAGTGGCTT 


TACTGGGGCT 


GATTTTGGGA 


GTTTTGATTG 


CCTGTCTTAT 


GGCTGTGCTC 


600 


ATGGATAGTT 


TGACTTGGCT 


CAATGACCTG 


ATTTACCCTA 


TGATGGTGGT 


CATTCAGACC 


660 


ATTCCGACCA 


TTGCCATAGC 


TCCTATCCTG 


GTCTTGTGGC 


TAGGTTATGG 


GATTTTGCCC 


720 


AAGATTGTCT 


TGATTATCTT 


AACGACAACC 


TTTCCCATCA 


TCGTTAGTAT 


TTTGGACGGT 


780 


TTTAGGCATT 


GCGACAAGGA 


TATGCTGACC 


TTGTTTAGTC 


TGATGCGGGC 


CAAGCCTTGG 


840 


CAAATCCTGT 


GGCATTTTAA 


AATCCCAGTT 


AGCCTGCCTT 


ACTTTTATGC 


AGGTCTGAGG 


900 


GTCAGTGTCT 


CCTACGCCTT 


TATCACAACT 


GTGGTATCTG 


AGTGGTTGGG 


AGGTTTTGAA 


960 


GGTCTTGGTG 


TTTATATGAT 


TCAGTCTAAA 


AAACTGTTTC 


AGTATGATAC 


CATGTTTGCC 


1020 


ATTATTATTC 


TGGTGTCGAT 


TATCAGTCTT 


TTGGGTATGA 


AGCTGGTCGA 


TATCAGTGAA 


1080 


AAATATGTGA 


TTAAATGGAA 


ACGTTCGTAG 


AATTAGAATG 


TTTCTGAAAA 


AGAAAAGAGG 


1140 
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AAATCAAAAT GAAGAAAACA TGGAAAGTGT TTTTAACGCT TGTAACAGCT CTTGTAGCTG 1200 

TTGTGCTTGT GGCCTGTGGT CAAGGAACTG CTTCTAAAGA CAACAAAGAG GCAGAACTTA 1260 

AGAAGGTTGA CTTTATCCTA GACTGGACAC CAAATACCAA CCACACAGGG CTTTATGTTG 1320 

CCAAGGAAAA AGGTTATTTC AAAGAAGCTG GAGTGGATGT TGATTTGAAA TTGCCACCAG 1380 

AAGAAAGTTC TTCTGACTTG GTTATCAACG GAAAGGCACC ATTTGCAGTG TATTTCCAAG 1440 

ACTACATGGC TAAGAAATTG GAAAAAGGAG CAGGAATCAC TGCCGTTGCA GCTATTGTTG 1500 

AACACAATAC ATCAGGAATC ATCTCTCGTA AATCTGATAA TGTAAGCAGT CCAAAAGACT 1560 

TGGTTGGTAA GAAATATGGG ACATGGAATG ACCCAACTGA ACTTGCTATG TTGAAAACCT 1620 

TGGTAGAATC TCAAGGTGGA GACTTTGAGA AGGTTGAAAA AGTACCAAAT AACGACTCAA 1680 

ACTCAATCAC ACCGATTGCC AATGGCGTCT TTGATACTGC TTGGATTTAC TACGGTTGGG 1740 

ATGGTATCCT TGCTAAATCT CAAGGTGTAG ATGCTAACTT CATGTACTTG AAAGACTATG 1800 

TCAAGGAGTT TGACTACTAT TCACCAGTTA TCATCGCAAA CAACGACTAT CTGAAAGATA 1860 

ACAAAGAAGA AGCTCGCAAA GTCATCCAAG CCATCAAAAA AGGCTACCAA TATGCCATGG 1920 

AACATCCAGA AGAAGCTGCA GATATTCTCA TCAAGAATGC ACCTGAACTC AAGGAAAAAC 1980 

GTGACTTTGT CATCGAATCT CAAAAATACT TGTCAAAAGA ATACGCAAGC GACAAGGAAA 2040 

AATGGGGTCA ATTTGACGCA GCTCGCTGGA ATGCTTTCTA CAAATGGGAT AAAGAAAATG 2100 

GTATCCTTAA AGAAGACTTG ACAGACAAAG GCTTCACCAA CGAATTTGTG AAATAATGAC 2160 

AGAAATTAGA CTAGAGCACG TCAGTTATGC CTATGGTCAG GAGAGGATTT TAGAGGATAT 2220 

CAACCTACAG GTGACTTCAG GCGAAGTGGT TTCCATCCTA GGCCCAAGTG GTGTTGGAAA 2280 

GACCACCCTC TTTAATCTAA TCGCTGGGAT TTTAGAAGTT CAGTCAGGGA GAATTGTCCT 2340 

TGATGGTGAA GAAAATCCCA AGGGGCGCGT GAGTTATATG TTGCAAAAGG ATCTGCTCTT 2400 

GGAGCACAAG ACGGTGCTTG GAAATATCAT TCTGCCCCTC TTGATTCAAA AGGTGGATAA 2460 

GGCAGAAGCT ATTTCCCGAG CGGATAAAAT TCTTGCGACC TTCCAGCTGA CAGCTGTAAG 2520 

AGACAAGTAT CCTCATGAAC TTAGCGGTGG GATGCGCCAG CGTGTAGCCT TACTCCGGAC 2580 

CTACCTTTTT GGGCACAAGC TCTTTCTCTT AGATGAGGCC TTTAGCGCCT TGGATGAGAT 2640 

GACAAAGATG GAACTCCACG CTTGGTATCT TGAGATTCAC AAGCAGTTGC AGCTAACAAC 2700 

CCTGATCATC ACGCATAGTA TTGAGGAGGC CCTCAATCTC AGCGACCGTA TCTATATCTT 2760 

GAAAAATCGC CCTGGGCAGA TTGTTTCAGA AATTAAACTA GATTGGTCTG AAGATGAGGA 2820 

CAAGGAAGTC CAAAAGATTG CCTACAAACG TCAAATTTTG GCGGAATTAG GCTTAGATAA 2880 
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GTAGAAAAAT AGGGAGTTGG TGAAGATTAT CCTTTACCAG CGCCCTTTTT CTTTTAAAAA 2940 

TGAGAAAATT TCGGTATAAT AGTCAAACAA GGTCAAGGTT TAAAGAGAGA GGTGGGTTTG 3000 

TTATGAGATT TAAAAATACA TCGGATCATA TTGAGGCCTA CATCAAGGCG ATTTTAGATC 3060 

AATCTGGTAT CGTGGAGTTG CAACGGAGTC AGTTGGCAGA TACCTTTCAG GTTGTTCCTA 3120 

GTCAGATTAA CTACGTGATC AAGACACGCT TTACGG AAAG TAGAGGCTAC TTGGTTGAAA 3180 

GTAAGCGTGG TGGCGGAGGC TACATTCGTA TAGGACGGAT TGAGTTTTCT AGTCATCATG 3240 

AAATGCTCCG GGAGCTGCTT TACTCGATTG GTGAGCGAGT CAGTCAAGAA ATTTATGAGG 3300 

ATATTCTCCA GCTTTTGGTT GAGCAGGAAT TGATGACCAA GCAGGAGATG AATTTGCTAG 3360 

AATCAGTAGC TTTGGATCGC GTTTTAGGAG AAGAAGCTCC AGTTGTTCGA GCAAACATGC 3420 

TACGTCAGAT CATACAAGAG GTAGATAGAA AAGGGAAGTA AGATGAACTA TTCAAAAGCA 3480 

TTGAATGAAT GTATCGAAAG TGCCTACATG GTTGCTGGAC ATTTTGGAGC TCGTTATCTA 3540 

GAGTCGTGGC ACTTGTTGAT TGCCATGTCT AATCACAGTT ATAGTGTAGC AGGGGCAACT 3600 

TTAAATGATT ATCCGTATGA GATGGACCGT TTAGAAGAGG TGGCTTTGGA ACTGACTGAA 3660 

ACGGACTATA GCCAGGATGA AACCTTTACG GAATTGCCGT TCTCCCGTCG TTTGCAGGTT 3720 

CTTTTTGATG AAGCAGAGTA TGTAGCGTCA GTGGTCCATG CTAAGGTACT AGGGACAGAG 3780 

CACGTCCTCT ATGCGATTTT GCATGATAGC AATGCCTTGG CGACTCGTAT CTTGGAGAGG 3840 

GCTGGTTTTT CTTATGAAGA CAAGAAAGAT CAGGTCAAGA TTGCTGCTCT TCGTCGAAAT 3900 

TTAGAAGAAC GGGCAGGCTG GACTCGTGAA GATCTCAAGG CTTTACGCCA ACGCCATCGT 3960 

ACAGTAGCTG ACAAGCAAAA TTCTATGGCC AATATGATGG GCATGCCGCA GACTCCTAGT 4020 

GGTGGTCTCG AGGATTATAC GCATGATTTG ACAGAGCAAG CGCGTTCTGG CAAGTTAGAA 4080 

CCAGTCATCG GTCGGGACAA GGAAATCTCA CGTATGATTC AAATCTTGAG CCGGAAGACT 4140 

AAGAACAACC CTGTCTTGGT TGGGGATGCT GGTGTCGGGA AAACAGCTCT GGCGCTTGGT 4200 

CTTGCCCAGC GTATTGCTAG TGGTGACGTG CCTGCGGAAA TGGCTAAGAT GCGCGTGTTA 4260 

GAACTTGATT TGATGAATGT CGTTGCAGGG ACACGCTTCC GTGGTGACTT TGAAGAACGC 4320 

ATGAATAATA TCATCAAGGA TATTGAAGAA GATGGCCAAG TCATCCTCTT TATCGATGAA 4380 

CTCCACACCA TCATGGGTTC TGGTAGCGGG ATTGATTCGA CTCTGGATGC GGCCAATATC 4440 

TTGAAACCAG CCTTGGCGCG TGGAACTTTG AGAACGGTTG GTGCCACTAC TCAGGAAGAA 4500 

TATCAAAAAC ATATCGAAAA AGATGCGGCA CTTTCTCGTC GTTTCGCTAA AGTGACGATT 4560 

GAAGAACCAA GTGTGGCAGA TAGTATGACT ATTTTACAAG GTTTGAAGGC GACTTATGAG 4620 

AAACATCACC GTGTACAAAT CACAGATGAA GCGGTTGAAA CAGCGGTTAA GATGGCTCAT 4680 



WO 98/18931 



PCT7US97/19588 



861 



CGTTATTTAA 


CCAGTCGTCA 


CTTGCCAGAC 


TCTGCTATCG 


ATCTCTTGGA 


TGAGGCGGCA 


4740 


GCAACAGTGC 


AAAATAAGGC AAAGCATGTA AAAGCAGACG ATTCAGATTT GAGTCCAGCT 


4800 


GACAAGGCCC 


TGATGGATGG 


CAAGTGGAAA 


CAGGCAGCCC 


AvjV_ I AA1\.UL 


AAAAGAAGAG 


4860 


GAAGTACCTG 


TCTACAAAGA 


CTTGGTGACA 


GAGTCTGATA 


III luALLAL 


CTTGAGTCGC 


4920 


TTGTCAGGAA 


TCCCAGTTCA 


AAAACTGACT 


CAAACGGATG 


v_ 1 AAIjAAu 1 A 


TTTAAATCTT 


4980 


GAAGCAGAAC 


TCCATAAACG 


GGTTATCGGT 


CAAGATCAAG 


CTGTTTCAAG 


CATTAGCCGT 


5040 


GCCATTCGCC 


GCAACCAGTC 


AGGGATTCGC 


AGTCATAAGC 


GTCCGATTGG 


TTCCTTTATG 


5100 


TTCCTAGGGC 


CTACAGGTGT 


CGGGAAAACT 


GAATTAGCCA 


AGGCTCTGGC 


AGAAGTTCTT 


5160 


TTTGACGACG 


AATCAGCCCT TATCCGCTTT GATATGAGTG 


AGTATATGGA 


GAAATTTGCA 


5220 


GCTAGTCGTC 


TCAACGGAGC 


TCCTCCAGGC 


TATGTAGGAT 


ATGAAGAAGG 


TGGGGAGTTG 


5280 


ACAGAGAAGG 


TTCGCAATAA ACCCTATTCC 


GTTCTCCTCT 


TTGATGAGGT 


AGAGAAGGCC 


5340 


CACCCAGATA 


TCTTTAATGT 


TCTCTTGCAG 


GTTCTGGATG 


ACGGTGTCTT 


GACAGATAGC 


5400 


AAGGGACGCA 


AGGTCGATTT 


TTCAAATACC 


ATTATCATTA 


TGACATCGAA 


TCTAGGTGCG 


5460 


ACTGCCCTTC 


GTGATGATAA 


GACTGTTGGT 


TTTGGGGCTA 


AGGATATTCG 


TTTTGACCAG 


5520 


GAAAATATGG AAAAACGCAT GTTTGAAGAA CTGAAAAAAG 


CTTATAGACC 


GGAATTCATC 


5580 


AACCGTATTG 


ATGAGAAGGT 


GGTCTTCCAT 


AGCCTATCTA 


GTGATCATAT 


GCAGGAAGTG 


5640 


GTGAAGATTA 


TGGTCAAGCC 


TTTAGTGGCA 


AGTTTGACTG 


AAAAA(iv»_AI 


TGACTTGAAA 


5700 


TTACAAGCTT 


CAGCTCTGAA 


ATTGTTAGCA 


AATCAAGGAT 




GATGGGAGCT 


5760 


CGCCCACTTC 


GCAGAACCCT 


GCAAACAGAA 


GTGGAGGACA 


AGTTGGCAGA 


ACTTCTTCTC 


5820 


AAGGGAGATT 


TAGTGGCAGG 


CAGCACACTT 


AAGATTGGTG 


TCAAAGCAGG 


CCAGTTAAAA 


5880 


TTTGATATTG 


CATAAAAGAA 


TAAAAGTATC 


AGCATCTGAC 


CATAAGTCAC 


AGTGGAGTGA 


5940 


AATTCAATGA 


AAATCAAAGA 


GCAAACTAGG 


CAGCTAGCCG 


CAGGTTGCTC 


AAAACACTGG 


6000 


TTTGAGGTTG 


CAGATAGAGC 


TGACGTGGTT 


TGAAGAGATT 


TTCGAAGAGT 


ATGAAACTAA 


6060 


AACCTATAGC 


TTCTAAACGA 


TCCGTGGTTT 


TCATCATTCA 


ACACAAAATT 


CATATGTTTA 


6120 


TTACCCTCCG 


TCGTATTTGT CTTAGAGCGT 


GTGTAGTAGA 


AAAAGAGCAG 


TCTTATCTGA 


6180 


AATTTTTATT 


CTTTCAAAAG 


AGACCTGTTT 


CTTTTTTGCA 


TGTCAAATCC 


GTTCTAGCTG 


6240 


GTATTTGAAA 


AATCAAACTA 


ATATTCAATG 


AAAATCAAAG 


AACAAACTAG 


GAAGCTAGCC 


6300 


GCAGGTTGCT 


CAAAACACTG 


TTTTGAGGTT 


GTAGATAGAG 


CTGACGTGGT 


TTGAAGAGAT 


6360 


TTTCGAAGAG 


TATAAGCTGC 


AAGATGAATG 


ATTTTCTTGT 


ATTGACGTTG 


TTGTTGACAA 


6420 
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AAAGTAGCGG ATAAATGAAA 


TCCATTCCAT 


TATCATAGAT 


GATAGGCTGG 


TAGGAAATTT 


6480 


TCAAATAGCA TACAGGAAAT 


AGATGTATGG 


AGTTCTGGTA 


GTAGAAAGGG AGAGAGATGA 


6540 


ACATTTTAGT TGCAGATGAC 


GAGGAAATGA 


TTAGAGAAGG 


AATTGCAGCA 


TTTCTGACAG 


6600 


AAGAGGGTTA TCATGTCATT 


ATGGCTAAGG 


ATGGACAAGA 


GGTCTTGGAA AAATTTCAAG 


6660 


ATCTCCCTAT CCATCTCATG 


GTACTGGATT 


TAATGATGCC 


TAGGAAGAGT 


GGTTTTGAAG 


6720 


TGTTAAAAGA AATCAATCAA 


AAGCACGATA 


TTCCTGTCAT 


CGTCTTGAGT 


GCTCTGGGAG 


6780 


ATGAAACTAC TCAGTCACAG 


GTATTTGATC 


TCTATGCTGA 


TGATCATGTG 


ACAAAACCTT 


6840 


TTTCTTTGGT ACTGCTTGTC 


AAGCGTATTA 


AGGCGCTTAT 


CAGACGTTAC 


TACGTCATAG 


6900 


AGGATCTTTG GCGATATCAG 


GATGTAACAG 


TGGATTTTAC 


CTCTTACAAA GCACATTATA 


6960 


AAAATGAAGA AATTGATCTC 


AAACCAAAGG 


AATTACTGGT 


ACTAAAGTGT 


TTGATTCAGC 


7020 


ATAAAAATCA AGTTTTAAGT 


AGAGAGCAGA 


TATTGGAAGA 


AATTTCAAAA GATGTAGCTG 


7080 


ATTTACCTTG TGATAGGGTC 


GTTGATGTCT 


ATATTCGTAC 


TCTTCGCAAA 


AAATTAGCTT 


7140 


TAGATTGTAT CGTGACTGTG 


AAAAATGTTG 


GGTATAAGAT 


TAGCTTATGA 


TAAAAAATCC 


7200 


TAAATTATTA ACCAAGTCTT 


TTTTAAGAAG 


TTTTGCAATT 


CTAGGTGGTG 


TTGGTCTAGT 


7260 


CATTCATATA GCTATTTATT 


TGACCTTTCC 


TTTTTATTAT 


ATTCAACTGG 


AGGGGGAAAA 


7320 


GTTTAATGAG AGCGCAAGAG 


TGTTTACGGA 


GTATTTAAAG 


ACTAAGACAT 


CTGATGAAAT 


7380 


TCCAAGCTTA CTCCAGTCTT 


ATTCAAAGTC 


CTTGACCATA 


TCTGCTCACC 


TTAAAAGAGA 


7440 


TATTGTAGAT AAGCGGCTCC 


CTCTTGTGCA 


TGACTTGGAT 


ATTAAAGATG 


GAAAGCTATC 


7500 


AAATTATATC GTGATGTTAG 


ATATGTCTGT 


TAGTACAGCA 


GATGGTAAAC 


AGGTAACCGT 


7560 


GCAATTTGTT CACGGGGTGG 


ATGTCTACAA AGAAGCAAAG 


AATATTTTGC 


TTTTGTATCT 


7620 


CCCATATACA TTTTTGGTTA 


CAATTGCTTT 


TTCCTTTGTT 


TTTTCTTATT 


TTTATACTAA 


7680 


ACGCTTGCTC AATCCTCTTT 


TTTACATTTC 


AGAAGTGACT 


AGTAAAATGC 


AAGATTTGGA 


7740 


TGACAATATT CGTTTTGATG 


AAAGTAGGAA 


AGATGAAGTT 


GGTGAAGTTG 


GAAAACAGAT 


7800 


TAATGGTATG TATGAGCACT 


TGTTGAAGGT 


TATTTATGAG 


TTGGAAAGTC 


GTAATGAGCA 


7860 


AATTGTAAAA TTGCAAAATC 


AAAAGGTTTC 


CTTTGTCCGC 


GGAGCATCAC 


ATGAGTTGAA 


7920 


AACCCCTTTA GCCAGTCTTA 


GAATTATCCT 


AGAGAATATG 


CAGCATAATA 


TTGGAGATTA 


7980 


CAAAGATCAT CCAAAATATA 


TTGCAAAGAG 


TATAAATAAG 


ATTGACCAGA TGAGCCACTT 


8040 


ATTAGAAGAA GTACTGGAGT 


CTTCTAAATT 


CCAAGAGTGG 


ACAGAGTGTC GTGAGACCTT 


8100 


GACTGTTAAG CCAGTTTTAG 


TAGATATTTT 


ATCACGTTAT 


CAAGAATTAG 


CTCATTCAAT 


8160 


AGGTGTTACA ATTGAAAATC 


AATTGACAGA 


TGCTACCAGG 


GTCGTCATGA GTCTTAGGGC 


8220 
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ATTGGATAAG GTTTTGACAA ACCTGATTAG TAATGCAATT AAATATTCAG ATAAAAATGG 8280 

GCGTGTAATC ATATCCGAGC AAGATGGCTA TCTCTCTATC AAAAATACAT GTGCGCCTCT 8340 

AAGTGACCAA GAACTAGAAC ATTTATTTGA TATATTCTAT CATTCTCAAA TCGTGACAGA 8400 

TAAGGATGAA AGTTCCGGTT TGGGTCTTTA CATTGTGAAT AATATTTTAG AAAGCTATCA 8460 

AATGGATTAT AGTTTTCTCC CTTATGAACA CGGTATGGAA TTTAAGATTA GCTTGTAGAC 8520 

AGATTAGTTT TTTATTAAAG TTCATATAGG GTTAACATAA GTGTGTTATT CTTTGTGTAG 8580 

ATAAAAGAAA GGATACTAAT ATGGTATTAG CGATTATTTT AGTAACATTC TTTATTCGAT 8640 

TGATTTTTTT AAAGCGTTCG ATAGAGAATG AGAAACGAAT CCTTAGCAAT GGCGGGG 8697 



(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

AACCATACAT ACGGCAAGGC AAAGCTGACG CGGTTTGAAG AGATTTTCGA AGAGTATTAG 60 

TTGCCTTTAA AGGCATCCAC CATCGTTTGA AATTCTTCAT TTGAGAGAGT AATCCCTTTG 120 

CCCATTTTAG TATGGTCTGG ACTCCAAGCA CGAATATCAA ACTTTGCAGG GGCACCATTA 180 

AAGCTCACAC GGTTAATTTC CTTGGTCCAA CCTTTTTCGT TTTCAGAAAG AGTCAACAAG 240 

TGCTCTTCGA TTTCAAATGT AAATTCTGCC ATTTTCTTCT CCTTTTTTAG TTTCATTAGT 300 

TTATTCGTAA AATCTTGTAG ATTTTAGGAA AATTTTATAT AATATTGATA TAAAAGAAGG 360 

GAGGCCAATA TGAGACATAA ATTCCAGCAA GTTCTAAATA AAATACATGA TTTTTTAAAT 420 

GGATATGACC AACCTGACCA GACTGAAACC AACTCCCTTA CAGCCACTAT TGAAGAGGCT 480 

ATCCAGAAAC AAACCGCTGT TCACCTTATC TTGTCTGAGA CAAGCTTTAC AGGTGACATC 540 

ATCAAATATG ATCAGCAAGG CCAGCAAATT ATCGTGAAAA ATTTTTCCAA AAATGTGAGC 600 

CGGATTATCC GTATAAGCGA TATTCAACGC CTGCGATTTG TCCCCTCAAC TGTCCAAACA 660 

GCCCAAAAAA ATAGATTTAA GAAAGAGTGA GATGTAGTTG CTTCATCCCA CTCTTTTTTC 720 

TTAGCGAATT TGTTCAAAAT GTAAATGAAC TGCGATATGA TCTCCATAAC CACTTCTTTC 780 

CAAGTCACGT TGTAAACGAT AGGAAATGTA GTGTTCTGCA ATGGTAATGT AACCTGCGCC 840 

CAATAAACGA TGTTCAACCA TAGATTGAAT CATACTGATA GTCGCACGTT CCACCTTGGC 900 
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TTCTTGTAAA TCCAAAACTA CCTTCTTAGT GACTTGAGCA AGATTTTGAC GCAAATCATC 960 

TGTCAAAACA TAAACAGTTT GGGCTGCCTT CAAGATGGCT TGGTAAATCT TATCTGGATT 1020 

AAATTCAGCA ATTTCGCCAT TACGTTTGAT TACTTGCATA GGTTTCTCCT TTATTCTTTG 1080 

TTTTCTTTGA TTTCTGCCAG CATTTTTTCT TCTTCTACTG TCAGTTGATA ATGTTCAAGT 1140 

AAATCCGGTC TGCGCTCGTA GGTTTTCTTT AAACTCTCGT ACAATCGCCA CTGACGAATC 1200 

TTTTCATGGT GGCCACTCAT CAATACATCT GGCACGACCA TGCCTCGATA ATCATAGGGA 1260 

CGTGTGTACT GAGGATATTC TAAAAGACCT GAAGAAAAAC TATCATCTTG GTGGCTAGAC 1320 

TCCTTGCCAA TCACTTCTGG AATCAGGCGA ACTGTAGCAT CAATCATGGT CATAGCTGCC 1380 

AATTCTCCAC CAGTGAGGAC ATAGTCACCT AGGGAAATCT CATCTGTTAC CAAGGTCTTA 1440 

ATGCGCTCAT CATAACCCTC ATAGTGCCCA CAGATAAAGA TTAGCTCTTC CTCTTGAGCC 1500 

AAATCTTCAG CATAAGCCTG ATCAAACTGC TTTCCAGCAG GATCAAGGAG AATAACGCGC 1560 

GGATTTTTCT TTTCAATAGC ATCAAAGGAA TCGAAAATAG GTTGTGCTCT GAGCAACATG 1620 

CCCTGACCGC CTCCGTAGGG CTCATCATCT ACATGACGGG CCTTTTCAGC ATTTTCTCGA 1680 

AAATTATGAT ACTGGATATC CAAGAGCCCT TTTTCTCGAG CCTTTCCAAC GATTGAGTGC 1740 

TCCAGTGGAG AAAACATCTC TGGAAAGAGG GTTAAAATAT CAATCTTCAT CGTCTAACCC 1800 

TTCTAAGATT TCCACATCGA CCCGTTTACT TGGAATATCA ACATTGAGAA CCACTGGTGG 1860 

GATATAAGGT AAAAGCAAAT CACGTTTGCC TTTTCGTTTG ACCACCCAGA CATCATTAGC 1920 

ACCTGGTTGC AGGATTTCCT TGATGGTTCC AACCAAGCTA TCACCCTCAT AGACTTCCAA 1980 

ACCGATAATC TCGTGATAGT AAAATTCACC ATCGTCTAGG TCATTCAAAT CTTCCTCAGC 2040 

GACCTTGAGA CTGTATCCCT TGTACTTTTC GATAGTATTG ATATGGTACA TATCTTTGAA 2100 

TTTAATAATG TCAAAGTTCT TCTGTTTACG GTGGCTAGCG ATGGTCACTG TTTGGACAAA 2160 

CTGATCTTTT TCATCAAACA AAACCAGCTC AGCTCCTTTT TTAAACCGTT CTTCTGCAAA 2220 

ATCCGTCACA GACAAGACTC GCATCTCCCC CTGTAATCCC TGCGTATTAA CGATTTTCCC 2280 

AACATTAAAG TAGTTCATCT TGTCTCCTGT AATCTCCTTT TTTCCATCTT ATTCTAACAA 2340 

TTCTCGAATA ATAGCCGCAA TTTTTTCCGA TTCTGACCAT TGTAAATAAT GGTGATTCCC 2400 

TCCTAAAATG AGTTTAGTAT TGGAAGTCCA ATATTCTGAT TCTCTGTACT CTTTTTCTCT 2460 

ATAAGGCTGA CAAAAAACAA ATACAGGAAT ATGAGCTTCT ATAGATACAT CCTCAAAATC 2520 

TTCCTCAGTA ATCTCTCCAG ATATCTGAAA TTCTGGATCT TGATTTTCCA ACTCTAAGCC 2580 

TTTTTCTTGC ATTAATTCCC AGATTTTTTT ATTCGTTTCA GGACTAAATG TTGCTTGAGT 2640 

TAAGTTCTTA AAATAAAGTT CAGGACCACA CTCGTCAATC AGCCTCATCT GCTCTTCCAT 2700 
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TTCTGGATAA GGATTTTCTG AAAAATCAGC AAACATGACT TTTTTAGTTG TCGGTTCAAT 2760 

TGCTACTAAA GTCTGACGCT TAATTGGTTT CTCGAGTAAT TTGCAAGCTA AAATTCCACT 2820 

CCAACTATGT GCACAAAGTA TATATTCAGA AATTCCTAAT TCTTCAAGTA CTTCATAAAC 2880 

CGCATCTGCA AGATTATCTA GATTTTTTCC AGCTTGGTCA TGAATCGGAC TCCTACCTGT 2940 

GTTCGGAAAA TCAATTGTCA AATAACCAAT TGTAGGAGGA GGTTTTTCAA GTATAAGTGA 3000 

AAAATTTTCA TAACTTGGTA GCAAACCTGC TCCGTTTAAA CAAACTAGCA CTTTCTTTTG 3060 

CTTTTGATAA GTAACAGAGA GGCTACCAAT TTCTGTAGAT ACTTCAAACC TCTTCATAAA 3120 

GAAATCCACT GATTCTATAT AATGAATTAT TAAAAATCCT TATCCTTTAT TTTATCACGT 3180 

TCCAAGGATT TTCTCAAGTT GGAGGAAGGG GACAATATCT CTACTTTCCC TTCAATAATC 3240 

CTTCCAAATT ATGTTTATGT TGGTAATTAA TGGCTGCGGT TTTGTCTTTC TCAAAGACAG 3300 

TCTTGGTAAG GTCAATATGA TTAATAGCTA CGATTGCGAC GGTGTAGTAA ATGATATCAG 3360 

CCAGTTCTCT GGCAAGTTCC TCGTTCGAAT CCTATCCCTT CTTTTCGACC AGAGCGCCTA 3420 

TTCAAAACCT CGACTACTTC TCCGACTTCC TCCACTAACT TCATAAAGAG ACCTTCATCA 3480 

GTCCGAGACT GCTGTTAATG TTCGATTAAG TAGTCTTGGA ATTGCCTAAA CGTTCAATCT 3540 

TTTATAGTAT ATTGAAACTA GAATAGTACA CCTTTACTTC TAAAACATTG TTAGAAATCG 3 600 

ATTTGACTGT CCTGATCGAT TTGTCCTGTT CTTGTTTCAT TTTACTATAT CTTCTATTCC 3660 

ACACAAAAAA GCGAGACATC CGTCCCGCCC TTCTTATTTT TCGTCAATAA CGATTCTTAC 3 720 

TTTTTTGTAT TCAGTTGGGA CAGAGTAGAC AATCGTTCTT ATCGCAGAAA TAGTGCGACC 3780 

CTTACGACCG ATTACACGAC CCACATCGCT TTGATCAAGA TTCAAATGAT ATTCCAAAAA 3840 

TTCTGGTGTA TCCTCAATCT TGATAGTTAA GGCATCTGGT TGTGAAATTA AGGGTTTCAC 3900 

AATCGCAATA ATGAGATTTT CAATCGTATC CATCTGTCAA CCTACTTTAA ACTTATTTTG 3960 

AAAATTTAGA ATCGTGGAAT TTTTTCAATA CGCCTTCTTT TGAAAGGATG TTACGTACTG 4020 

TGTCTGAAGG TTGAGCTCCA TTAGCCAACC ATGCAAGAAC GCGGTCTTCT TTCAAAGTTA 4080 

CTTGGTTTTC AGCAACAAGT GGGTTGTAAG TTCCAACTGT TTCGATGAAA CGTCCGTCAC 4140 

GTGGTGAACG TGAATCTGCT ACGTTGATAC GGTAGAAAGG TTTTTTCTTA GAACCCATAC 4200 

GAGTCAAACG GATTTTAACT GCCATTTTTA AAGTCTCATT TCTTTAATTT TTTATTTCGG 4260 

TGAAATAGCT GAGCTATTTA GCACATGTTC TATTATAGCA GATTTCTGGC ATGTGTC 4317 

(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4881 base pairs 
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(B) TYPE: nucleic acid 
CO STRANDEDNESS : double 
tD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

AATTTATTTG ACTGGAAATT GTAGAGGGTT CTCGAAATTT CTTGAATGGT TAAAATAAGG . 60 

ACAAGAGAAA ACATGGATAT CTATATCCTT GTGCCAAAAA AACCACTGCC CTCCCCAGAC 120 

CAACCTGAGG AAAGCAGTGA TTCTTATTTT AGGAGTTAGG AATGAATACA CGAAATCAAT 180 

TTAGCTGATT ATTTTTTGTT TTTCAAGAAT TCATCGTATT GTTTTTGCAT TTCGTTCAAT 240 

ACTTTTTCGT AGGCACCTTC AGATTTCAAT TTTTCCATCA ATTCTGGAAT CGCTTTATCT 300 

GGGTCTACAG TACCAGTGTT GATAGCTGTA TCAAATTGTT GCATTGTGTT AGCAATAGCT 360 

GAGATTTCAG ATTTCACATT GTCAGTATTG AAGATAAATC CAAGCGCTGG AGATTCTTTA 420 

GCTTCTGCCA ATTCTTTCTT AGAATTTTCG ATTTGTTGGT CTGTAACGTT TTCGTTGATG 480 

TAAAGGATCC AGTTGTTACC AGTGTTCCAT CCACCCATGT GAGTGTTTCC TTTGTAGCCA 540 

TCAAGAACGC GAACACGGTT TTCTTTACCT TCAATTTTTT CCCAGTTCTT GCCTTCTGGA 600 

CCGTAAACAA GACCGTTCAA GAGTTCTGGG TTCGTATTCA AGAGGTTCAA GATTTCCATT 660 

GATTTTTCTT TGTTCTTAGA GTTGTTTGAG ATGACAAAGT TAGCAACTTG TGTTGTTTGG 720 

TTTTTCTTGA TGAAGTTAGT AATTGGTTTG ATTTGGATAT CTTTGTTGGC AACACGTGAA 780 

AGCAAGCTGT TACCGTAGTC AGCTGGTCCT ACTGTTTCTT CACGAACGAA CCAAGTATCT 840 

TGTTGAAGGT CAAAGGAAGT ATCGCTTGTT GCGACGTCTT TTGGAATGTA GCCAGCTTCA 900 

TAGAATTTGT GAAGAGTCTT CAAGTGTTCT TTGAAACGAG GCACTTCGTA ACGGTTTACA 960 

ACTTTAGTAG TATCGCCTTC AAGGTCGATA ACGAATGGAA GACCGTTTGC TACTGGGTAG 1020 

TCAAAATTAT CAGATGGGAT GAAAACTTTA CCAATAGCAA ATGGTACTAC GTCTGGAGCT 1080 

TTTTCTTTGA TTTGTTTCAA GACTGGCTCA AGAGTTTCGT AAGAAGTAAC ACCTGAAATA 1140 

TCGATACCAT ATTTAGCAAG GAGAGTTCCG TTGAAGGCAA AGTTTTGAGA TGATGCAACG 1200 

TTGGCTGCAA CTGGAACAGC GTAAATCTTA CCATTTACAG TATTACCCTT GATGTAAGCT 1260 

GGGTCAAGTG CTTTGTAAAG GTCTTTACCT TCTTTTTTGT ACAATTCTGT CAAGTCAGCG 1320 

TAAGCACCTT TTTGAGCATT TACAATATAG TTATCTGCAA AGGCAATATC ATAGTTTTCA 1380 

CCAGATGATG TGATAACTGA CATTTTCTTA CCATAGTCAC CCCAGCCAAG GTATTGGATA 1440 

TCCAATTTGG CACCAACTTT TTCTTCAATG ATTTTGTTGG CATTTGCTAA CAATTCATCC 1500 

AAGTTGTCTG GTTTGTCACC GATTTGGTAC ATTTTGATAA CAGGTTTGTC ACCTGAATCA 1560 
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GCAGCTTTTT TGCTGTTACC TGTCAAATTT CCACAAGCAG CAAGACCTGC AGCCAGAGCG 1620 

ACTACACTAG CAGATGCAAA AGCATATTTT TTCCAGTTTT TCATGATAAA AACTCCTTTT 1680 

TTTATTTTTA AACTTATAAA CAATGTAATG ATCTTATACT CAATAAAAAT CAAAGAGCAA 1740 

ACTAGAAAAC TAGCCGCAGG CTGCTCAAAG CACTGCTTTG AGGTTGTAGA TAAGACTGAC 1800 

GAAGTCAGTT ACATATATCT ACGGCAAGGC GACGTTGACG CGGTTTGAAT TTGATTTTCG 1860 

AAGAGTATTA ACTTCACACA AGGGAAGTTG GGAACTGAGA AATGTTATTT CTCAATAAGC 1920 

ACT ATT CTTT CACACCACCG ATAGTCAAAC CTTTTACAAA GTAGCGTTGG AAAAATGGAT 1980 

ACAAAATCGC GATTGGAAGG GTTGCAACCA CAACCATGGC CATACGACCT GTTTCTTTCG 2040 

GTAGAGCAAC TCCCAGTTGA CCAATCAAGC CGACCGCTTT GGCAATGTAG TCCATATTTT 2100 

GTTGGATTTG CATGAGCAAA TATTGCAATG GATACAAGTT GTCACTCTTG ATGTAAAGAA 2160 

GGGCGTTGAA CCAGTCATTC CAGAAACCAA GAGCTGTTAA GAGCGTGATG GTTGCGATAC 2220 

CTGGTAGTGA CAATGGCAAA CAGATTTGGA AGAAAATCCG GGCCTCACTG GCACCATCGA 22 80 

TACGAGCCGA TTCTAGAATG GCTTCTGGAA TGGTCTTCTT GAAGAAGGAA CGCATCAAGA 2340 

TGATGTTAAA TGGTGAGAGA AGCATTGGAA CAATCAAGGC CCAAACAGTG TCACCAAGCT 2400 

GAAGTACACG GGTCACCATG ATATAACCTG GTACCAAACC AGCGTTGAAC AACATACTGA 2460 

GAAGGACGAA GATGGTAAAG AATCTGCGAT ACTTAAAGGT TGTCCGTGAA ATAGCGTAGG 2520 

CATAGGTTGT TGTGATAAAG ACATTTGTCA ATGTCCCAAC TACGGTTACA AAGACAGAGA 2580 

TGAAGAGGGC TTGTAGGATT TTATCCTTAA ACTGTGCCAA AAACTCAAAA CCGTCTAAGC 2640 

CAAATTGGGA TGGGAAGAAG CTATAGCCGT ATTGGAGGAG GCTTTTCTCG TCTGTCACTG 2700 

AAATAATGAT AACGAATACA AAAGGTAGGA TACAAGAGAG GGCAATCAAA CCCGAAATGA 2760 

TACTGAAGAA GATATCTGCT TTCTTACTGA AGGAGTGAAT GCCGACATTA TCAATTTTTT 2820 

CTTTTTTAAT TTTCTTTTTT GCCATATTCT CCTCCTTTCT AGAACAAAGC TGAGTTTGGA 2880 

TCGACTCGTC TTGCAAGCAA GTTTGATAGG ATAACCAGAA TCAAACCAAC AACGGATTGG 2940 

TAAAGACCGG CTGCTGCAGC CATACCGATA TCTGCTGTCT GAGTCAAACC ATTAAAGACA 3000 

TATACGTCCA AAACGTTGGT TACATTGTAA AGCTGACCAG CATTGTGTGG GATTTGATAG 3060 

AAGAGACCGA AGTCTGCGCG GAAGATATTT CCGACTGCAA GGATGGTCAA TACAGTTACA 3120 

AGCGGAGTCA ACTGAGGAAT GGTTACGTTG CGAATACGTT GCCACTTGCT AGCTCCGTCC 3180 

ACTGTCGCTG CTTCGTAGTA GGTTGGATCA ATTCCCATGA TCGTCGCATA GTACATGACA 3240 

CTGCTATATC CAAAGCCTTT CCAAATACCT AGGAAAAGTA GGAGATAGGG CCAGATGCCC 3300 
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AGGTCAGCGT 


AGAAATTGAC 


TTCTTTGAGA 


CCAAGACTTT 


CCAATAGATG 


ATTGAACACC 


3360 


CCTTTATCAA 


TATTTAGGAA GGCATCTGTA AAGAAACTGA TGATAACCCA 


AGACAAGAAG 


3420 


TAAGGGAACA 


ACATAGAAGT 


TTGAAAAATC 


TTCACCATTC 


TCTTAGAACG 


GAGCTCGCTG 


3480 


AGGATAATGG 


CAATCCCTAC 


AGATACAACT 


AAACCTAGAA 


AGATAAAGCC 


AAGATTGTAG 


3540 


AGGACAGTAT 


TTCGTGTGAT 


AATAAAGGCG 


TCTCTTGAAC 


TAAATAAGAA 


TCTAAAATTA 


3600 


TCGAGTCCGA 


CCCATTTACT 


ATTTATGATA 


CTATCTATGA 


AACCATTACT 


GGTCATGTGG 


3660 


TAGTCTTTGA 


AGGCAACCAC 


GTTCCCAAAT 


ACTGGAATGT 


AAAAGAATAG 


AATCAACCAG 


3720 


AGTGCCCCTG 


GCAAAACCAT 


CAAGAGAAAG 


ATCCAGTTGT 


CTCTCAATGT 


TTTTGAAAAC 


3780 


TTTTTCATAA 


TTTCCTCCCT 


TTTTATTTTG . ATATCCATCT 


AAAAATTCTT 


TTTTAGACTT 


3840 


TTGATAACGA 


TTACATTATT 


AGTATACTCC 


TATTTGCAGG 


TTAGGTTAAA 


CTCCTAATTA 


3900 


TAGAAAAAAC 


TCCACAAATT 


ATGTAGCAGA 


TTTAAAACTT 


TATCACCACT 


ATCAAACAAA 


3960 


TGTCCTAAAT 


CAATTGTTTA 


TTTTATCTCT 


ATTAGCCCAG 


TGATGGCGTC 


ACTCTGTTAT 


4020 


AAGCATCCAA 


CAACGGGGTA 


TACTGAAAAA 


TCTCCAGACT 


AGGGAACTCA 


GCGATAGTTC 


4080 


CTAATCTGGA 


GATTTTTAAT 


ATGTTATTAG 


GCGTTTGCTT 


TCAACTTAGC 


AATAACCTCT 


4140 


TTAAGATTAT 


CAATCAACTC 


TGCTGCAGTA 


TGCTCAGAGC 


CTTTTTCATC 


TGCCAAGAAC 


4200 


AAAACTGCTT 


TTTGAAGTTC 


TTTTTGAGAG 


TTTTCAAGGA 


CATCCTTATC 


TACTGTTTCA 


4260 


AGGTTTGAGT 


CTTTAAGAAG 


TTTACTTAAT 


TCCTTGGCTA 


ATTTCTTGAG 


TTTGATTTGC 


4320 


AGACTCATCT 


TCTCCTGCTG 


TTTCTTTGCC 


CGCTGTTTGT 


CCTCCATCCT 


TAGTTGCTGA 


4380 


CTGGCTTTCC 


TTAATGGACT 


CTAGGGAAGC 


AATGGCATCT 


TTGACTGTTT 


GCAAGATATC 


4440 


ACGTAAACCT 


TGCTCTGTCA 


AACTATCATC 


TGCAAAAGCT 


TTATTAGCCT 


CTGCCAAAAC 


4500 


C AG ACGTGCT 


GAATCTGTGG 


TAGGATTCGA 


TACACCTGTC 


AATGATCTCA 


AAAGATTTTC 


4560 


TAAGGTTTGA 


GTCTGCTTAC 


TAATACTAGA 


CTAAAATCAA 


AAAGTATTAT 


ATAACAGTGA 


4620 


TATGAAATCA 


ACTAAAGAAG 


AAATCCAAAC 


CATCAAAACA 


CTTTTAAAAG 


ACTCTCGTAC 


4680 


AGCTAAATAT 


CATAAACGCC 


TTCAAATCGT 


TCTATTTTGT 


CTGATGGGCA 


AATCTTATAA 


4740 


AGAGATTATA 


GAACTTTTAT 


AGTAGTTTGA 


AATAAGATGT 


GAACATCTCT 


ATCAGGAAAG 


4600 


TCAAATTAAT 


TTATAGAAAT 


ATTTTAGCAG 


CCAAGGTGTA 


CTGTTATAGA 


TTCAATACAC 


4660 


TATACTTGGT 


GGTTTAGCTC 


G 








4881 



(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13121 base pairs 
(BJ TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

AGGATCCCCG GAAAAGGAGA CTAAAAATGA AGAAAAAATT TCTAGCATTT TTGCTAATTT 60 

TATTCCCAAT TTTCTCATTA GGTATTGCCA AAGCAGAAAC GATTAAGATT GTTTCTGATA 120 

CCGCCTATGC ACCTTTTGAG TTTAAAGATT CAGATCAAAC TTATAAAGGA ATTGATGTTG 180 

ACATTATTAA CAAAGTCGCT GAGATTAAAG GCTGGAACAT TCAGATGTCC TATCCTGGAT 240 

TTGACGCAGC AGTCAATGCG GTTCAAGCTG GGCAAGCCGA CGCTATCATG GCAGGGATGA 300 

CAAAGACTAA AGAACGTGAA AAAGTCTTCA CCATGTCTGA TACTTACTAT GATACAAAAG 360 

TTGTCATTGC TACTACAAAG TCACACAAAA TTAGCAAGTA CGACCAATTA ACTGGCAAAA 420 

CCGTTGGTGT TAAAAACGGA ACTGCCGCTC AACGTTTCCT TGAAACAATC AAAGATAAAT 480 

ACGGCTTTAC TATTAAAACA TTTGACACTG ■ GTGATTTAAT GAACAACAGC TTGAGTGCTG 540 

GTGCCATCGA TGCCATGATG GATGACAAAC CTGTTATCGA ATATGCCATT AACCAAGGTC 600 

AAGACCTCCA TATTGAAATG GATGGTGAAG CTGTAGGAAG TTTTGCTTTC GGTGTGAAAA 660 

AAGGAAGTAA ATACGAGCAC CTGGTTACTG AATTTAACCA AGCCTTGTCT GAAATGAAAA 720 

AAGATGGTAG TCTTGATAAA ATTATCAAGA AATGGACTGC TTCATCATCT TCAGCAGTGC 780 

CAACTACAAC TACTCTCGCA GGATTAAAAG CTATTCCTGT TAAGGCTAAA TATATCATTG 840 

CCAGCGATTC TTCTTTTGCC CCTTTTGTTT TCCAAAATTC AAGCAACCAA TACACTGGTA 90 0 

TTGATATGGA ATTGATTAAG GCAATCGCTA AAGACCAAGG TTTTGAAATT GAAATCACCA 960 

ACCCTGGTTT TGATGCTGCT ATCAGTGCTG TCCAAGCTGG TCAAGCCGAT GGTATCATCG 1020 

CTGGTATGTC TGTCACAGAT GCTCGTAAGG CAACTTTTGA CTTCTCAGAA TCATACTACA 1080 

CTGCTAATAC CATTCTTGGT GTCAAAGAAT CAAGCAATAT TGCTTCTTAT GAAGATCTAA 1140 

AAGGAAAGAC AGTCGGTGTT AAAAACGGAA CTGCTTCTCA AACCTTCCTA ACAGAAAATC 1200 

AAAGCAAATA CGGCTACAAA ATCAAAACCT TTGCTGATGG TTCTTCAATG TATGACAGTT 1260 

TAAACACTGG TGCCATTGAT GCCGTTATGG ATGATGAACC TGTTCTCAAA TATTCTATCA 1320 

GCCAAGGTCA AAAATTGAAA ACTCCAATCT CTGGAACTCC AATCGGTGAA ACAGCCTTTG 1380 

CCGTTAAAAA AGGAGCAAAT CCAGAACTGA TTGAAATGTT CAACAACGGA CTTGCAAACC 1440 

TTAAAGCAAA CGGTGAATTC CAAAAGATTC TTGACAAATA CCTAGCTAGC GAATCTTCAA 1500 

CTGCTTCAAC AAGTACTGTT GACGAAACAA CGCTCTGGGG CTTGCTTCAA AACAACTACA 1560 
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AACAACTCCT TAGCGGTCTT GGTATCACTC TTGCTCTAGC TCTTATCTCA TTTGCTATTG 1620 

CCATTGTCAT CGGAATTATC TTCGGTATGT TTAGCGTTAG CCCATACAAA TCTCTTCGCG 1680 

TCATCTCTGA GATTTTCGTT GACGTTATTC GTGGTATTCC ATTGATGATT CTTGCAGCCT 1740 

TCATCTTCTG GGGAATTCCA AACTTCATCG AGTCTATCAC AGGCCAACAA AGCCCAATTA 1800 

ACGACTTTGT AGCTGGAACC ATTGCCCTCT CACTCAATGC GGCTGCTTAT ATCGCTGAAA 1860 

TCGTTCGTGG TGGTATTCAG GCCGTTCCAG TTGGCCAAAT GGAAGCCAGC CGAAGCTTGG 1920 

GTATCTCTTA TGGAAAAACC ATGCGTAAGA TTATCTTGCC ACAAGCAACT AAATTGATGT 1980 

TGCCAAACTT TGTCAACCAA TTCGTTATCG CTCTTAAAGA TACAACTATC GTATCTGCTA 2040 

TCGGTTTGGT TGAACTCTTC CAAACTGGTA AGATTATCAT TGCTCGTAAC TACCAAAGTT 2100 

TCAAGATGTA TGCAATCCTT GCTATCTTCT ATCTTGTAAT TATCACACTT TTGACTAGAC 2160 

TAGCGAAACG CTTAGAAAAG AGGATTCGTT AATGGCAAAA TTAAAAATTG ATGTAAATGA 2220 

TTTACACAAG CACTATGGAA AAAATGAAGT CCTAAAAGGA ATTACGACTA AGTTCTATGA 2280 

AGGAGATGTT GTTTGTATCA TCGGTCCTTC AGGTTCTGGT AAGTCAACTT TCCTCCGTAG 2340 

CCTCAATCTT TTAGAAGAAG TCACTAGCGG TCACATCACT GTGAACGGCT ATGATTTAAC 2400 

TGAAAAAACA ACCAATGTTG ACCACGTCCG TGAAAATATC GGCATGGTAT TCCAACACTT 2460 

CAACCTCTTC CCTCATATGT CTGTATTGGA CAACATCACC TTTGCTCCTA TTGAGCACAA 2520 

GTTGATGACT AAGGAAGAAG CTGAGGAATT GGGAATGGAG TTGCTTGAAA AGGTTGGACT 2580 

AGCAGATAAA GCTAATGCCA ATCCAGATAG CCTATCAGGT GGTCAAAAAC AACGTGTGGC 2640 

CATCGCTCGT GGCCTAGCAA TGAATCCAGA CATCATGCTC TTCGATGAAC CAACTTCTGC 2700 

CCTTGACCCT GAGATGGTTG GAGACGTACT TAACGTTATG AAGGAATTGG CTGAGCAAGG 2760 

CATGACCATG ATTATCGTAA CCCATGAGAT GGGATTTGCT CGTCAGGTTG CCAACCGCGT 2820 

TATCTTTACT GCAGATGGCG AGTTCCTTGA AGACGGAACA CCTGACCAAA TCTTTGATAA 2880 

CCCACAACAC CCTCGTCTGA AAGAGTTCTT AGATAAGGTC TTAAACGTCT AAACTCAAAC 2940 

TGTAAGGATT TCCTTGCAGT TTTTCTACCT CGTATTGGAA TTTTTGATTT TTCGGAAAAT 3000 

TATGTTAGAA TTAAGTTTAT GAAATGAGGT TTCCTCATAC CTAGCAAGAC TAGGAATAAA 3060 

AATAGAAATT AGGTAGCTAG ATGTCATCTA AGGTTATTGT TACAATTTTC GGTGCGAGTG 3120 

GAGACCTGGC TAAACGCAAG CTCTACCCTT CCCTTTTTAG ACTATATCAA TCCGGCAATC 3180 

TTTCCAAGCA CTTTGCCGTT ATTGGAACTG CCCGTAGACC TTGGAGTAAG GAATATTTTG 3240 

AATCTGTAGT TGTCGAGTCC ATCCTTGATT TGGCAGATAG TACCGAGCAA GCCCAAGAAT 3300 

TTGCTAGCCA CTTCTACTAT CAAAGCCATG ATGTCAATGA TTCGGAACAT TATATTGCTT 3360 
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TGCGTCAATT ACAAGCTGAG CTTAATGAAA AATACCAAGC TGAACACAAT AAGCTCTTCT 3420 

TCTTGTCTAT GGCACCTCAG TTCTTTGGAA CCATTGCCAA ACACCTCAAA TCTGAAAACA 3480 

TTGTCGATGG CAAAGGTTTT GAGCGCTTGA TCGTTGAAAA ACCATTTGGT ACAGATTACG 3540 

CAACTGCAAG CAAGTTGAAT GACGAACTCC TAGCAACATT TGACGAAGAA CAAATTTTCC 3600 

GTATCGACCA TTATCTTGGT AAGGAAATGA TCCAAAGCAT CTTTGCAGTT CGCTTTGCAA 3660 

ACTTGATTTT TGAAAACGTT TGGAACAAGG ATTTTATCGA CAATGTTCAA ATTACCTTTG 3720 

CGGAGCGCTT GGGTGTAGAA GAACGTGGTG GCTACTATGA CCAATCCGGT GCCCTCCGTG 3780 

ACATGGTCCA AAACCACACT CTACAACTTC TTTCGCTCCT CGCCATGGAC AAACCAGCAA 3840 

GCTTCACAAA AGACGAGATT CGTGCTGAAA AGATTAAGGT CTTTAAAAAC CTCTATCATC 3900 

CAACTGATGA AGAACTCAAA GAACACTTTA TCCGTGGGCA ATACCGCTCT GGTAAGATTG 3960 

ATGGCATGAA ATACATCTCT TATCGTAGCG AGCCAAATGT GAATCCAGAA TCAACAACTG 4020 

AAACCTTTAC ATCTGGTGCC TTCTTTGTAG ACAGCGATCG ATTCCGTGGT GTTCCTTTCT 4080 

TTTTCCGTAC AGGTAAACGA CTGACTGAAA AAGGAACTCA TGTCAACATC GTCTTTAAAC 4140 

AAATGGATTC TATCTTTGGA GAACCACTTG CTCCAAATAT TTTGACCATC TATATTCAAC 4200 

CAACAGAAGG CTTCTCTCTT AGCCTAAATG GGAAGCAAGT AGGAGAAGAA TTTAACTTGG 4260 

CTCCTAACTC ACTTGATTAC CGTACAGATG CGACTGCAAC TGGTGCTTCT CCAGAACCAT 4320 

ACGAAAAATT GATTTATGAT GTCCTAAATA ACAACTCAAC TAACTTTAGC CACTGGGATG 4380 

AAGTTTGTGC GTCATGGAAG TTGATTGACC GTATTGAAAA GCTCTGGGCT GAAAATGGTG 4440 

CCCCACTTCA TGACTATAAA GCTGGAAGCA TGGGACCTCA AGCCAGCTTT GACCTACTTG 4500 

AAAAATTCGG TGCCAAATGG ACTTGGCAAC CAGATATCAC CTATCGTCAA GATGGTCGCT 4560 

TAGAATAAAA AAATTTCCTG CAAGTTTATG CcTTGCAGGA TTTTTGCTTC TGATTAGATT 4620 

AAACCTTCCA AGAGACCTTT CATAAAGTTT TCTGAGTTAA ACTCTCCAAT ATCATCGATT 4680 

TTTTCACCAA AACCAATCAA TTTTACAGGA ATATTGAGTT CTTCACGAAT GGCTAGAACC 4740 

ACACCTCCTC GAGCAGTTCC ATCAATCTTA GTCAAAACAA TTCCCGTTAA AGGTGTGATT 4800 

TTCGAAAATT CTTTGGCCTG TACTAGGGCA TTTTGACCTG TTGATGCATC AAGTGCCAAG 4860 

AAGGTTTCAT GTGGTGCTTC TGGCACAACA CGTTTGATAA TACGACCAAT CTTTTCCAAC 4920 

TCAGCCATAA GGTTATCCTT ATTTTGCAGA CGACCAGCAG TATCAATCAT GAGAATATCG 4980 

ATACCTTCAG TCACGGCACG TTCCATACCA TCAAAGACCA CGCTGGCTGG ATCAGCTTTT 5040 

TCAGGTCCAG TTACTACTGG AACATCTACT CGTCGGCCCC ATTCAGCTAG CTGAGCTACT 5100 
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GCACCCGCAC GGAAGGTATC TGCTGCAACC AGCATGACCT TCTTACCAGC TTGTTTGTAG 5160 

CGGTGGGCTA GTTTTCCGAT AGAAGTTGTT TTCCCAACAC CATTCACACC AACAAAGAGC 5220 

ATAACTGTCA AGTTATCTTG GAAGTGGATG CTTTCATCGT AGCTACCATC CTTTTCATAA 5280 

AGCTCAACCA ATTTCTCAAT GATGACACGA CGAAGTACAT CAGGTTTCTT GGCATTTTCA 5340 

AGCTTGGCTT CGTAACGTAG TTCCTCCGTT AAGTTAGAAG CGACTTGGAC ACCAACATCA 5400 

CTCATAATCA GCAGTTCTTC CAGTTCCTCG AAAAATTCTT CGTCAACAGA GCGGAAGTTA 54 60 

GCAAAGAAGG CATTCAAGCG GGCACCGAAA CCTGTGCGAG TTTTCTTAAG ACTGCGGTCA 5520 

TATTTTTCCT GAACAGTTTC TTCTGTTTGA GGAGCTTCTG GTTCAAGCAC TTCAGAATTA 5580 

TTTTCTTCTA CAGTTCCTTC GTGCTCAAGC TTCTCTTCCT CTGGTAATTC TTCTGAGTTT 5640 

GGTAATTCTT CTATTTCTTC TTGAGAAACC CCTACAGCTG GCTCTGAATC CTGACTTTCT 5700 

TCAACTGTGT CTTGGATTTC CTCTTCTTGG AACACAGCTT GTTCAACAAT TTCAACCTCT 5760 

GCTTCTTCCT GAGAAACTTC CTCAACTTCT GTGAAGGTAG GATCAACATC TTCAGACAAA 5820 

TCAAGATTTT CCAGAGCTTC TTTTACAACT TCTTCGATTT TAGGTTCTTC TTTTTTTCCG 5880 

AATAGACGGT CAAACAATCC CATATCTTAG TTCTCCTTTA GCACATATTC TTCGATAGCC 5940 

CAGGCGACAG CTTCCTCATC GTTGGTCATC GGCGTCACTA CATTTGCGGC TGCCTTTACT 6000 

TCAGGAACAG CGTTTTGCAT AGCAACACCA AGACCTGCCC ATTCAATCAT AGAGAGGTCA 6060 

TTGGCCTCGT CACCACAAGC CATCACTTGA CTTTGGTCGA TTCCAAGATG GCTGATTAGT 6120 

TTTGCCAAAC CTGTTGCTTT ATGAACATTC TTTGGTGACC ATTCTAGCAA CATTTCACGT 6180 

GATTTAAAGA TTTCATATTG GTCAAACAAT TCTGGAGAAA TCTTCTGAAT GGCTGCATCC 6240 

AAGGGTTCTT GAGCAAAGGC AGTCACGCAT TTGTTGTAGG TCATTTGACT AGATAAGTCT 6300 

TCAAAGTCCA CTGGAACAAA GGTCAAAGCT GGATTGAATT TGGCATAAAG ACTTTCTTGG 6360 

TCCGATTGGA TTTGATAAAC TGTTCCTTCT GAGATGGCAT CAAGAGGCAG TGATAATTTC 6420 

TCTGTTTCTT CATACAAACG TGCCACATCA TCATATGAAA AGACTGTTTT ATCAAGGATT 6480 

TCTCCTGTAT TTTTCTGAAC TAATCCACCA TTAAAAGTAA TGGTATACTC ATCTTCCTGA 6540 

CCGTCAGTCC CTAACTCATG GAGAAAGAAA TCCATGGCTT TTAAGGGACG ACCAGTTGTC 6600 

AATACGACCT TGATACCACG ATCACGCGCA gCTTGCAAGG TTTCCTTGGT ACGATCCGTC 6660 

AGCCTTTTAT CAGTAGTCAG CAAGGTCCCG TCCAAGTCCA ATGCAATCAA TTTTATATCT 6720 

GCCATTATAA GCCCTCCATA TAAGCTATAA CCGACCGTTC CTTATGGTGA CCAATCACAG 6780 

TCTTTGCTAA TTCTAAAATT TCAGGTCGTG CATTTTCAGG AGCTACAGGA TGTCCCACAA 6840 

CCTGCATCAT ATGTAAGTCA TTAAGATTGT CTCCAAAAGC CATGACCTGA TCCATTGTGA 6900 
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TACCAAGTTT TTTAACTAAT TCAACAATGG CCACTCCCTT ATCGACATAG TCCAGAACAA 6960 

TATCAATGGA TTCAAAGCCA GTTGTCATGG CCTTAACACC AGGAACGTTT TCGTTTACCC 7020 

AAGCCTCCCC ATCTTCCAGC GTTTCTTCTG TGAAGTTGGT TGTAAATTTG AAAATGTCAT 7080 

CTGTGATATC TTCCAAACTC GCTACTTTTT GGATATTTTC ATTATAGTGC TGACTCACTT 7140 

TCAAATAGGT CTCATCAACC GTATCTAGAA CATATGAACC CTTCTTACCC GTCAAGAGCA 7200 

GTTTATTGAT ATCTACATAA GGTGAAGTTT TCAGCTTTTC AAAAGTTGCC AGATAAAAGT 7260 

CACGAGACAT AGTCGCTTCA TACAAGTCCT GACCTTGATA CTCTACCAAA CTGCCATTTT 7320 

CCGCGATGAA AATAATGTCA TCACGAACAC CAGCAAATAA TTTTTCTAGA GACAGAAATC 7380 

CCCGACCCGA AGCTACCGCA AAGTAAATCC CTTTTTCCTT GTAGGAAACC AAGAGAGACT 7440 

TGAGACGATC CATATCAAAG CGTCCATTCC CATCTAGGAA GGTTCCGTCC ATATCCGTTG 7500 

CTACTAGTTT AATTGTCATC CTTCAATACT TTCTAAATCT TTTAACTTAA CTGAAACAAT 7560 

CTTTGAAACA CCCGATTCTT GCATGGTCAC TCCATAGATG GAATCAGCCG CTGCCATGGT 7620 

TCCCTTACGG TGGGTTACGA CGATGAACTG GCTGTCCTTG TCAAAGCGGT TGAGGTAATC 7680 

CCCAAAACGT TTAACATTGG CTTCATCCAG CGCAGCTTCC ACCTCATCCA AGATAACAAA 7740 

TGGAATAGTC TTGACACGAA TAATGGAGAA GAGCAAGGCA AGAGCCGATA GGGCTTTTTC 7800 

ACCACCACTC ATGAGATTAA GAGACTGGAT TTTCTTGCCT GGTGGTTGGA CAGAAATTTC 7860 

AACCCCAGCT GTCAGCAAGT CTCCTTCAGT CAAAATGAGG TCAGCCTGAC CTCCACCAAA 7920 

CATCTGCTTG AAGGTCACTT TAAAGGACTC ACGAATGACC TCAAAGGTTG ATTTAAAGCG 7980 

TTCCTTGACC TCATCATTCA TCTCTGTAAT GGTCTCAAGG AGCAGGTTTT TCGCAGACAA 8040 

AATATCATCA CGTTGGCTAT TTAGGAAATC CAGACGGTTG TGAACTTCTT CGTACTGTTC 8100 

AATAGCGTCT AAATTGACAG GACCCAGTGA GCGTATAGCC TTCTCTAAAT CCTTAACTTC 8160 

TTGCTCTGCC AGATTGAGAT TTTCCAACTC ATGCGCCTTT TCTAAAGCTT CTGTGTAGCT 8220 

GATCTGGTAC TGGTCTGTTA ATTGACTTTG TAGATGGCGC AAGCGCTCGC TAACCTTTTC 8280 

TTTCTTGGCT TCAGCACGAG TTTGCTTGCG AATCCACTCT TCATTCTGCT GGCGAGCCTG 8340 

ATCCAAATGA CTAGCAATAT CATCCAGTTG ACCCTCAATA TCATCCAACT CAAACTGCTT 8400 

GCGAATCAAA CCTTGTTGGA GATTTGTTTT TTGAGTTTTG GATTCTTCCG CCTGTTGACT 8460 

GAGCAATTCT GTATCAACCT TCTCAAGATT ATCAATCTTT TCTTGAAGAA GGCGCTGGAT 8520 

TTCCTCTTGT TCAAAATCAA GATTGTCCAA TTCCTTGCCT AAGCGTTCAA TATCAGCAAC 8580 

TTCATAACGT TTTTGCCCTT GCAGTTCTGT CTTAAGCAAA CGAGCTTGCG CTAGCTCTTC 8640 
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CTGCAAGTTT TGATAGCGTT CTTGGATGGC ATTTTTGTTA GACTTAATCT CTTCAATCTC 8700 

AGCTTCCAGA TTTTGCTTGT CACTGGAGAT TGCAGCAAGA CGCTCTTGGC AGTTTTCCTT 8760 

ATCCGCTTGC CAATCTCCCT CGGAAAGACG ATCTATTTCC TCTTCTTGGA GTTTCCAAAG 8820 

AGTTTCCAGT TCTTCAACTT GCTGACTAGT TTGCTGATAA GCGAGGAACA AGCCTTGCTC 8880 

CTGAATACGT GCCTGCTCTC CTTGAGATTT AATAGCTTCT AATGACTCGG TCAATCTGGC 8940 

CATCTCATCT TGCAAGGTCT TCAAAGTCGC CTCTTCTGAA CCCAAGCTTG CTTCTTCTTC 9000 

AGCAATTTCT TTTTGTAATT GCTCCAGTTC TGGCTTGATA AAAATGCTGT TATTCTGGCG 9060 

ATTGGCACCA CCTGCATAAG AACCACCTGT GCGCAACTCT GTCCCATCCA ATGTCACCAT 9120 

ACGAACCTGA TAACGAACTT GGCGAGCTGC TGCACGCGCA TGTTCTACGG TATCAAAGAT 9180 

AGCCGTCGTA GCTAGCAAGT TCTTGAAAAT GGCTTCCAGT CTAGTATCAA AAGTCACCAA 9240 

CTCATCTGCC ATCCCAAGGA AACCTGGGCT TACAGCGATA GCATCTTGGT TCTGACTAGA 9300 

AATCGTACGC GCCTTGATAG TGGTCAAAGG AAGAAAGGTT GCACGACCGG CTCTGTTCCG 9360 

TTTAAGGAAG TCAATAGCCT TGGTTGCCGA CTCTTCATCT TCTACGATGA TATGCTGGCT 9420 

ACTTGCCCCT AAGGCAATCT CTAGGGCAGT TTGATAATAA ACATCAAAGG TCAGATGCTC 9480 

ACTGACTGCA CCAATAATCC CACCTAGGCG ATCTTTTTCT TGGAGAACAC TCTTAACACC 9540 

TGCATAAAAG TTACTATGAT TTCTCAGGAT ATTTTCCAAA CTTTGAGCTC TGGCCTGCTT 9600 

GTTTTTGAGA TTATCCAGAC GGTCAAAGAG TTGGCTTTGT TGAGCTTGAT AGGAAGTTTT 9660 

CTGCTCCTCT TGCTCCTTGG CAATAGCTTG GTAGTCAGCC AATAATTTCT GAACCTGCTC 9720 

CTTGGCAGTT TCAAGCTCTT CCTTTTGCTG ACTAGCCTTC TCTTTAGCTA TAGCTAATTG 9780 

CTCTTTCAGC TTTTCTAGTT GATCTGCTTG TTTTTGAGAA AGCTGACGAC TATTTTCCAA 98 4 C 

CTCATTCTCA ATACGGGTCA ACTGGTTTGA GACATCCGCT TCTTCTTGTA AAAGAGCTAC 9900 

AAAGCGTTCA CGTAAGAGCT CAATCATCTG ATCAGGATCG TCTGAGAAAG CCAGCAATTC 9960 

AGCTTCTAAA CGATTGAGTT TTTGATTATT TTGGACTAGA TTTCCCTCTA ACAGAGCTAA 10020 

AGAGCTTTCT TTATCAGACT TTTCTTTGCT GAGTGAATTT CTCTTATCCT CCAAAGCAGC 10080 

CAAACGGGCT TGTGCCTCCT GTTGATTCAA GGCCACTTGC TCGGACTCCA GTTTCGATAG 10140 

GGCTAATTTT CTTTCTAAAT CACTAATCAG ACTAGTCAAG TCCATCAAAC TGCCTTGGTC 10200 

TTTGGCCATT TCAGCCTGTA AATCTTGGCG TTGCTTTTTA AGAGTTTGAT TTTCTTCTTC 10260 

TAATTTTTCA CGCTTTTGGT AATAACTCAT CAAGAGTTCT TGAACCTGAG TCAACTCTTC 10320 

TTCTGTCGAC TCTAGTTCAG CCTTATTTTC CTTGATTTGA GCAACCAGAA CATCTAAATA 10380 

AATAGCCTTA CGTTGTCCTT CCAAGTCTAA AAACTTACGG GCATTCTCAG CTTGCTTCTC 10440 
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AAGAGGCTTG ATTTGATTAT CCAACTCGTA GATAATGTCC TCTAAGCGGT CCAGATTATC 10500 

CTGAGTTTGC TGCAGTTTAC TCTCGGTTTC TTTTCTGCGA GTCTTGTATT TTAAAACTCC 10560 

AGCAGCTTCT TCAAAAATAG CTCGTCGTTC CTCAGGCTTG GAATTAAAAA TCTCCTCAAC 10620 

CTTCCCTTGG GAAATAATAG AGAAGGAATC TCGTCCCAAT CCAGTATCCA AGAAGAGGTC 10680 

ATGAATATCA CGCAGACGGA CTTTCTTGCC GTCAATCTTG TATTCGCTAT CTCCACTACG 10740 

ATAGACATGG CGTTCCACCC TGATTTCTTG ACCTGCATCC TTGATAAATC CGTCATGATT 10800 

ATCCAGAGTC ACAACTACAG AAGCATAATT GAGCGGTTTG CGACTTTCGG TTCCAGCAAA 10860 

GATGATATCC GGCATCTTGC CCCCACGGAG ACTCTTGACA CTAGACTCCC CCAAAGCCCA 10920 

ACGCAGACTT TCTGTAATAT TGGACTTTCC AGATCCATTG GGTCCAACAA CTGCCGTCAC 10980 

ACCTTGGTCA AAAACGACCT TGGTCTTATC AGCAAAAGAC TTGAACCCCT GAATTTCGAT 11040 

TTCCTTTAAA TACATGAATC CAGCCCCTTC TCAACGGCAT TTTTGGCAGC TTCCTGCTCT 11100 

GCTAATTTCT TAGAACGACC TTGGCCTTGA CCGATGCTCT TACCTTCAAC AAGAACTTCT 11160 

ACATCAAAAA CCTTATCGTG AGCAGGCCCT GTTTCAGAAA TCACCTGATA ACGAATAGCC 11220 

ACATCACCAT TGACCTGAAG CAACTCTTGG AGATGGGTTT TATAGTCTGT AATCATCTCA 11280 

AACTCGCCTG CTTCAACCTT AGGAATCATG ACTTGATAGA TAAATTCCTT GACCTTGGCC 11340 

ACATCCTTAT CCAAAAGAAG GGCACCAAGA AAGGCTTCAA AGGCATCACC AAGAATGGTG 11400 

TCACGATTGC GACCACCTGA TTTTTCTTCC CCTTTACCCA ACTTGATAAA CTGGTCAAAC 11460 

TGGCAATCAC GCGCAAAACC AGCTAAACTC TCCTCACGGA CAATCATAGC ACGGAGTTTT 11520 

GATAGGTCAC CTTCAGGCTT TTTAGGATAT TTTTTATATA GATATTCTGA AATCAATAAC 11580 

TGTAGAACAG CGTCTCCTAA AAATTCCAAG CGTTCATTGT GTGAAATTTT TAAGAGGCGG 11640 

TGCTCATTGG CATAACTCGT ATGAGTAAAG GCAGTTTCCA GTAACTTTTT GTCTGCAAAT 11700 

TCGATTGCAA AATGATTCTT TAGTACAGTT TGTAATTCTT TCATACCAAC CTCTTTCTAA 11760 

CTGATAATAG TCCTTTTTAT TATATCAAAA AAAGCCCCCT GAGTCACTCT AAAACGGGAC 11820 

TGGAAAGCAT TTGGGAATTC TTTAGACAGA GATTCTCAGT TTTAGCGGCA AATTTGGGTC 11880 

AGGATAAAGA AAAAAGCCCT ATTAAAGGCT TTTTAGGATG TTTACATCCA CCCTGAGGGA 11940 

ATCGAACCCC CATCTCAAGA ACCGGAATCT TACGTGATAT CCATTACACT AAGGGTGGAA 12000 

ACTTGTTTTA TTATAACAGA AATTTGCTCT AATAACAAGT TTTTTGGTCA AAGACCCCGT 12060 

CTTAGTGGGA AGCATCCCCA TTCCAGATGG AGTTTTTCAC GATCACATAA TCAACGTGTT 12120 

TAAGGTCAGC AACCTGACGT CCACCTGCAT AAGAAATAGC ACTTTGAAGG TCTTGTTCCA 12180 
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TCTCAGTTAA 


AGTGTCTTGC 


AGATGACCTT 


TAGCAGGAAG 


CAAGATACGT 


TTGCCTTCCA 


12240 


CATTTTTGTA 


AGCACCTTTT 


TGATATTGTG 


AGGCTGAACC 


ATAATATTCT 


TTGAACTGTT 


12300 


CACCATCGAC 


TTCAATCGTT 


TTCCCTGGAC 


TTTCAATGTG 


TCCTGCAAAG 


AGGGAACCAA 


12360 


TCATGATCAT 


GCTAGCACCG 


AAGCGGATAG 


ACTTAGCAAT 


ATCACCGTGA 


GTACGAATTC 


12420 


CTCCATCAGC 


GATAATCGGT 


TTACGCGCAG 


CCTTGGCACA 


CCAGCGTAGA 


GCAGCCAACT 


12480 


GCCAACCACC 


TGTACCAAAA 


CCAGTCTTAA 


CCTTGGTGAT 


ACAAACCTTA 


CCAGGACCGA 


12540 


TTCCGACCTT 


AGTAGCATCC 


GCACCAGCAT 


TTTCCAATTC 


ACGCACAGCT 


TCTGGTGTTC 


12600 


CCACATTTCC 


AGCAATGACA 


AAGGTATCTG 


GCAATTCTTT 


CTTGATGTG T 






AAATCACGCT 


ATCCGCATGA 


CCATGAGCAA 


TATCAATAGT 


GATATACTCA 


GGAGTATCAG 


12720 


CCTTGAGCTG 


GCTAACAAAA 


TCATACTCAT 


AATCCTTAAC 


ACCGACAGAG 


ATAGAAGCAA 


12780 


TGAGCCCTTG 


ATTGTGCATT 


CGTTTAATAA 


AAGGAATGCG 


TCCTGCCTCA 


TCAAAACGGT 


12840 


GCATAATGTA 


GAAGTAACCA 


CCTTTAGCCA 


GTTGCTCTGC 


TACATTTTCA 


TCCAAAATCG 


12900 


TCTGCATATT 


CGCTGGCACA 


ACAGGTAGTT 


TAAAGGTGTG 


ATTTCCTAAA 


GTGACACTTG 


12960 


TATCCGCTTC 


TGCACGGCTT 


TTAATGACAC 


ATTTATTTGG 


AATCAATTGA. 


ATATCTTCGT 


13020 


AATCAAAAAT 


TGGAAATTCA 


TTTAACATAT 


CGATGTCTCG 


TTTCTTTTGT 


AATGACCTAC 


13080 


CTATGCTCTT 


GCATCACTAC 


GCCTTTTCCG 


ACGTTTCCTG 


G 




13121 



(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

CCGAATGCAA TGTTTACGGT TGAACTTGAA AATGGACATC AGATTTTAGC AACAGTTTCT 60 

GGTAAAATTC GTAAAAACTA TATTCGTATT TTAGCGGGAG ATCGTGTTAC TGTCGAAATG 120 

AGTCCATATG ACTTGACACG TGGACGTATC ACTTACCGCT TTAAATAATC GAAAAACTTG 180 

GAGGGATAAG AAATGAAAGT AAGACCATCG GTCAAACCAA TTTGCGAATA CTGTAAAGTT 240 

ATTCGTCGTA ATGGTCGTGT TATGGTAATT TGCCCAGCAA ATCCAAAACA CAAACAACGT . 300 

CAAGGATAAG ATAGAAAGGA GAAAACATGG CTCGTATTGC TGGAGTTGAT ATTCCAAATG 360 

ACAAACGCGT AGTAATCTCA TTGACTTATG TTTATGGTAT CGGACTTGCA ACATCTAAGA 420 

AAATTTTGGC TGCTGCTGGA ATCTCAGAAG ATGTTCGTGT ACGTGATCTT ACATCAGATC 480 
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AAGAAGATGC TATCCGTCGT GAAGTGGATG CAATCAAAGT 
AAGTAAACTT GAACATCAAA CGTTTGATGG AAATCGGTTC 
GTCGTGGACT TCCTGTCCGT GGACAAAACA CTAAAAACAA 
AAGCTGTTGC GATTGCTGGT AAGAAAAAAT AATATAGGAG 
AACACGTAAA CGTCGTGTGA AAAAGAATAT CGAATCTGGT 
ATTTAATAAC ACTATTGTTA TGATTACTGA TGTGCATGGT 
AGCTGGTGCT CTTGGTTTCA AAGGTTCTCG TAAATCTACA 
TTCTGAAGCT GCTGCTAAAT CTGCACAAGA ACACGGTCTT 
AAAAGGTCCA GGTTCTGGTC GTGAGTCAGC TATTCGTGCG 
AGTAACAGCA ATTCGTGATG TGACTCCAGT GCCACACAAT 
TCGCCGTGTA TAATCATCGC ATTACACTGC TTTTCGTTTA 
CGAGTTTGAA AAACCAAATA TAACAAAAAT TGATGAAAAT 
AATCGAACCA CTTGAACGTG GCTACGGTAC AACTCTTGGT 
TCTAGCTTCT CTACCAGGAG CAGCTGTGAC ATCTATCAAC 
GTTTGACACA GTTCCAGGTG TTCGTGAAGA CGTGATGCAA 
AATTGCAGTG AAATCGTACG TTGAAGACGA AAAAATCATC 
TGCTGAAGTA ACAGCTGGTG ACATTTTGAC AGATAGCGAT 
TCATTATCTC TTTACAATCG GTGAAGGTTC TTCTCTAAAA 
TGGTCGTGGA TATGTACCTG CTGATGAAAA TAAAAAGGAT 
TGCTGTAGAT TCTATTTATA CACCAGTTAC AAAAGTCAAC 
TGTAGGTAGC AATGATGGTT TCGACAAATT AACCCTTGAA 
TATTCCAGAA GATGCTTTAG GGCTTTCAGC ACGTATTTTG 
TACAAATCTT ACTGAGATTG CTAAGTCAAC TGAAGTGATG 
TGACGACCGT ATTTTAGATC GTACGATTGA GGAACTGGAC 
CTGTTTAAAA CGTGCCGGTA TCAATACTGT GCATGATTTG 
GATGATGAAA GTACGAAATC TTGGACGCAA GAGTTTGGAA 
TGATTTGGGT CTTGGATTAA AAGATAAATA AAGGAGGAAT 
GGACGCACTA GCTCACAACG TAAAGCAATG CTTCGCGATT 
AACGAATCAA TCGTGACAAC TGAAGCTCGT GCTAAAGAAA 



TGAAGGTGAC CTTCGTCGTG 540 

ATACCGTGGT ATCCGTCACC 600 

CGCCCGCACT CGTAAAGGTA 660 

GTAAAAGTCT TGGCTAAACC 720 

ATTGCTCATA TTCACGCTAC 780 

AATGCAATTG CTTGGTCATC 840 

CCATTCGCTG CTCAAATGGC 900 

AAATCAGTTG AAGTTACTGT 960 

CTTGCTGCCG CTGGTCTTGA 1020 

GGTGCTCGTC CTCCAAAACG 1080 

AGAGGGAGTA ACTAAATGAT 1140 

AAAGATTATG GCAAGTTTGT 1200 

AACTCTCTTC GTCGTGTACT 1260 

ATTGATGGTG TGTTACATGA 1320 

ATCATTCTGA ACATTAAAGG 1380 

GAACTGGATG TTGAAGGTCC 1440 

ATTGAAATTG TAAATCCAGA 1500 

GCGACTATGA CTGTTAACAG IS 60 

AATGCACCAG TTGGAACACT 1620 

TATCAAGTGG AACCTGCTCG 1680 

ATCTTGACAA ATGGAACAAT 1740 

ACAGAACATC TTGATTTGTT 1800 

AAAGAAGCTG ATACTGAATC 1860 

TTGTCTGTGC GTTCATACAA 1920 

ACAGAAAAAT CTGAAGCAGA 1980 

GAAGTGAAAC TCAAACTCAT 2040 

ACATGGCTTA CCGTAAACTA 2100 

TGACAACTGA CCTTTTGATC 2160 

TCCGTAAAAC TGTTGAAAAA 2220 
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ATGATTACTC TAGGTAAACG TGGTGATTTG CATGCACGTC GTCAAGCAGC TGCTTTCGTA 2280 

CGTAATGAAA TCGCATCTGA AAACTATGAT GAAGCAACTG ATAAGTACAC TTCTACTACA 2340 

GCACTTCAAA AATTGTTCTC AGAAATCGCA CCTCGTTATG CTGAACGTAA CGGTGGATAC 2400 

ACTCGTATCC TTAAAACTGA ATCACGTCGT GGTGATGCAG CGCCAATGGC GATCATCGAA 2460 

TTAGTATAAA ATCATCAATT TTGTTGAGTG TTATGATGAT GGAGTCTTGT GCTCTTAGTC 2520 

TAGCTCTGGT CTACCGCTAG GATTTCGGTC CTAGCGGGAA CACTCATCAT AAGTTGGGAT 2580 

AGTAGACGCT TGTTTACGAA ATTGTTTTTT TCTTAAGAAC AACTTCGTAA GCAGGCGTTT 2640 

TTGAGTATTT TCGTTAGAAT TATGCTATAC TATTTGAAAA GAATCCTGTT TAATGTTAAG 2700 

GTTTCTTATT TTAAGAAGAA TTGGAGTTTA CTTATGAAAG CCATTATAAC TGTTGTTGGT 2760 

AAAGATAAAT CTGGAATTGT TGCAGGTGTT TCTGGTAAAA TTGCAGAATT AGGATTGAAT 2820 

ATTGACGATA TCTCTCAAAC TGTCTTGGAT GAATATTTTA CGATGATGGC TGTTGTATCT 2880 

AGTGATGAAA AGCAAGATTT TACCTATCTT CGTAATGAAT TTGAAGCTTT TGGGCAAACT 2940 

TTGAATGTAA AAATCAATAT TCAGAGTGCA GCGATTTTCG AAGCTATGTA TAATATCTAG 3000 

GAGGTCATCA TGGATATTAG ACAAGTTACT GAAACCATCG CCATGATTGA GGAGCAAAAC 3060 

TTCGATATTA GAACCATTAC CATGGGGATT TCTCTTTTGG ACTGTATCGA TCCAGATATC 3120 

AATCGTGCTG CGGAGAAAAT CTATCAAAAA ATTACGACAA AGGCGGCTAA TTTAGTAGCT 3180 

GTTGGTGATG AAATTGCGGC TGAGTTGGGA ATTCCTATCG TTAATAAGCG TGTATCGGTG 3240 

ACACCTATTT CTCTGATTGG GGCAGCGACA GATGCGACGG ACTACGTGGT TCTGGCAAAA 3300 

GCGCTTGATA AGGCTGCGAA AGAGATTGGT GTGGACTTTA TTGGTGGTTT TTCTGCCTTA 3360 

GTACAAAAAG GTTATCAAAA GGGAGATGAG ATTCTCATCA ATTCCATTCC TCGCGCTTTG 3420 

GCTGAGACGG ATAAGGTCTG CTCGTCAGTC AATATCGGCT CAACCAAGTC TGGTATTAAT 3480 

ATGACGGCTG TGGCAGATAT GGGACGAATT ATCAAGGAAA CAGCAAATCT TTCAGATATG 3540 

GGAGTGGCCA AGTTGGTTGT ATTCGCTAAT GCTGTTGAGG ACAATCCATT TATGGCGGGT 3600 

GCCTTTCATG GTGTTGGGGA AGCAGATGTT ATCATCAATG TCGGAGTTTC TGGTCCTGGT 3660 

GTTGTGAAAC GTGCTTTGGA AAAAGTTCGT GGACAGAGCT TTGATGTAGT AGCCGAAACA 3720 

GTTAAGAAAA CTGCCTTTAA AATCACTCGT ATCGGTCAAT TGGTTGGTCA AATGGCCAGT 3780 

GAGAGACTGG GTGTGGAGTT TGGTATTGTG GACTTGAGTT TGGCACCAAC CCCTGCGGTT 3840 

GGAGACTCTG TGGCACGTGT CCTTGAGGAA ATGGGGCTAG AAACAGTTGG CACGCATGGA 3900 

ACGACGGCTG CCTTGGCCCT CTTGAACGAC CAAGTTAAAA AGGGTGGAGT GATGGCCTGC 3960 

AACCAAGTCG GTGGTTTATC TGGTGCCTTT ATCCCTGTTT CTGAGGATGA AGGAATGATT 4020 
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GCTGCAGTGC AAAATGGCTC TCTTAATTTA GAAAAACTAG AAGCTATGAC GGCTATCTGT 4080 

TCTGTTGGAT TGGATATGAT TGCCATCCCA GAAGATACGC CTGCTGAAAC TATTGCGGCT 4140 

ATGATTGCGG ATGAAGCAGC AATCGGTGTT ATCAACATGA AAACAACAGC TGTTCGTATC 4200 

ATTCCCAAAG GAAAAGAAGG CGATATGATT GAGTTTGGTG GTCTATTAGG AACTGCACCC 4260 

GTTATGAAGG TTAATGGGGC TTCGTCTGTC GACTTCATCT CTCGCGGTGG ACAAATCCCA 4320 

GCACCAATTC ATAGTTTTAA AAATTAAGAA AATAGGAGAA ATTTTAAGTT CTATTTAAGA 4380 

TTAGACGTGT ATACTATAAT CATTAAATAA AGACCTCCTA ATATTATTTG AAACAGATAA 4440 

CACTGAATTA GTTTGAATTT GATTTTCATC TAATATCTTT ATTTAATGAA CTCCTAAACT 4500 

TTTTCATAAT AATCTCCTTC AAAAGTCGCC TGTATGGGTG GCTTTTATTT TATCATTCAT 4560 

GATATAATAG AAGCAAACGG AGGACGGAAA ATGGTAAAAG TACGATTGTA TTTGGTACGT 4620 

CATGGCAAGA CCATGTTTAA CACGATTGGT CGCGCGCAAG GTTGGAGCGA TACTCCCTTA 4680 

ACTGCTGAAG GTGAACGAGG GATTCAAGAG TTAGGAATCG GTTTGCGAGA ATCTGATCTA 4740 

CAGTTTGAGC GTGCTTATTC GAGTGATTCT GGTCGTACCA TTCAGACCAT GGGAATTATC 4800 

CTTGAAGAAC TTGGCTTGCA GGGGGAAATC CCTTATCGCA TGGACAAGCG TATCAGAGAA 4860 

TGGTGTTTCG GTAGTTTTGA TGGAGCCTAT GATGGCGATC TTTTCATGGG CATTATTCCT 4920 

CGTATCTTTA ATGTGGACCA CGTTCACCAA TTGTCTTATG CTGAACTGGC TGAGGGCTTG 4980 

GTAGAGGTCG ATACAGCTGG TTGGGCTGAA GGCTGGGAAA AACTCAGTGG CCGAATCAAG 5040 

GAAGGCTTTG AAATGATTGC AAAAGAAATG GAAGATCAAG GTGGAGGTAA CGCCCTTGTT 5100 

GTCAGCCATG GAATGACTAT TGGAACCATT GTTTATCTGA TTAATGGCAT GCATCCGCAT 5160 

GGTCTGGATA ATGGTAGCGT GACAATCCTT GAATATGAGG ACGGCCAGTT TAGGGTTGAA 5220 

GTTGTCGGTG ACCGTAGTTA CCGAGAGCTA GGACGTGAGA AGATGGAAGA AGGCTCTATT 5280 

TAATCAGTCT AGACTTGCTT GCCATGAGCT AGGGATTTGA TAAGAATATC AAGATAAGAA 5340 

AAAACAGCCG AGGGCACTCC TTTCGGCTGT TTTTGATGTG GAAAACTAAA GTGTAATGCT 5400 

ATTGCTTTTA GAGATTTTCA TAAACAAGAG CAAGGAACCT ACTGTTAGAA CAGTCAGGAT 5460 

AGTTGACAAG GTTGCGGCTA CACCGTAATT TCCTCTGAGA ACCTCTGTAT AAATAGCTAC 5520 

AGTCATTGTT CTTGTTTTGA CATTGTAGAG GAGGATAGAA GTAGAGAGTT TTGAAATCAT 5580 

TGTGACTCAA GATAAGATGG CTCCAGAAAT GATACCAGAT AGCATCATTG GAGTTGTAAT 5640 

CTTAGCAAAG GTATTGAGAC GACTACTTCC TAAGCTTTCA GCAGCTTCTT CAATACTTGG 5700 

TGCTATTTGT TGTAAGCTAG CAACAGATGA GCGAATAGTA TAAGGTAATC TTCTGGCAGA 5760 
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TAGAGACATA ATCAAGATGA AAGCAGTCCC TGTAATCATA AGAAATCCAC TTCCAAATAG 5820 

ACCAGTATTG AAGGAAGAAA TGAAGGCAAT CCCTAGAACG GTTCCTGGTA CAATATAAGG 5880 

TACCATACTG AGGCTGTCAA TTAAGTTTGT AAACAAATTC CGTTTTCTAA CGGCTAGGTA 5940 

GGAGATAAAT GTCGCAAATA GAACAACTAG AACTAAGGCA ATCAAAGGGA TACGAATGGT 6000 

ATTGAAAATA GCAGATCCCA TACGATGGAA AGCTACCTTG TAACTGTTTG GAGAATAACC 6060 

TTTAACAGAT ACCATACCTG ATGTTTTTAG GAAAGAGGTA TAAATTAAGT AGATTTGAGG 6120 

TAAAACAGAG ATAAAGATAA TTCCGTAGAC TGTTGCATAA ATGGCAGCCA TTTTTCCTTT 6180 

TGTAGTTTTT TTAGGCTCAA TTGGATGGAG CAGATTCATG CTGAAACTGT AGCGGTTTGC 6240 

AATGTGTTTT TGGATAAGGA AAATTGCCAA GGCAATGATA ATCGCCATAA TTGCAAAAGC 6300 

AGAATTTCCT CCAACCTCGC TAATAAATTG GGTATAAATC AGGACAGGGA AAGTCCGATA 6360 

CCCTTCGCCA ATCAACATAG GCGTTCCAAA GTCTGAGAAT GCTCTCATAA ATACAAGCAA 6420 

GGAGCTGCTA GTAAGGTTGG AACTAGGAGA GGTAAAACAA CCGTTACGAT AGGTTTAAAT 6480 

CCGAAGGACC CCATGCTTTC AGCTGCTTCA AGTAGAGAAT TGTCAATACT GTTCATTGTT 6540 

CCAGCAACAT ATAGAAATAC CAGTGGGAAT AGTTGCAGTG TAAAGACAAG TACAATTCCT 6600 

TTGAATCAAT AAATATCGAT AGCTGGAAGA TAAAGGGCAT TTGTCAAAAA TTTAGTGATG 6660 

ACCTCATTTC GTCCTAGCAA GAGAACCCAG GAGTAGGCTC CTACGAAAGG AGCTGACATG 6720 

GAAGCAATGA TAATCAATAT TTGTAGAAAT TTCTTCCCCT TGAAGTCATA CATAGAGAAG 6780 

AGATAAGCTA ATAGGGTTCC TACAACTAAG GAAGTGATAG TAGCGGTAAT GGAAACCTTG 684 0 

AAACTGTTGA CTAGTGTCTC AGAGTAGTAG GCTTTACTAA AGAAAGTGAC AAAATTAGCT 6900 

AGTGAGAATT GTCCTTCATG TATAAGTGCT TGCTTGAGCA CGGTAACGAT AGGATAAACG 6960 

AGAAAGATAG GATAGGTAAG AAAGAGGAAG AAAGAGGAAA CTGTCCAAAT ATTTAGTTTT 7020 

TTACGTTCCA TGGTTGACTC CTTTTATCAG GTTTTGGGAA CCATCTGCAG AAAAGATGTT 7080 

TAATTTTTGC GTATTGATTC GTAGACGAAT ACGATTGCCT TTTTGTAGAT CTTCTTCAAA 7140 

AGTTGATTCT TCACTAACTT GAATTTTTGA GGCAAAACCT GTCTCAATGA AATAATCCGT 7200 

ATTTAGTCCA AGATAGACGC TATCTCTAAT AGTTCCTTCA ATATCTCCAG ATTCATCTTT 7260 

GATAAACTCT TCGGGACGAA TGCTTACATG AATAGCTTGC TCCTCAACCT GATCAAGAGC 7320 

TGGCATTCGA AGGGCATAGC CATCTGAAAA GACGATATAA GCGCCGTCGC TCCGTTTTTC 7380 

AAGATTGGCA GGGATAATAT TTGTGCGTCC GATAAAGGTT GCCACAAACT CATTAGCTGG 7440 

TTTATGATAG AGTTCTTTTG GTCGGCCGAT TTGTTGGATC ACCCCATCTT TCATAACAGC 7500 

AATTTGGTCT GAAATAGCCA TGGCTTCTTC TTGGTCGTGG GTTACATAAA CAGTTGTAAT 7560 
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TCCCACTTCG 
CAGATTACTA 
CAAGGTGACA 
AGCAATTTGC 
CTTCTTTTGC 
GTAGTTTTGG 
ATCATCGAAG 
GGTCGTTTTC 
AATGTTCAAA 
GATCTCACTC 
GTATTTCTTA 
TTTGATTTTG 
GTTAGTAGTG 
TTTCTTGGCA 
GACGGTTCCT 
TGCTGGATCT 
ACTAGATGAA 
CCAAGCCTTA 
GGGGCTAGAA 
AAGATCGTTA 
ACTACCATCT 
ATTATCATTT 
ATAAGAACCA 
GAAAAGTTCT 
GGCAGGAATA 
GCCGTCTCCT 
ATACATCCAT 
ACAATGTAAG 
TACACTTTGA 



TGTTGGATTT CTCGGATGGC 
AGTGGCTCGT CCATGAGGAG 
CGTTGTTGTT GTCCACCACT 
ATGAGTTCAA GATACTTGTT 
ATAAGACCAA AAGCAACGTT 
AAAACCATCC CGATATTGCG 
TAAAATTCTC CACCTTCGAT 
CCACATCCTG AAGCTCCAAG 
TTCTCAATAA CAGGGACATC 
ATAGTGAACC TCTTTTACTG 
ACGATATCTG ATTTATTCTT 
TCAATTGGTT TCATGTTTTC 
GTTGTACCAA GTGTATCTTG 
TTTTCCATAT TTTTAGATTT 
TCTTTTGGAT AGACTACCTT 
TCATAAGAGA GACCAACAGC 
CTTGAACCGA TTTTACCATC 
TCATCTTTGT AACCACCTTG 
GAGTTTGCTG GGTCAGCAGT 
TATCCTTCGA TGTTCATGCC 
AGTGTATAAG GAGTAGAGTA 
TCTTTTGAAG TATAGTTTTC 
CCAAAGATAA CATCAGCTAC 
CCAGTACCAG CTTGAATCAG 
GTTGCTCCAA TTAAGCCCTC 
TTATCAGATG AACTGTCATC 
TTCTTTTTCA TGATGGATAC 
CGTTTTTAAA ACATACAATT 
CTGCTAAAAT ATTTCTATAA 



881 

TTGACGCATA TCCAAGCGAA 
AACACTTGGA TTAACCGCTA 
GAGTTTATCG GGCTTTCGAT 
GGTCTGTTGA ATCAATTCTT 
GTCTCGGACA GTCAAATGTG 
TTTGCTGGGT TCCATATTAT 
ACTGTTGAAA CCTGCAATCA 
AAGGGTAAAG AGACTTCCTT 
GTGGTAGATT TTTTTGGCGT 
TTTAGATTGG ATATCTGTAA 
GATGACATAA TCATAATCTT 
GCTTGTTTTA GCATTTTTAC 
TACTTCTTGA GAGATAATAA 
TTTAACGATA GCAGCACTAG 
AATGTTAGCT CCGTCATTTA 
CATTTCTCCA TCAGCGACTA 
AATAAGTGTG AAAAGATCTT 
AGCTTGTAGC ATATTTGTTA 
TGCGATTTTT CCTTTTAGTT 
TTTAGTTAAA TCAGGGTTGA 
GCCAGTTGTG TTTTGATATT 
AAAGAGTTCT CCGTGGGTAG 
AGGAACTTCT TTTTCTGACT 
TTCTACTTTG ATACCATATT 
TGAGTTTGGT GAATAAACGA 
GGCAGATTCA TTAGAAGAAC 
CTCCGTTGTG TTATTTAAGT 
CTATTCTATA GTGTATTGAA 
ATTAATTTGA CTTTCCTGAT 



GTTTGGCCTC 7620 

AGGCGCATGC 7680 

CCGCATATTG 7740 

CTTTTGGAAC 7800 

GGAAAATAGC 7860 

TGATTTTTGT 7920 

TACGAAGAAG 7980 

TTGGAATTGT 8040 

T AATAATTTT 8100 

AGACTTCGTT 8160 

CAGTGAGTGT 8220 

GAACAGGACG 8280 

AATCG AT AAA 8340 

CAGGTAGGAA 8400 

AGAGTTTAAC 8460 

CTTTATAGAC 8520 

TTACATAAGA 8580 

ATTGAGCAAA 8640 

CAGGTTTGAA 8700 

CGATTAAAAC 8760 

CTTTGATAAC 8820 

TATATTGTGT 8880 

CTAGTTTTTT 8940 

TTTCTTCAAA 9000 

CTAGCGAACC 9060 

AAGCAGCATA 9120 

TTATTTTAAA 9180 

TCTATAACAG 9240 

AGAGATGTTC 9300 
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ACATCTTATT TCAATTCACT ATATTAGAGT AAAATTCTCT ACAAAAAGAA GAATAGCCTA 9360 

TTTTACTATT CTTCTGAGTG ATTTCAATTC CTTTGGGGAA ATATGGAGAT ACTTTTTAAA 9420 

. TCCTGACAAA TGGTTGTTTC TTTTTCTAAA TCGGTGATAC TGTATCGGAG AATGCGCGTG 9480 

AGGTCACAAA GGCTGCGATA GAGCTTCTAT GGAGAATTTC TTTTTGGAGA GATTTTTTAA 9540 

AGGAATGAGA CATCCGCTAC CTCCTTGGAA GGTTTTTG 9578 



(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13440 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

CGGGCTGTTG TGACGATTCT TATTTCTATC TGTGTTATCT TTTTGGGAAC TATTTTGGGT 60 

GTTGTCTTGG CTTTTGGGCA ACGTTCAAAG TTTAAACCGC TTGTTTGGTT GGCCAACTTG 120 

TACGTTTGGA TTTTCCGTGG GACACCGATG ATGGTTCAAA TTATGATTGC CTTTGCTCTT 180 

ATGCATATCA ATGCTCCGAC TATTCAGATT GGAATTTTAG GTGTTGATTT TTCGCGTCTG 240 

ATTCCAGGGA TTTTGATTAT CTCTATGAAT AGTGGTGCTT ATGTTTCGGA GACTGTTCGT 300 

GCCGGAATCA ATGCGGTTCC AAAAGGTCAG CTAGAAGCGG CTTATTCGCT AGGGATTCGT 360 

CCTAAAAATG CGATGCGTTA TGTGATTTTG CCACAAGCAG TCAAAAATAT CTTGCCAGCA 420 

TTGGGGAACG AATTTATCAC CATTATCAAG GACAGCTCCC TCTTATCAGC TATTGGGGTC 480 

ATGGAGTTGT GGAATGGGGC . TACAACAGTT TCTACAACAA CCTATCTACC TTTAACACCA 540 

CTTTTATTTG CAGCATTTTA CTACTTGATT ATGACCTCTA TTCTGACAGT AGCCTTGAAA 600 

GCTTTTGAAA AACATATGGG ACAAGGAGAT AAGAAATAAT GACAGAAACC TTGATAAAAA 660 

TTGAAAATTT ACATAAATCC TTTGGAAAGA ATGAAGTATT GAAGGGCATC AACCTCGAGA 720 

TTAAAAGAGG AGAAGTTGTC GTTATCATCG GTCCTTCAGG GAGCGGGAAA TCTACCTTGC 780 

TTCGCTCTAT GAATTTGTTG GAAGAAGCAA CCAAGGGGAA GGTTATCTTT GAGGGAGTCG 840 

ATATTACGGA CAAGAAGAAT GACCTGTTTG CCATGCGTGA GAAGATGGGC ATGGTTTTTC 900 

AACAATTCAA TCTCTTTCCT AATATGACTG TGATGGAAAA TATCACCTTG TCCCCTATCA 960 

AGACCAAAGG TGACAGTAAG GCCGTTGCAG AGAAAAGAGC TCAGGAACTT TTGGAAAAAG 1020 

TTGGTTTGCC AGATAAGGCA GACGCTTATC CACAGAGTTT GTCAGGTGGC CAGCAACAGC 1080 

GGATTGCCAT CGCGCGTGGG TTGGCTATGG AACCAGATGT TTTGCTCTTT GACGAGCCAA 1140 
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CTTCAGCCCT AGATCCTGAG ATGGTTGGAG AAGTTCTGGC TGTTATGCAA GATCTAGCCA 1200 

AGTCAGGAAT GACCATGGTT ATCGTAACAC ATGAGATGGG ATTTGCCCGT GAGGTGGCAG 1260 

ATCGTGTCAT CTTTATGGCA GACGGTGTGG TTGTTGAAGA CGGAACACCT GAGCAGATTT 1320 

TTGAACAAAC CCAAGGACAA AGGACTAAAG ACTTCTTGAG TAAGGTTTTA TAAGTTAGCT 1380 
TTGTTTAGCT ATTTGTAGCC AGCTTTAAAC GTTAAAGAGA AGATTAGTGA AAAGCTCAAC ' 1440 

CAGAGCTTTT TCTTATAGTT TAAAGCTATA GGATTGCCTA GGAAAGAAGT GTTAGAGCTA 1500 

CATTGTATTT TTTGGTATAA TTAAAGATAT TTGTAAGAAA AGAGAAGTGA TATGACACAG 1560 

ATTATTGATG GGAAAGCTTT AGCGGCCAAA TTGCAGGGGC AGTTGGCTGA AAAGACTGCA 1620 

AAATTAAAGG AAGAAACAGG TCTAGTGCCT GGTTTGGTAG TGATTTTGGT TGGGGACAAT 1680 

CCAGCCAGCC AAGTCTACGT TCGCAACAAG GAGAGGTCAG CCCTTGCGGC TGGTTTCCGT 1740 

AGCGAAGTAG TACGGGTTCC AGAGACCATT ACTCAAGAGG AATTGTTAGA CCTGATTGCT 1800 

AAATACAATC AGGATCCAGC TTGGCATGGG ATTTTGGTTC AGTTGCCATT ACCAAAACAC 1860 

ATTGATGAAG AGGCGGTTCT ATTGGCTATT GACCCAGAAA AGGATGTGGA TGGTTTCCAT 1920 

CCTCTAAACA TGGGGCGTCT TTGGTCTGGT CATCCAGTCA TGATTCCTTC GACACCGGCA 1980 

GGAATTATGG AAATGTTCCA TGAATATGGG ATTGACTTGG AAGGTAAAAA TGCAGTCGTC 2040 

ATCGGTCGAT CCAATATTGT CGGAAAACCT ATGGCCCAGC TTCTTTTGGC AAAGAATGCA 2100 

ACAGTAACCT TGACTCACTC ACGTACTCAT AATCTTTCCA AGGTGGCTGC AAAAGCAGAT 2160 

ATTCTGGTTG TTGCAATCGG TCGTGCCAAG TTTGTGACTG CTGACTTTGT CAAACCAGGT 2220 

GCGGTAGTCA TTGACGTTGG GATGAACCGC GATGAAAATG GTAAGCTCTG TGGGGATGTT 2280 

GATTATGAGG CGGTTGCCCC ACTTGCTAGC CACATTACGC CAGTCCCTGG AGGTGTCGGT 2340 

CCTATGACCA TTACTATGCT GATGGAGCAA ACCTATCAGG CAGCACTTAG GACATTGGAT 2400 

AGAAAATAAG ATAAAAATTT TCTGAGGAAA GTGTATTTTC TATAGCTATA TCTAAAATGA 2460 
TAGAAATGAA TATTAAATTT TAGAAATAAG TTTATAAAAG GAGGTTTGCG CCTCCTTTTT ' 2520 

GTTGTATAAT GGAGTGAGGT GATTAGATGA TTTTAAAAAT TTATAATGGG GAATATAGTT 2580 

TACAATGGGA TGGAATATAC TACTTAGCAC TAATTGATTA TCCAAATATT CAAGAGTGGG 2640 

AATTAGAAAA AATTGCTAAA TTTATAGCTT ACGAAAAACT TCATAAACGT CAAACAAGTA 2700 

TTGAGTGTGC TGATTCTTGT TTAAAAAAAG AAATTTTAGA TTACATCTGT CAGCATCCCT 2760 

TTCTGCCACC ATTTACTCCT ACAGATAAAA GAGTAGCCTC GACTTATGAC CTACATAAGA 2820 

GGTTAGTGAC TTCAGACTAC TGTAGTCATA CTACGACTAT AGATGCAGCG ATTTCTATTT 2880 
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TTAAAACTGG TCGTCTTTTA TCTGCTGTGA AAGCCTTTGG GCGAGATGCT GAGGAGTTGG 2940 

TTTTGGATAG TCGAAATGCT GCATCTGATC CGATAGATTA TTTTGACTAT GTCATGTTAG 3000 

GGTGGTCAAA TACAAGTTCT GGTTATCGAT TGGCGATGGA GCGTTTATTA GGTCGAGCTC 3060 

CTTCAGAGAA AGAATTACAA GACAAGTTTA TTCCTGGAGT AAGTTTTCAT TTTATCTATA 3120 

CAGATTTGAT TAAAGTTCCT GGTTATATTT TTGATGGTTA CCATGCTGTA AAAATTAAGG 3180 

ACATGCTTAA TTTATTAAGT GAGTTGTATA TTTGCATTAT TCCAACTCAT AATAAGAGCC 3240 

AATTTGAAAA TATTATTCCA ACCAAAATAC AAGATAGGGT GTATTATCTT GACTATGCTG 3300 

GAGAAGACTT AGAAGAGTGG ACTAAGAAAG TCTATCAAGT TGTTTTAAAA CAATCAGATA 3360 

AAGGATAGTT GAGGAAAAAA CGATGAAAGT GATTGATCAA ACCTTACTAG AAAAAGTCAT 3420 

TATTGAACGT TCTTGTACAA GTCATAAAGG AGACTACGGT CGTCTGCTGT TGCTTGGTGG 3480 

GACTTATCCT TATGGTGGTG CCATCATCAT GGCTGCTTTA GCAGCTGTAA AAAGCGGTGC 3540 

AGGATTGGTA ACCGTTGGAA CGGACAGGGA AAATATCCCT GCTCTACACA GCCATTTGCC 3600 

TGAGGCTATG GCCTTTTCTC TGCAAGATCA GTAATTGTTA CAAGAGCAAT TGGAGAAGGC 3 660 

AGAAGTTGTC TTGCTGGGGC CTGGTTTACG AGACGATACG TTTGGAGAAA ATCTTGTAAA 3720 

ACAGGTCTTT GCTAGCTTAA AAAAGAATCA GATTTTGATT GTAGATGGAG GGGCCTTAAC 3780 

CATCCTTGCT AGGACAAGTT TGTTGTTTCC ATCTAACCAG CTTATCTTAA CTCCCCACCA 3 840 

AAAAGAATGG GAAAAACTGT CTGGTATTGC TATTGAAAAG CAAAACGAAG GTACAACATC 3 900 

TAGTGCCCTG ACTTCTTTCC CTCAAGGAAC AATTTTGGTA GAGAAAGGTC CAGCTACTCG 3960 

TATTTGGCAA GTTGGCCAGT CTGATTATTA CCAGTTAAAG GTTGGCGGTC CCTATCAGGC 4020 

GACTGGTGGT ATGGGTGATA CACTGGCTGG AATGATTGCA GGATTTGCAG GCCAATTTCG 4080 

ACAGGCCAGT CTCTACGAAC GTGTGGCAGT AGCAACCCAT CTTCATTCAG CCATAGCCCA 4140 

AGAACTATCT CAAGAAAATT ATGTGGTCTT GCCGACGGAA ATTAGTAATT GTCTTCCTAA 4200 

AGTAATGAAA AGATATGTCT AAAATAGTTA GACAAAAAAT GTTGATAATT TGTATCATTA 4260 

TTCTTAATTC ACAAAAAACG AACGTTTAGT ATTCTTCTTG CTAAGAAACT AAATTTGTTC 4320 

GTTTTTTTAC TCTTGTAAAT CTATTTTTGT TAGAGTTGAT TTGGTTTACA TCCGTACTTA 4380 

AATTGATTTG TTAGAGCTCT ACTTTTATTA AAAAAATTCA ATTTCAAGGA TAAATAAGCA 4440 

GTATTCTAAA GGTACTTTTA GATGAAATAA AAGCCTTTAC ATGGTATAAT AGAGGTAGCT 4500 

CTTTAATGGA GGTGTTTGAG TGGAAAATCT GAAGAAAATG GCAGGTATCA CGGCTGCTGA 4560 

ATTTATCAAG GATGGGATGG TTGTAGGGCT AGGAACAGGT TCTACTGCCT ATTATTTTGT 4620 

CGAAGAAATC GGTCGTCGAA TCAAGGAAGA AGGCTTGCAG ATTACAGCTG TGACGACTTC 4680 
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TAGTGTGACC AGTAAACAGG CTGAAGGGCT CAATATCCCG CTCAAGTCTA TTGACCAAGT 4740 

AGACTTTGTC GATGTGACAG TCGACGGGGC GGATGAAGTG GATAGTCAGT TTAATGGAAT 4800 

CAAAGGCGGT GGTGGTGCCC TTCTCATGGA AAAGGTGGTC GCAACACCAT CAAAAGAATA 4860 

CATTTGGGTG GTGGATGAAA GCAAGCTGGT CGAAAAACTA GGTGCTTTTA AATTGCCAGT 4920 

AGAAGTGGTT CAGTATGGTG CAGAGCAGGT CTTTCGTCAT TTTGAACGAG CTGGCTACAA 4980 

ACCAAGTTTC CGTGAAAAAG ACGGCCAACG TTTTGTGACC GATATGCAGA ATTTTATCAT 5040 

TGACCTCGCC TTGGATGTCA TTGAAAATCC AATTGCTTTT GGACAAGAAT TGGACCATGT 5100 

CGTTGGTGTT GTGGAGCATG GTTTATTCAA CCAAATGGTG GATAAGGTAA TCGTTGCTGG 5160 

ACGAGATGGA GTTCAGATTT CAACTTCAAA AAAAGGAAAA TAGAAGGGGG CATAAGATGT 5220 

CTAAATTTAA TCGTATTCAT TTGGTGGTAC TGGATTCTGT AGGAATCGGT GCAGCACCAG 5280 

ATGCTAATAA CTTTGTCAAT GCAGGGGTTC CAGATGGAGC TTCTGACACA CTGGGACACA 5340 

TTTCAAAAAC AGTTGGTTTG AATGTCCCAA ACATGGCTAA AATAGGTCTT GGAAATATTC 5400 

CTCGTGAAAC TCCTCTTAAG ACTGTAGCAG CTGAAAGCAA TCCAACTGGA TATGCAACAA 5460 

AATTAGAGGA AGTATCTCTT GGTAAGGATA CTATGACTGG ACACTGGGAA ATCATGGGAC 5520 

TCAACATTAC TGAGCCTTTC GATACTTTCT GGAACGGATT CCCAGAAGAA ATCCTGACAA 5580 

AAATCGAAGA ATTCTCAGGA CGCAAGGTTA TTCGTGAAGC CAACAAACCT TATTCAGGAA 5640 

CGGCTGTTAT CTATGATTTT GGACCACGTC AGATGGAAAC TGGAGAGTTG ATTATCTATA 5700 

CTTCAGCTGA CCCTGTTTTG CAGATTGCTG CCCACGAAGA CATTATTCCT TTGGATGAAT 5760 

TGTACCGTAT CTGTGAATAC GCTCGTTCGA TTACCCTTGA GCGTCCTGCC CTTCTTGGTC 5820 

GCATCATTGC TCGCCCTTAT GTAGGTGAAC CAGGTAACTT CACTCGTACG GCAAACCGTC 5880 

GTGACTTGGC TGTATCTCCA TTTTTCCCAA CTGTTTTGGA TAAATTGAAT GAGGCTGGTA 594 0 

TCGATACTTA TGCTGTGGGT AAAATCAACG ATATCTTTAA CGGTGCTGGT ATCAACCATG 6000 

ACATGGGTCA CAACAAGTCA AATAGTCATG GAATTGATAC ACTATTGAAG ACTATGGGAC 6060 

TTGCTGAGTT TGAAAAAGGA TTCTCATTCA CAAACCTAGT TGACTTTGAT GCCCTTTACG 6120 

GCCATCGTCG TAATGCTCAC GGTTACCGTG ATTGCTTGCA TGAGTTTGAT GAACGCTTAC 6180 

CTGAAATTAT CGCAGCTATG AGAGAGAATG ACCTTCTCTT GATTACTGCG GACCATGGAA 6240 

ATGACCCAAC GTATGCAGGA ACGGATCACA CTCGGGAATA TATTCCATTG TTGGCCTATA 6300 

GCCCTGCCTT TAAAGGAAAT GGTCTCATTC CAGTAGGACA TTTTGCAGAT ATTTCAGCGA 6360 

CTGTTGCCGA TAACTTTGGT GTGGAAACTG CTATGATTGG GGAAAGTTTC TTAGATAAAT 6420 
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TGGTATAAGA TGACGCGCTA TGCTTTGCTG GTGAGAGGTA TCAATGTTGG TGGTAAGAAT 6480 

AAGGTCGTCA TGGCGGAGCT TCGTCAAGAA TTGACAAACT TGGGACTGGA AAAGGTTGAG 6540 

AGCTACATCA ATAGTGGCAA TATTTTCTTT ACTTCGATAG ATTCCAAAGC CCAATTGGTT 6600 

GAAAAGCTAG AGACTTTCTT TGCAGTCCAT TATCCATTTA TTCAGAGCTT TTCTTTACTG 6660 

AGTCTAGAGG ACTTTGAGGC GGAACTTGAA AATCTACCAG CTTGGTGGAG CAGAGACTTG 6720 

GCACGAAAAG ATTTTCTCTT TTACACTGAG GGTTTGGATG TGGACCAAGT CATCGCGACA 6780 

GTTGAAAGTT TAGAGCTGAA AGATGAAGTG CTTTATTTTG GAAAACTTGG GATTTTCTGG 6840 

GGGAAATTTT CTGAAGAATC CTATTCTAAG ACTGCCTATC ATAAGTACTT GCTGAAGGTG 6900 

CCTTTCTACC GCCACATTAC TATTCGTAAT GCTAAAACCT TTGACAAAAT TGGTCAAATG 6960 

CTAAAAAAAT AATAAAGGAG ACACACAATG ACATTTTTAA ACAAAATCCA TGAAACTGCT 7020 

ACTTTCCTGA AAGAAAAGGG AATTGCAGCC CCTGAGTTCG GTCTAATCCT TGGATCAGGA 7080 

CTTGGAGAAT TGGCAGAAGA AATCGAAAAT CCAGTTGTAG TAGACTATGC TG AG ATT CCA 7140 

AACTGGGGCC GTTCAACAGT AGTCGGTCAT GCTGGTAAAT TGGTATATGG TGAACTGGCA 7200 

GGTCGCAAGG TCTTGGCTCT TCAAGGGCGT TTCCATTTCT ATGAAGGGAA TCCTCTGGAA 7260 

GTGGTGACTT TCCCAGTTCG TGTGATGAAA GTTCTTGGAT GTGAAGGTGT TATTGTAACC 7320 

AATGCAGCTG GCGGTATCGG ATTTGGTCCT GGTACCTTGA TGGCTATCTC AGACCATATC 7380 

AACATGACGG GGCAAAATCC ATTGATGGGT GAAAACTTGG ATGACTTTGG CCCACGTTTC 7440 

CCAGATATGT CTAGGGCCTA CACACCAGAA TACCGTGCCA CTGCCCATGA AGTGGCTAAA 7500 

AAACTTAATA TCAAGCTTGA TGAAGGTGTC TATATCGGAG TTACTGGTCC GACTTATGAA 7560 

ACACCAGCAG AAATTCGTTC CTATAAGACA CTGGGAGCAG ATGCAGTTGG TATGTCTACG 7620 

GTTCCTGAAG TTATCGTGGC AGCCCACTCT GGCTTGAAAG TTCTGGGAAT TTCATGTATC 7680 

ACTAACTTTG CGGCCGGTTT CCAAGAAGAA CTCAATCACG AAGAAGTTGT AGAAGTGACT 7740 

GAACGTGTTA AAGGTGATTT CAAAGGCTTG CTTAAAGCGA TTCTTGCTGA ATTGTAAGAA 7800 

AAAAGATTTA AAAGGGGGAG TGCCTCTGTT TTTTCAGGAT TGACTGCCTA TCCGGATTAA 7860 

AGAAGAAACA GAGGAATACT ATGAGCTTCT TCCTGCTCTT ATAACTGAAA GAAGCGGAAG 7920 

AATAGGTATG TCTGATCTGA TAGCCAGCAT TGTGAAAGAC AAGATTCTAG GATACTAGCA 7980 

TTAGCTTCCT AGCCAAGCAG ACTAGTATGA TAAGGAGAGA TGAGAATGAA TTGACTTTCT 8040 

GAATTTCTCA GTCTTATCAT ATATAGCACA ATGAGATTTC GCTTGAGTCT GCTTGTAAAT 8100 

AAACGAAAAG AAAGATAAGA AATAATGAAA ATTGGTCAAC GAATTATGCG CTTTGGCATA 8160 

AAAAATTAAG TATCGGAGTT GTATCTGTTG TAGTCGGCTT TGATTTCTAG CTCCAGCTGG 8220 
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AATTTCAGCC AATGAAGTAA AGCAAGATGT AACATCTGAA GTGGTAATAG GTGTGCTAGA 8280 

TTCTAAGGAG GAATTGAAAG AGTCAGAAAA TGATGCTCCA AAACTAGAAA CTCCTCTTAG 8340 

AGAGGAGCCA AGACTAGCTC CTCAAACGCT TCCGGAAGCA AGTGAAGTTC TTGAAAACAA 8400 

AAGGGAAGAG TCAAAAGTAG AGATAACATA ACCAGCTCAA GCGGATGATA TCCGCAAGGT 8460 

TGTTGGGGAA TTAGCCAAGG ATATAAGTAT TACTAAGTTG TATATGACAG GTCATTCTCT 8520 

TGGATGTTAC CTAGCTCAGA TTGCAGCGGT TGAAGCTTAC CAAAAATATC CTGATTTTTA 8580 

TAACCATGTA TTGAGGAAAG TGACAACTTT CAGTGCTCCT AAAGTGATTA CTTCCAGAAC 8640 

TGTTTGGAAT GCTAAGAATG GTTTCTGGGA TGTTGGTTTG GAAAGTCGTA AATTAGCTGT 8700 

TAGTGGAAAA ATTAAGCATT ATGTGGTTGA TAATGACAAT GTTGTGACTC CCTTGATTCA 8760 

TAATAATCGT GATATTGTTA CATTTACAGG TAATTCACGC TTTAAACACC GTTCTCGTGG 8820 

CTATTTTGAA AGTCCAATGA ATGATATTCC TAACTTTAAT ATTGGTAAAC AAGCTACCTT 8880 

GGATAAACAT GGTTATCGTG ATCCGAAATT GGATAAAGTG CGATTCTTTA AGAAACAGGC 8940 

TCTGCCTCGA TCTTCTAGTC AACCAAGCGC TGAACCAATG GAAAATATTG CCTCAGGAAA 9000 

ACAGGTTACT CAAAGTTCGA CAGCTTTCGG AGGAGATGCT AGAAGAGCTG TGGATGGCAA 9060 

AGTCGATGGT AACTATGGTC ACAATTCTGT CACTCATACA AACTTCCAAT CTAAGCCTTG 9120 

GTGGCAAGTA GATTTGGCTA AAGAAGAAAC CATTCGCCAA ATCAATATTT ACAACCGAAC 9180 

AGACACTGCC CAGGATAGAT TGGCAAACTT TGATGTCATT CTTTTAGACA GTTCTGGTAA 9240 

AGAAATTGAG TGAAAACGTA TAACATCTCC TAAAGATGTG TCAGCACAAA TTACGATTAA 9300 

CCATAAAAAA GCGCGCTATG TTCGGATTGA GCTAGAAGGC TATAATGCCC TCAGTCTTGC 9360 

AGAAGTTGAA GTTTTCTGCT TTATAGCTAC GAATGCTGAA ACGGCGACAC AAGTTTCTAA 9420 

GCCAGTTCAA CCAATCAGTC AGACTCCTGT GAAGGATAAA ACATTGACAA TTCAACACAG 9480 

TGGAGCTTAC ATTGCCCGCT ACTCCATAAC TTGGGAAGAA GTTCCAGTAG ATAAAGATGG 9540 

AAACCAAGTT GTTCGTAGTC ATTCTTGGGA AGGAAGCGGT CGCAACCAGA CTGCAGGTTT 9600 

TGTCCTCAAC CTCCCAATCA AAGAAAATAT GAGAAATCTG CGAGTTAAGA TTGAGAAAAA 9660 

GACGGGCCTA CTATGGAATA GATGGCAAAC AATCTATGAA AACAGACCAA TTTTAGCTCA 9720 

ACCCCACCGT AAAATTACCC ATTGGGGTAC GACATTGAAT TCCAAGGTGA GTGACGATGA 9780 

TGTCTTGTAA TCTGATGGTA GAATGACAGT TAGTTTGTCT AGTTTATAAG AAAGTACTAC 9840 

CTGAGCTTGA ATAGGACTCA GGTAGCTCTC TATGAAAGAA CAAAATTAAT ACTCAATGAA 9900 

AATCAAAGAG CAAACTAAGA AACTAGCCGC AGGTTGCTCA AAGCACTGCT TTGAGGTTGT 9960 
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AGATAAGACT GACGAAGTCA GTCACATATA TAATCCAAGG CGACGTTGAC GTGGTTTGAA 10020 

GAGATTTTCG AAGAGTATAA ACAGAAAGGT AGAGCGCGTG TTCTAATTTG AACACGAGTA 10080 

GAAAACTTTT CTAAAAACAA AAACGAAAGG ATGGGTAAAC TGTATTCGCT GAACTGAATA 10140 

CGGGCGACTC TCCTCTAAAT CAAAATTAAG AAAGGAATTG ACCCCACCCT AAAAGTAGTG 10200 

GGAAAAAGAT AGTTGATCTA GCGAGCATCG CTCACTGCGC CCAACTCCTA TTTTCCCTTC 10260 

GCTTTTTGAT GGGTTTGGTA TCTTTCTCAA TATAAAATAT AAAATAAAGA AAGGTAGAGC 10320 

GTGTGTTTTG ATTTGAACAC GAGCGGAAAA CTCGGAAAAT AGATAATCTG ACTGAAAAAT 10380 

CAGGATTTCT CGTCAGGTTC CTAATTTTCA GTCGTTTTCT TCTCGCTCTT TGTATCATAA 10440 

ATTATGTCTA TCCATATTGC TGCTCAGCAG GGTGAAATTG CTGATAAAAT TCTTCTTCCT 10500 

GGGGATCCTC TTCGTGCTAA GTTTATTGCG GAGAATTTCC TTGATGATGC TGTTTGTTTT 10560 

AACGAAGTGC GTAACATGTT TGGTTACACT GGTACTTACA AGGGTCACTG TGTATCTGTC 10620 

ATGGGAACTG GGATGGGAAT GCCATCTATT TCGATTTATG CGCGTGAGTT AATCGTAGAC 10680 

TACGGTGTGA AGAAATTGAT TCGTGTGGGA ACTGCAGGTT CTTTGAATGA AGAGGTTCAT 10740 

GTTCGTGAAT TAGTTTTGGC GCAGGCGGCT GCAACCAACT CAAACATCGT TCGTAATGAC 10800 

TGGCCACAGT ACGATTTTCC ACAAATTGCT AGCTTTGATT TGCTTGATAA AGCCTACCAT 10860 

ATCGCCAAAA AACTTGGTAT GACTACTCAC GTTGGGAACG TTTTGTCATC TGATGTCTTT 10920 

TACTCAAATT ACTTTGAAAA GAATATCGAG CTTGGTAAAT GGGGAGTCAA GGCTGTGGAA 10980 

ATGGAAGCAG CAGCTCTTTA CTATCTTGCT GCCCAATACC ATGTTGATGC GCTAGCTATC 11040 

ATGACCATCT CTGATAGCTT GGTCAATCCA GACGAAGACA CAACTGCAGA AGAACGTCAA 11100 

AATACCTTCA CTGATATGAT GAAGGTTGGT TTGGAAACCT TGATTGCAGA ATAATTATAG 11160 

CCAAAAAGGG GCTCTTTGTC AACTGTAGTG GGTTGAAAAA AAGCTAAGCT TGAGAAAGGA 11220 

CAAATTTCGT CCTTTCTTTT TTGATATTCA GGGCGATAAA AATCCGTTTT TTGAAGTTTT 11280 

CAAAGTTCCG AAAACCAAAG GCATTGCGCT TGATAAGTTT GATGAGATTA TTGGTCGCTT 11340 

CCAGTTTGGC ATTAGAATAG TGTAGTTGAA GGGCGTTGAC GATTTTCTCT TTGTTCTTTA 11400 

GAAAGGTTTT AAAGACAGTC TGAAAAAGAG GATGAACCTG CTTCAGATTG TCCTCAATGA 11460 

GTCCGAAAAA TTTCTCAGGG TCTTTGTTCT GAAAGTGAAA AAGTAAGAGT TGATAGATCT 11520 

GATAGTGGTG TTTCAAGTCT TCTGAATAGC TTAAAATCTT GTCAAGAATT TCTTTATTTG 11580 

TTAAGTGCAT GCGAAAAGTA GGGCGATAAA AACGTTTATC GCTsArTTTA CGACTATCCT 11640 

GTTGGATGAG TTTCCAGTAA CGCTTGATAG CCTTGTATTC ATGAGATTTT CGTTCAAACT 11700 

GATTCATAAT TTGAACACGA AAACGACTCA TGGCACGGCT GAGATGTTGG ATAATATGGA 11760 
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AACGATCTAG AACGATTTTA GCACACGGAA AAAGCTGTTT AGCCAAGTCA TAGTAAGGAC 11820 

TAAACATATC CATCGTAATG ATTTTCACTT GACAACGAAC GGCTCTATCG TAGCGAAGAA 11880 

AGTGATTTCG GATGACAGCT TGTGTTCTGC CTTCAAGAAC AGTGATAATA TTAAGATTAT 11940 

CAAAATCTTG CGCAATGAAA CTCATCTTTC CCTTAGTGAA GGCATACTCA TCCCAAGACA 12000 

TAATCTTTGG AAGCCGAGAA AAATCATGCT CAAAGTGAAA GTCATTGAGC TTGCGAATGA 12060 

CAGTTGAAGT TGAAATGGCC AGCTGATGGG CAATATCAGT CATAGAAATT TTTTCAATTA 12120 

ACTTTTGAGC AATTTTTTGG TTGATGATAC GAGGGATTTG GTGATTTTTC TTTACCAGGG 12180 

GAGTCTCAGC AACCATCATT TTTGAAsAGT GATAGCACTT GAAACGGCGT TTTCTAAGGA 12240 

GAATTCTAGA AGGCATACCA GTTGTTTCGA GGTAAGGGAT CTTAGACGGT TTTTGAAAGT 12300 

CAT rTTTCTT CATTAGACTT CCACAATCAG GGCAAGATGG AGCCTCATAA TCCAGCTTAG 12360 

CGATAATTTC TTTGTGGGTA TCCATATTGA TGATATCTAG AATCTTGATG TTTGGGTCTT 12420 

TAATATCGAG CAGTTTTGTG ATAAAATGTA ATTGTTCCAT ATGATTCTTT CTAATGAGTT 12480 

GTTTTGTCGC TTTTCATTAT AGGTCATATG GGACTTTTTT TCTACACAAA AATAGGCTCC 12540 

ATAATATCTA TAGTGGATTT ACCCACTACA AATATTATAG AGCCCAAAAA GGAAGCCCTT 12600 

TATGAATTGT AGGACTTCCT TTTCTTATCC AGAAATTGAT CTAGCTCTCT CTGATTTCGA 12660 

AGAATAGTGA CTTTATGTGA ATATTCTTGG CAAAGTTTTT GGTAATTTTC TTTTTGAGTT 12720 

TTGCGGACGC CCATCCCAAA GAATCCATCT GATAAACTCC CACTCAAAGC GTTCAGGGCA 12780 

ATCTACCGCC ATACTTTCTC TGACTTTTCC ACGGTATTTA AGATAACGCT TAAAGGCTCT 12840 

AAAGAGACAG GTCAATGGCG AAAAATTGAG AAAGATGATT TGGTCAGCTT CTTGCATTCG 12900 

TTCTTGGTAG TAGCACCAAG AATAATTACC ATCGATGACC CAAGCTTTAT GCTTGGTGAG 12960 

AAAGTTTTTT ATCTCGGTTA ACATCCATTC GCAGTCACTG TCTTGCCAAC CAGGTTGAAA 13020 

TTGGAGTGTG TCCATGTGCA GTTTTGGAAT GGAGTAGTAG TTAGATAACT TTTCTGCTAT 13080 

AGTTGACTTA CCAGAACCAG AATATCCGAT AATTGCGATT TTCATTTTCT ACCTTTTCCT 13140 

ATTTGGAGAC AAAAAAACAG CCTCTATGGA CTGTTTCTTA TTTAACAAGT TTAGCTGAAA 13200 

GACGAGCTTT ATCGCGGCTT GCTTTGTTTT TGTGAATCAA ACCTTTAGTT TCTGCTTTAT 13260 

CGATAGCTGA GCTAGCAGCA CGGAAAAGTT CTTCAGATGG GTTTGCTTCG AAAGCTTTTA 13320 

TAGCAGTACG CATAGCTGAT TTTTGAGCTG AGTTCTTTTC GATTCGTCTA ACGTTCAATT 13380 

CAGCGCGTTT GATAGCTGAT TTAATGTTTG CCAATGGTCT TACCTCCATA TTTACTAACT 13440 
(2) INFORMATION FOR SEQ ID NO: 129: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8512 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

CCTTTTTTCA AAAACTAGAT ACTAGTCTAT CAAAAGTAGG AAAGGGTTTC AAGAAAATTG 60 

ATTGGAAATT TTTTGAAAAT CATAGAACTA TTAGCTAATC CCTAGTATTG AAAAGACTGG 120 

ATAGCTTCTT TCAGGTCATC TTGTAAACTA TTTCTCTGGT CAAGTTGGAC ATAGACTTCC 180 

ACCAGACAGG ATCTAAAGTT GGAAAATTTG TAAAAATCCT CCCTTTCTTC TATCGGAAAA 240 

TCAACAGTTT TTATCCAAGA AGCTACTTGT TCTTGCTCCA ACTTCCCTTG TAAAATAGGT 300 

TCATAGATCA CTCTTGCTAA ACGCCAATCC TCATCATCTG TAAAGCGAAT CGACATTCTT 360 

TTAAATAGTT GGCCAAGTAT ATCAAATACT TCATGAACTC TGTTTTTAGG AAAGTCTGGA 420 

TGACAAACCA CCTCTGTCAG TAAATCGGCT CCATGTGCAA AAGCGTGAAC CCAACCATAC 480 

TGACTTGAGA AACCCCTTGT ATCCTTTTCT TTTGAAAGAT AGTGCAAGCC TTGATTTAAA 540 

AGGACATTAC GAATTTCTGG AGAAGGATTT CCCAAATGAT CAAACAACCA CTGGATTTCT 600 
TCCTGGTTAT AATTTGGTTT TTCTTCTGCT ATTTTTCTTA GTAAATCTTG ATACATGGTC ' 660 

AATACCTCTA CATTTCTAGC AACTGTTCAA AAAGGCAGTC TTAAATGACT CAATATTGAA 720 

TTCTCAATTA AATACAATCT GATATAAAAT GACGTAAATA ACTATCAATA CCAGTTCTAC 780 

AGTAAGTTCA AATTTAACAT CACGACCTTC AACGACATTT TTGAAAATAG CTACAACTAA 840 

GACAAATAGA ATGACGCTTA ACAAGCCCAT AAACATCATT CTAAAAAATT TTTCTATTCC 900 

CCTACTCTCC CAACTCAGCA CTATAGGAGA TAATCTGGTC AACTGTGTCA GACAAGAATT 960 

GGATGGTATC ACGGAGTGGT TTGTCTGTTG AAATATCAGC ACCGATAATC ATGGCTGACT 1020 

CAAGTGGTGT CTTGCTACCA CCTGATTTGA GGAGATTGAG CCAGTCTTCA GCTCCAGTTT 1080 

CAGAATGTTT TAGATGAAGG TAACCAGCAG TCGAGATAAC TAGTCCTGCT GAGTAAGTGT 1140 

AACTATACAA GCCCATATAG TAGTGAGCTT GGCGCATCCA AGTCAGAGTT GCATCATCGT 1200 

CAATTTCAAT AGCATCTCCC CAGAAATCCG TCAAAACTTC CTTCATAATG CTGTTGAGCT 1260 

TGCTTGCTCC AAAGGTCTCC CCTTCTTCAA TCAATGTATA CACCTTACGC TGGAAGGCGG 1320 

CTTCCAAGAG GTGGGTGATA AAGTTATGGA AGTAGGTGTC TGTCAAGCGA TGAGCCAGAG 1380 

CGAAGCGTTT TTGACGTGGG TCATTAGACT GGTTCTCCAA GTAATCACTG AGTAGCAATT 1440 

CATTGAAGGT TGACGGTGCT TCAACATAGT AGGTCGACAT ATGGGCATTG AAGTAACTTT 1500 
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GATGATTGTC TGAAAAGATG AATTGACCAG AATGCCCGAT TTCATGAATC AAGGTATAGA 1560 

CATCGCTCAA ACGGCCTGTC CAGCTCATGA GTACATAAGG GTGTACGCGA TATGGGTCCG 1620 

CCGCATAACC ACCGGAATCC TTGCCACTGT TAGCAGCAAA GTCCACCCAG CGCTCTTCTT 1680 

GGTAACGAGC AACTTCCTGA CAATATTCTT GCCCCAAAGG TTCTACCGAC TTCATGACCA 1740 

AATCATAGGC ATCGTCAATA GTCACTTCAG GATTCAGGGC GCTGTCCAAG TCCAATTTCC 1800 

AGTCTGCAAA GGTCATCTTT TCAAGACCAT TTACCTTGGC AACATGCTTG AGGTATCTCT 1860 

GAGCGACTGG TGCAAAGTCC TTCATGATGA GGTCAATCTG GCGGTCAAAC ATGACACGGT 1920 

CCACTTCTTG TTCAGCTAGA AGATAGTCAA AGACAGAGTC GTATCCCTTC ATATCAGCCA 1980 

AGAGTTTTTC AGACTTGACC TGAGCCAGAT AGGCTGCTGC AGCCGTATTT TGGTGCTTAC 2040 

GAAGTCCCTC TGAGAAGGAA CGGAAGGATT TCTCACGAAC CTCAGCATCC TCATGGTTTT 2100. 

GGTAGAAATT CTCATAGGTC ACAAAGCTGT TTTTGTAGGT CTTGCCATGG GCTTCAAAGT 2160 

CAGCCATTTC AAAATCCCCA GCTCGCATCT TAGTATAAAT GTCCTGCGGA CTGTAGAAAA 2220 

CTTCACCGAG ATTTGTCAAG GCCTTCTCCA CATCTGCCCC TAAGTAGTGG GCTTTTTTGA 2280 

TTTTAGCCTG ACGAATGGCA GCTGTTAAAT GTGGCAATTT ACCCAAACGG TCCAAGACTT 2340 

CCTCATCTGC TGCCACCAAG GCATCGTCAA AGAAGGTCAA GGCTACGCTG GCATCTGTTT 2400 

CAAATTCCAT CCCAGCTTGG GCAATATTGG CAAATTCGTC ATTGCTATAG TCCGTCGTCT 2460 

GAGGCATAAA ACCATAGTTG CCAATATGGC TCATCTGAAT GTAGATCTGT TCCAATTCCG 2520 

CAAAGGCCTT CTCGAAATCC TCAAAAGTGT GAAGATTGCC CTTGTAATCA CGGCTAAACT 2580 

GGTTGATGTC TTCGCGAGCT TTCTCGATTG CACGCAAGAA ATCCTCACGG TCTTGGTATA 2640 

GGGCTGTTAA GTCCCAGAGT TCCTTCTCTG GAAATTCTGA ACGGTGTTTT TGTTCCATTT 2700 

TCTTCCTCTT ATTTCTCTAA TTCTACTAAA ACACTAAGGG CTGATAAAGC GTAAAGCGGT 2760 

GCTGTTTCTG CTCGCAAAAT ACGAGGACCT AGGCCTGCCA AAACGGCTCC TTTAGCTTCA 2820 

AAACTTTCGA TTTCTGCAGG TGAGAGACCG CCTTCTGGAC CAAAGATAAA GAGCAGTTTG 2880 

GCTCCTGTTT CAAGACCAGT GACTGCTTGC AGAAGCGCAG CGGCTTCTCC TTCTTTAGCT 2940 

GATTCTTCAT AGGCTACTAT GATAGAGTCA AACTGGTCCA GCTGAGCTAG AAAATCTGCT 3000 

TTTTTCTCGA AAAGTTTAAT ACTTGGTACA ATATTACGCT TGCTTTGCTC GGCTGCTCCA 3060 

AGGGCAATTT TTTCTAGTTT TTCAACTTTT TTACCCAATT TCTTGCCATC CCACTTGGCA 3120 

ACTGACCAGT CTGCAGGAAA GGCCCAGATT TGGCTAGCCC CCAGTTCGGT TACTTTTTGA 3180 

GCGATGAACT CCAGCTTGTC TCCCTTGGGA AATCCAGATG CGATGGTCAC TTGGACTGGT 3240 
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AGTTCCACAT TGTCATTTAA TTCTTGGACC AACTCAAACT GACGATTTTC CATATCCAGC 3300 

ACGCGCGCCA AGCGCTTGAT GCCATCATCA AAGACTAAGG TAACCTCATC CTCTTCTTTC 3360 

AAGCGCATAA CCTGAAACAT ATGCTTACTG GTTTCCTTGT CCTCGATAGT GACAGGAGAG 3420 

ATAGCACTGC CTTTTACAAA ATACTGCTGC ATGCTAGCCT CCAATCACAC CAGAGATATC 3480 

CTTGGTTTTC TTAAAGACAC AGGTATTCCA TTCCCCTTGA ACCATGTGAG TTTCGAGGAA 3540 

AAATCCAGCT GACTCAGCCG ACTGGCGCAC CATGTCCAAC TTGTCCTTGA TAATGCCACT 3600 

CATGATCAGG TAGCCTTCAT CCTTTACCAA GCGATAAGCA TCGTCTATTA GATGAATGAG 3660 

GATATCCGCC AAGATATTAG CCACAATCAC ATCTGCCTCA ATTTCCACAC CCTTAAGCAA 3720 

ATCTCCAGCC GCTACATGGA TATTTTCCAT GCCAGGGTTG AGCTCAATAT TTTCCTGAGC 3780 

CACACGAACC GCCACATCAT CCAGGTCATA GGCGAAAATT TCTTTAGCCC CCAGAAGCGA 3840 

GCTGGCAATA GAGAGAACCC CTGAACCAGT CCCCACATCT AGCACCGTTT CGCCACCACG 3900 

AAGAACCTGT TCCAAGGCAA AAAGGCTCAT CTTGGTAGTT GGGTGGGTTC CAGTACCAAA 3960 

AGCCATGCCA GGATCCAGCT TGATAATCAT TTCCCCCGCA GTCGCCTCAT AGTCTGTCCA 4020 

AGAGGGAACG ATGGTCAAAT CATGAGTGAT ACGAGCAGGT TCATAGTATT TCTTCCAGTT 4080 

GTCTGCCCAG TCTTCCTCAG CCAAGGCAGT CGTACCTATT TTTAACTCTC CCAAATCCAT 4140 

AAAATCTGTC AATTCTGCTA GACGAGCCTG CAAATCCGCC TCAACCACTG TCACATCCAC 4200 

CGTGTCAGGG TAGTAGGCTG TCACTACGAT TTCTTCTTGC TGCTCCACCT CTGGGAAAAT 4260 

CTCTCCAAAG CGGTCCACAT TTCCCACATA GTCCATACTG TCTTCGATTG CGACTCCTTG 4320 

CGCTCCCAGC TCAATCAAGA GATTGGAAAC CAACTCCTCT CCCTCACGCT TCACTGTAAC 4380 

TTTTAACTCT TGCCATGTTT CCATTATTAA TACCAAGCCC GTAAAACACA AAACCAAAAT 4440 

AGGAAATTCT CTGAAGACGC TTGTGTCTAA GAGAAGTTTA TCTTTTTGGC ACAGTGTTTA 4500 

GGGCGGGTTC AGTTTAGAAA TGTAACTGAA CCATCCTTTC TAATCACTTA CTTTTAAATA 4560 

ATCTTTTAAT CTCTCTTGCA ACTGAGGCAC AACTTGACTG GAACTAAGAA ATTCCTCAAC 4 620 

ATTCATCAGC TGATAGCCCT GTCCTTCATC TCCGAAGATG ATATTGTCAA ATTGTTCTTG 4680 

TCTTAGCTGA CCAACCATAA AGACCGATTT CTTGCCTTTA AAAATTACGC TAGGATAAAT 4740 

CTTGCTCCAA AGCAGACAGT CTTCATCTAA ATGAATTCCC AGTTCCTCAT AAACTTCACG 4800 

CCGAGCGCAT TCAAAAGGGC TTTCGTCCCC TTCACGGCCA CCACCTGGCA GTTCCCACAT 4860 

ATTGGCCCAG GGAATACTTG CCTTATCATC GCGTAAGATA GTCAAAAGCT TATCCCCACA 4 920 

AAACAAAGCA ATCTTGCAAC CTGTGAAATC AGAAATTTCT AGTTCCATCT TCAGTTCCTT 4980 

CTAACATTTC CTTTTCCAGC TCGGCTAACC AGTTTTCATA ATATCTTTTC TCATCCCTCA 5040 
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ACATTCGACT ACTATCCATT TTCTGTCTAG CAATCTTGAG AGCCTTACGA GTTCGATCTA 5100 

CATCTTTCTT CACCTTTAAT TGATACCAGG CTTGTATCAC TTGAAGATTG GACAGTTTGA 5160 

GAGACAGAAA CGATTTGACC TGTCGAATAC TAGCATATTG CTCCGCTTGC TCAAAATCTC 5220 

CTTCCAACAA GGCGATATGA AGCAGGGATA GTTGGGCAAC TGTCTGCATC ATCGGAGTAG 5280 

TTGTCCTCTC AAGTAATGCT TGAAACTGCT GTTTAGCTAC TTCTTCCTTC CCTTCCAAAA 5340 

TGGAAACTTC ACCTTGCATA CCTAATACAC CATCCGCAAA ACTCCCTCGT GCATCCTCAG 5400 

GAACTGCTTG AACAAAGTCT TTCAAATCAT ATTCTTGAGG AGCTAGCAAG GTCTGGGCAG 5460 

AATGTCTCAA TACCAGGTAG GCGTATTTGG TATTTTCAGG GTGTTGTAGT AATTCCCAAA 5520 

TTTTTGCTCC ATCGGTGATG TCGACTGGCA AAATGTTATT TAGGAAGAAA GATAAATTAA 5580 

GAAAAATCCA AGTCCCTGCA AAATACCAGC TTCTTGTCAA AAATCCAAAC AATATCGCCA 5640 

ATAATATCAA GCCGAGATGA ACCATCAAGC CTCCTGAAAG CATCAGGATG ATTCTTTGAT 5700 

CGCTTTCATC CTCTTTTAAA CCAATGTATT GAGCACCAAC ATTTTTCAGA ATGGCTGTTC 5760 

TACTAAGATG AAACCTGCCT GACTTTTTGG TCAAAATAAA ATGTCCTAAT CCAAAAGCCA 5820 

CCAGCCGATA GCCTGTCAAG TAGCCACAAA AAGCATGACC CAGCTCATGA AGAATAAAGA 5880 

TTAAATACAT GCTTAGAAGA GCGAAGGCAT AACCAAAAGT AAAGGCTAAA ACTGCGGAAT 5940 

ACCCCAACTC TGCAAATGCG ATTGTTCCAC AAGCAAAAGC TAGCATAATA AAGACAACAG 6000 

CTAGCACATA AACCAAATAA GTCCCAATTT TCTTCATAAC ACCTCCAACC AACTCCTAGT 6060 

ATCTTGGATA AGGATAAAAT TCTCCCTTTT CCAAGCCAAT TTTTCCTTCT TCAAAGACTT 6120 

CTTGGTTCCA TTCCATGACA AATTCCTCTG CTTCTGGGTC TTCCAAAAAG TCCATGAGGA 6180 

CATCTAGCCC AACCTCAGCA GTATCTTTAA GGAAAAGCGC AAAATAAGCT AAAAATTCAC 6240 

GGGAAAATCC TTTTTTAGGC AGGTAAGGAA TAACAGTCAA ATAGTCTTCC TCATTGACTG 6300 

TTGACTTGGC AGGATTGTAG AAAAGGACCG CTTCCTCAAA AAGAATGTCA TCTGATGAAA 6360 

CCTCTCCGTC TTCATCCACC ATCTCCACAC CGCAGCATTT TGCGCTTCCA ATAGAAAACT 6420 

CACTTCTACC GCATGGTTGC GTTTGTCCCA GCTAATCTCA AAGTCAAAGG GAAAGTTCTT 6480 

GTCCAACTCT TCCTCTAAAA TATCTAAAAA TCCGTATGTT GCCATTTTGT CCTCTTTCTA 6540 

TGCGACTCTT TAATCGCCCC GATTGCTCGG AAATATGCTA AAATAGATAC TACCATCTTA 6600 

CCACAAAATT ATTTTATGTC CTAATTATAC CATATTACCT CATTTAAACC CTTGGTATCA 6660 

GTGATTTTCT TAAAAGTCTG ATTTCTTCAT TTCTCATAAA AATCAATATA AAAAGCCCTC 6720 

GAAAGGGCTA ATAAATCTAT AAAATCAATA GGCGAGTAAC TAGCACAAGT GGACGTGCTT 6780 
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TTTTATTGAC TATTACCACG ATACCACGCT TAATCTTAGG CTTGAACTTT CTTATCTGCA 6840 

ATAGCGTCTG TCAAAGTCTG AGAAAAGTTA AGCCCCATTT CTCGTCCCAA CTTATCTGCC 6900 

CATTTTGGTA TGGTCAAAGT CTTTTTAATG GGTTCCTGAC TTCCTAGGTA TTCTGATACA 6960 

TCAACAGATA CCATAGAAAT AAAAGATTTA TCAAGGTCAT AGGTTGACAC GAAATCTTCA 7020 

TCATCTTTAA AAGGATCATT ATCAATTAAA GACAAGCTAT TGATATCTGA TGGCTGAGGT 7080 

AACTCTCCAT CACTCTCTAT CAAATCTGCA ACAGTTATCC CTAGCCACTC CGACCCCATA 7140 

GCCAAAGCCT CAGAAATCCC CTCTCCTTGT GTAGCTGAGT ATTCAAAATC TGGGAAATGG 7200 

ACAAAATAAG TCGCTTCTGT TCCGTCTGTG TCGTCATAAT AAAATAAAGC TGGATACGTA 7260 

ACTAACATTT CACTACCTCC ATATCAAAAA GCAGGGACTG AATTTTACAA CCCAGCTTGC 7320 

TTTCTTATCC CTCTTTCAGT GTACTTATTC AGCTCACCAT GAAGGATTGT GATAGGTCTT 7380 

TCCCCTTGCT TTTCCATTTT AATATGGGAG CCTTTACCGC CTCTAGTCTT TATCCAACCA 7440 

TGGGCCGTAA GGAGTTTAAC CATCTCTTTT TGTGTCATAG GCATAGCGCT TTTACCTCCT 7500 

GACAACACCA TTATAACACG TGTTACACGT ATTGTAAAGG AGTGATACTT ATTATTCTAT 7560 

TATACATAAA AGCCCCTAGA TGTGGTTCTA AGGGAAGCCA ATTTATTCAT ACCTATTTTT 7620 

CTAATGAGTA GTAAAAACTG CTTCTTTATC GAGCAATTCA TCATCTGTAT AGTCAATTGT 7680 

AAAAGTATCT CGATCTAAGA CAGATTGAGG CGGAGTTGAA TGAATCATAG GAACACTGCG 7740 

TACTCTATAT TTTTTATCTC CAATTTTTAC AAACTGATAC TCTTCGAAAA TCAAATTCAA 7800 

ACCACGTCAA CGTCGCCTTA CCGTACTCAA GTACAGcCTG CGGCTAGTTT CCTAGTTTGC 7860 

TCTTTGATTT TCATTGAGTA TGATTAACTC TCAAGTCTTC GAAATCAGGA TTTTCAACAG 7920 

TTATTACAAG GAGGCGATTT ACTACTTCAA AAACATCAAT TATTCTATTT TTCATATTTT 7980 

TTCAACCCAT TATTAGAATG AACTTCTTGG TAAGCAAAAT CAAGTTTAGA TTTAATGTTT 8040 

TCGTACAAAT CTAAAATCTC TTTTGGAGTA TCTTCCCGGA AGAAAAGTTT TCTTTTCCCT 8100 

GAAATAACTT GATCACTAAG AATCCAATGA CGAATTTGTT TTGTAAAAAT CAAAATTTCC 8160 

TGACTTGGTA GTTCCATCAT TTCCATTGCT TATCACCTCT CTTTTCATTA TAGTTCATAC 8220 

AATGACATTC AGCAATATTA TTTCTCAAGT CAGCACTTCC ACTTCTTTAG GCTCAACTAT 8280 

CCTATTTTGA GCTTTAAGGA AAATCAAATC TCTCATGCTG ATACCTCTCC TCATTAAATT 8340 

AAATAGTAAA AAAGATTCTA TCTCACTCCC TGATTATTAC AAAACCATTG AAATATCACA 8400 

ACTAATAGGC TAGAATGGAC ATAGTAAGAT ATAGTAGATG AGTCATTCTA CTCAAATCCA 8460 

CGTTAGAAAG GACTGCTATG CCAGACAATC TCGCCGTTCG CATGCGCCCn GG 8512 
(2) INFORMATION FOR SEQ ID NO: 130: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2869 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

CTCGTTTCAA GGTTGAGTCT CTTGCAAATC TTGTTCGCGT TCTTCCTTTT GCCAAGGCAT 60 

CTCTCCCATG GTTGGTGCcA GCCATTGTTG GAATCTTGCT CTCATTGGTT CTACCAAACA 120 

AGCAAGAAAG CGATGTTTTT GAAATGGAAT AATCACTTAA ATCACTTTTG TAGCCAAGTC 180 

TACAGGAGTG ATTkTCTTTT TTTATCCGAT GATAAATGTG TTATAATAGG TAGCGAAAGA 240 

GGTGAAGAAA TGAATCAAAC AGTAGAATAT ATCAAAGAAC TGACAGCCAT TGCGtCGCCA 300 

ACAGGCTTTA CTCGTGAGAT TGCGGACTAT TTAGTCAAGA CTCTAGAAGG TTTTGGTTAC 360 

CAGCCGGTTC GCACATCCAA GGGCGGTGTC AATGTAACTA TTAAAGGTCA AAATGATGAG 420 

CAACATCGCT ATGTGACTGC CCATGTAGAT ACGCTTGGTG CTATTGTCCG TGCTGTCAAA 480 

CCAGACGGCC GTCTCAAAAT GGACCGTATC GGTGGCTTTC CTTGGAACAT GATTGAAGGA 540 

GAAAACTGTA CCATTCATGT GGCTAGCACA GGTGAAAAAG TATCAGGAAC CATCCTCATC 600 

CACCAAACTT CTTGCCATGT CTATAAGGAT GCAGGAACTG CAGAACGCAC GCAAGACAAT 660 

ATGGAAGTGC GTTTGGACGC CAAAGTAACT AGTGAAAAAG AAACTCGTGC TCTTGGCATT 720 

GAGGTCGGTG ATTTTATCAG TTTTGACCCA CGAACTGTCG TGACAGAGAC AGGTTTTATC 780 

AAGTCTCGCC ATTTGGATGA CAAGGTCAGT GCGGCGATTT TGCTCAATCT CCTTCGCATT 840 

TATAAGGAAG AGAAGATTGA ATTGCCCGTA ACAACTCATT TTGCTTTTTC AGTCTTTGAA 900 

GAAGTGGGAC ACGGTGCAAA CTCTAACATT CCTGCTCAGG TAGTAGAATA TCTGGCTGTG 960 

GATATGGGAG CCATGGGAGA TGACCAGCAA ACAGACGAAT ATACAGTGTC TATCTGTGTC 1020 

AAGGATGCTT CTGGACCTTA TCACTATGAC TTCCGTCAAC ATTTGGTGGC TTTGGCGAAA 1080 

GAGCAAGATA TTCCATTTAA GCTGGATATC TATCCATTTT ATGGTTCGGA CGCTTCAGCG 1140 

GCTATGTCTG CAGGGGCAGA AGTCAAACAC GCCCTTCTCG GTGCTGGTAT AGAGTCTAGC 1200 

CATTCCTATG AGCGTACCCA TATTGACTCG GTGATCGCAA CAGAACGAAT GGTCGATGCT 1260 

TATCTTAAGA GCACGTTGGT GGACTAATAT GTGCCTTATT TGTCAGAGAA TTGACCTCAT 1320 

CAAGAAGGAA GAAAATCCTT ACTTTGTCAA AGAGTTGGAA ACAGGCTATC TTGTGGTTGG 1380 

AGACCACCAG TATTTTGAAG GCTATAGTCT CTTTCTAGCC AAGGAGCATG TCAGCGAATT 1440 
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GCACCATTTG 


AAAAAGGAGA 


CAAGACTCCG 


TTTTCTAGAA 


GAAATGAGTT 


TAGTCCAAGA 


1500 


GGCAGTTGCC 


AAGGCCTTTG 


CTGCTGAGAA 


AATGAATATC 


GAACTGCTAG 


GAAATGGCGA 


1560 


TGCTCATCTT 


CATTGGCATC 


TGTTTCCACG 


ACGGACAGGT 


GATATGAATG 


GTCATGGTCT 


1620 


CAAGGGTCGT 


GGACCAGTCT 


GGTGGGTTCC 


CTTTGAAGAA 


ATGACAGCAG 


AAACCTGCCA 


1680 


AGCAAAACCG 


GATGAGATTA 


AAAGATTAGT 


CAAACGTTTA 


TCGTCAGAAG 


TAGATAAACT 


1740 


ATTAGAAATA 


AAGGAGTAGA 


AATGAAGAAA 


AGATACCTAG 


TCTTGACAGC 


TTTGCTAGCC 


1800 


TTGAGTCTAG 


CAGCTTGTTC 


ACAAGAAAAA 


ACAAAAAATG 


AAGATGGAGA 


AACTAAGACA 


1860 


GAACAGACAG 


CCAAAGCTGA 


TGGAACAGTC 


GGTAGTAAGT 


CTCAAGGAGC 


TGCCCAGAAG 


1920 


AAAGCAGAAG 


TGGTCAATAA 


AGGTGATTAC 


TACAGCATTC 


AAGGGAAATA 


CGATGAAATC 


1980 


ATCGTAGCCA 


ACAAACACTA 


TCCATTGTCT 


AAAGACTATA 


ATCCAGGGGA 


AAATCCAACA 


2040 


GCCAAGGCAG 


AGTTGGTCAA 


ACTCATCAAA 


GCGATGCAAG 


AGGCAGGTTT 


CCCTATTAGT 


2100 


GATCATTACA 


GTGGTTTTAG 


AAGTTATGAA 


ACTCAGACCA 


AGCTCTATCA 


AGATTATGTC 


2160 


AACCAAGATG 


GAAAGGCAGC 


AGCTGACCGT 


TACTCTGCCC 


GTCCTGGCTA 


TAGCGAACAC 


2220 


CAGACAGGCT 


TGGCCTTTGA 


TGTGATTGGG 


ACTGATGGTG 


ATTTGGTGAC 


AGAAGAAAAA 


2280 


GCAGCCCAAT 


GGCTCTTGGA 


TCATGCAGCT 


GATTATGGCT 


TTGTTGTCCG 


TTATCTCAAA 


2340 


GGCAAGGAAA 


AGGAAACAGG 


CTATATGGCT 


GAAGAATGGC 


ACCTGCGTTA 


TGTAGGAAAA 


2400 


GAAGCTAAAG 


AAATTGCTGC 


AAGTGGTCTC 


AGTTTGGAAG 


AA1 AV. 1 A L\jKj 




2460 


GGAGACTACG 


TCGATTAATA 


CTCTTCGAAA 


ATCTCTTCAA 


ACCACGTCAG 


CGTCGCCTTA 


2520 


CCTACTGACT 


GCGTCGGTTC 


TATTCACAAC 


CTCAAAACAG 


TGTTTTGAGT 


CGATTCGTCA 


2580 


GTTTTATCTG 


CAACCTCAAA 


GCTGTACTTT 


GAGCAstGCG 


GCTAGCTTCC 


TAGTTTGCTC 


2640 


TTTGATTTTC 


ATTGAGTACA 


AAAAGTAAAC 


TTTTCTCTTG 


CAATTCCAGA 


TAAATAGTGT 


2700 


ATAATGGATG 


GGTATGTGAA 


AAACATACTT 


GTGGGAGGTA 


AAAATCTCTA 


ATTACCGCCA 


2760 


AAACCACAAA 


GGAGGATTTA 


AAAATGGCTA 


AAAAAGTCGA 


AAAACTTGTA 


AAATTGCAAA 


2820 


TCCCTGCTGG 


TAAAGCTACA 


CCAGCTCCAC 


CGGTTGGACC 


TGCTCTTGG 




2869 


(2) INFORMATION FOR SEQ ID NO: 131: 









<i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6186 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 



WO 98/18931 



PCT/US97/19588 



897 

CTGAATCCCT TATAGGAGTC CAGTAACTTT TTAGCCTCTA CTTTGCCTTC ATAGGCAGCT 60 

TCAACATCAT TAAAAAAAGA ArGCACTGAA GCAAGTTCTT CAGTGCTCCA CGACAAATCT 120 

AGTGGGTAAC TATACTGTTT GTTCATTAAC TAATACCAGC TCTCATTCTT GCTTCTTTTA 180 

GTTCTTGCTT ACGATAACTA CGAGGGAGAA AAGCACGAAT CTCATCTTCA TTAAAACCGA 240 

TTTGCATACG CTTGGCATCA ATAATAATTG GACGACGCAA AAGACTAGGA TACTGCTCAA 300 

TCAAATGAAG CAATTCCGAT ACCGAAATAC TCTCTACATC AATATTCAAT TTTTGAAAAA 360 

TTTTTGAACG AGTTGAAATG ATGTCATCAG TACCATTTTC GGTCAAGGAA AGGATGTGTT 420 

GCAATTCTTT TCTTGTTAAA GGACTGGTCA TAATATTGTG TTCCACAAAG GGAACTTATG 480 

TTTTTCTAAC CAGGCCTTAG CCTTACGACA TGATGTACAG CTCGGTGATA GAAATAGTGT 540 

AATCATGCTT TTCTCTTCTT ATCTATACTT TGCTACTTCT ATTATACAAA AAAATAAAGC 600 

GCTTGACTAG GGATTTTTAG AAAAAAAGCC TATTTTTTCA AGAAAAATAG GCTTTTTGCG 660 

AACGATTGAC ACAATTGGAT TTGGTTAATT CACTCTTAAC GATGGTTTTA AACGATATAT 720 

ATTTTTATAT ATGTAAATTA AAAACATCTT TCCTTTCACT TCCTACGACT TTTCAGATAC 780 

AGATAGCCAA AGAAGTTTTC ATAGAGGGCA AAAAAGAGGA GGAAGGCATG AAGAAAGAAG 840 

GTCTCTGGCA AAATCATAAT AACAGGATCC TTGGCTGGAT CAAAAAGCCA GGTATCATCT 900 

CCCACAAAGA GAATTTGATG GAAAAGAGTA AAGAATTGGT CAAAACCAAT CAAAACTCCC 960 

CCAAGTCCAA TCATCACAGG TAAGACTACT AGAGCCAGGA GACTTTTTCG ATAAAGAGAC 1020 

AAAAAGTCCT TTTTCACAAT CCTATTGACA AAGACATAGA AACTTGGCAG TGTCACTAGA 1080 

GCTACTAGCT GAACCAAATG AAAGAGATTC TTGACCACTG CGAAATGGTG CAGACCAGCT 1140 

GCTGACGAAC GAAAATCAGG CATCTGTAAG ACCTGACTAA AAGGATTGGT CAGATAATTC 1200 

ATCAAGATAT GAAAATTGTA TTGAATGGTT TCTGGTTTTA GATAGACTCG ATTCGTTAAG 1260 

TTTAGCCACT GAATCTCCAT AGGATAGAAA ATCCAAGCCA GATAAATGGT CAGAAGGATG 1320 

GAGAGGGAGA GGAGAAAGAG CATAGAGCCC CAAAAGATCA ATTTAGTTTT CATCAAAATC 1380 

CCACTCCGCA AGGCTAGAAA CCACATGTGT CGGTGCGATT GGCAGGCCAG CTACTTCTTC 1440 

TGCCTTAGTA AAACCTGTCG TCACCAAGAG CGTTGGAATG CCATTGTCAA TCCCAGCCCG 1500 

AATATCAGTC AAATAATTGT CCCCAACCAT GATTAACTCT TCACGTTCCA AACCTAAGTG 1560 

CTCAACCGCC TTGTCCATAA TGATGGCATT TGGTTTTCCG ATATAAACCG GCTTCACTCG 1620 

TGTCGCTACT TCAAGCAGCG TAATCAGTGA GCCAGCACCT GGCAAAAGAC CGCGTTCCGT 1680 

CGGGATGTTG AGGTCAGGAT TGGTTCCGAT AAAATGGGCA CCCTTTTGAA TAGCAAGAGT 1740 
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TGCTGTGGCA AATTTTTCAT AGTCGACTTG CCAATCCAGA CCAACTACCA CGTAGGCAGG 1800 

TTTTTCCTTG TCTTCCACAT AACCAGCCGC CTTGATGGCT TCCTTGAGTC CTGCTTCTCC I860 

GACGACATAG ACGGTCTTTT CAAGCCCCAA ATCATTCATA TAGTCGATGG TTGCCAAAGT 1920 

CGCTGTGTAG ACAGTCGATA GGGGCGTATC GATATTAAAA TTCTGAGCCA ACATCTCCTT 1980 

AACACTCTCT GGAGTGCGGG TTGTATTGTT GGTTACAAAG AGATAGGGAA TGTCCCGCTT 2040 

TTGCAATTCA TGAACAAAAG TCTCTCCAGC AGGGATTCGG TCTTTCCCCT TATAAATGGT 2100 

TCCGTCTAAA TCAATTAAAT AGCCTTTATA TTTCATCTAT TTCTCCCTAA GCCTTTTTTA 2160 

TTTCTTGCCA AGTAATGATT GCTTGGGCAT TGATAACCCC ATCACTTGTA ATTTCATGCT 2220 

TGCTTTCCAG TCCAGTCCGT TCAACAGCCG ATGTAATCAC CCCACCTGGT CGAACTTCCT 2280 

TGACATACTT GAGGTTGATT TTCTTGGGAA TATAGTGGGT CAAAAAATCC GCTCCCATGA 2340 

CCTCAAAAAT CCAGTCCAAG TATTTACTGT TATTGACATG ACCATTCATA TCCAAGTCGT 2400 

AAAAACGAAC ATGGTAATCC TTGCTGATCG GTTCTTCCAA GGACTCATAC TTCGGTCCAC 2460 

GGATAAGTTT TTTATCAAAA TCAGACTGGT AAGGAGCCAC AATCTCAGGT TCAACAACAT 2520 

GGACTTTTCG ACTGTCGCGG TCCATGAGAA CAAAGGTCGC CATCATGTGG ATGAGCTCCT 2580 

GCTCCGCTTC ATTATAAATA GTAAAGCGAC GGTAGCAAAA AAGTCGATTG TAGCTCAAGG 2640 

CTTCCGTTTC GATGGTAATT TCTTCCGCAA AACGAGGCAA ACGAACCACC TCAATATCAT 2700 

ATTCTACGAT AATCCAGACC AGATTATATT CTTCCAAAAT GGCCTTATCA CTAACTCCCA 2760 

GTTCAATCGA CTGCATCCCT GAAACTTGCA GTGACAGCAA AATCACATCT GGAAGTTTGA 2820 

TATGACCGTT CATATCAGCC ATATCAAAAG GAATTTTCAT TTTCATTTGA TAAGTTAAGC 2880 

CCATGATCCT ACTCCAAAAT AAATCGTTCT GCTACAGTAT CTCCCAAAAA GAGACCTCTC 2940 

TTTGTCATGC GAACGTGGTC ACCCTCAATC TGCATGAGGC CTTGTTGAAC CAAATCTCTG 3000 

ACAATTTCTC CATAAAGTCC AGCAAAAGAC TGTCCAAATT TTTCCTCAAA TCGCGCCATG 3060 

GAAACCCCGG ATTTCTTGCG GAGTCCCAAG AACATTTCTT CTTCCATTTG CTCCTTTTGA 3120 

CTCAGGTGAT CTTCTGTAAT ACAAGCATTG CCTTCCTCAA CCGCACTGAG ATAATGACGA 3180 

ATGGGACCAT GATTTTTATA GCGTACTCCA TTGACATAAC CAGATGCCCC TGCACCAATA 3240 

CCATAGTATT CAGCATTGTC CCAGTACATG AGATTATGAC GACTTTCAAA ACCGGGTTTG 3300 

GAGAAATTAG AAATCTCATA ATGCTCAAAA CCCGCTCGCT CCAGCTCTGC AATGATGTAC 3360 

TCAAACATCT CCGCTTCTAG TTCCTCCTTA GGCAGAGGCA ATTTCCCACG TCGCATCCGG 3420 

TTCATAAAGA CCGTATGGTT TTCTAAAATC AAACTATACA AACTCATGTG GGGAATATCC 3480 

AATCCAATGG CTTTAGCCAC ATTTTCCTTT ACTTGCTCCA TGGTCTGACC AGGCAGAGCA 3540 
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TAAATCAAAT CAATGGAGAT ATTGTCAAAA CCAGCCAGTT TCAGGCGATC GATATTTTCA 3600 

TAAATATCCT TCTCCAAATG ACTGCGCCCA ATCTTTTTCA ACATCTTATC ATCAAAGGTC 3660 

TGGACACCTA GCGAAACACG ATTGACAGCC GAATTTTTCA AAACAGCTAT CTTATCCGCA 3720 

TCCAAATCGC CTGGATTGGC TTCAATGGTC AACTCTTCCA AGACAGACAA ATCCAAGTTT 3780 

TTAGTCAAGC CATTCAGTAA CACCTCCAGT TGCGGAGCCG ACAGGGCTGT CGgTGTTCCA 3840 

CCACCGATAT AAAGGGTTGA CAACTTTTCA ATATCATAAG AACGAAACTC TTCCAGCAGA 3900 

TGCTCTAAAT AGCTGTCGAC TGGCTGATTT TTGATGAAGA CCTTTGAAAA ATCACAATAA 3960 

TAACAAATCT GGGTACAAAA TGGGATGTGC ACATAGGCTG ACGTTGGTTT TTTCTGCATA 4020 

GTAATTATTA TACCACAAAG ACTAGATTCC AGATAAAAAT CACCATCCCC AGATACATAG 4080 

TCCGTCCGGA GATGGTGATG GTTTATTCTT CTGTTATATC AATCACAATC TCTTCTGAGT 4140 

CATCAAGAGC TTCGGCTTTT TCTTGCCATT GCTCCTTGAG ATTATTTAAT TGATTTTTTG 4200 

ATGCTTCTGT CGCTTGAAAA GCATAGGATT TAGTTTGAGC AAGTATACTG TCCACAGTGA 4260 

TTTCACCTGA CTCAACCTGT TCTTTTGTTT TCAGAACAAA ATCTGTAGCC TGCTCCTTAA 4320 

CTTCTGTCAG TTTTTCACAG ACTTGCTCCT TGGCATACTC CGGATCTTCT CTCAAATCAT 4380 

CTAGAAAATC TTGAGCCTGA CTGCAAACTT GTTTGCCCTT ATCACTTGTT AAAAACAAGG 4440 

CAAGAGCTGC ACCTGAAACG GTTCCTAAAA GGATTGAGGA TAATTTACCC ATAAGGATTC 4500 

TCCTTTTTTA TTTTTTGAAA AATTTACTTG CAAGACGAAG AGCTGACAGA CTTGCACCAG 4560 

TCTTGAGTGT TTTTGAACCA GCTGATGAAG CTTTCTTGCT CAAGACACGC GCATGGTCAT 4620 

TGAGGTCTGA AACAGATAGA GATAAATCTG CAACAGCACT GAAGAGTGGA TCAATCGTAG 4680 

CCACCTTGAC ATTGATATCA TCTGCCAAGA CATTGACCTT AGCCAACAAC TCATTGGTGT 4740 

GATGCAAGGT CACATCCACA TCTGAAGTCA AGGTTTTAAT CGTCTTTTCT GTTTCATCGA 4800 

TGACACGACC AAGCTTTTGT ACAGTAATGA TCAGATAGAC CAAAAAGACA ATCAAAGCTA 4860 

GGGCAACAAG AATATATGCA ACTTCTAACA TTTAGTTTTC CTCCTCTGTA ATATAGTAAG 4920 

GGGCCTTCTT TCGATTTTGA TAAATAACGA TCATTATACC GAGACCGATA AGGACAACTG 4980 

ACAGCCATTG GGACACTCGA AAGCCGAAGA ACATGAGACT ATCTGTTCGC ATACCTTCGA 5040 

TAACCATACG ACCGAAACCA TACCAAATCA AGTAAAAGGC CGTGATATGA CCTCGTCTGA 5100 

GACTCTTCCA TTTCCGTCTA AAAATCAGAA TCAAGGCAAA GCCAAGCAGA TTCCATAGAG 5160 

ACTCATAAAG GAAAGTCGGT TGACGGTAGC TCCCCTCAAT ATACATCTGG TCACGGATAA 5220 

AGCCAGGTAG ATAATCCAGA TTATCCACTG TTGCACCATA AGCTTCTTGG TTAAAGAAAT 5280 
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TACCCCAACG CCCCAAACTT TGAGCAATCA TAACGCTAGG CGCCGCAATA TCTAGAAAAT 5340 

CCCAAGTATT GATGAGTTTA CGGTCAGCAA AGATATAGAG CACAAGAGCC CCAGTTATCA 5400 

AACCACCGTA AATGGCCAAA CCACCATTCC AAATGGCAAA AATCTCTCCT AAATTCTGAC 5460 

TATAGTAATC AAATCGGAAA ATAACATAGT AGAGACGAGC TCCTAAAATA GCCAAGGGAA 5520 

AGGCTACTAA GATAAAATCT AAAATATCGT CTGGTATGAT CTTCTTTCTA GGTGCTTCTT 5580 

TCATGGTCAA ATAAACCGCA AGAATCAAGC CTGTCACAAT ACATAAGGCA TACCAACGAA 5640 

TGGCTAGGGG TCCTAGTTGA ATAGCAATTG GATCAAGCAT TTTGCACCTC ATTTCGAGCG 5700 

ATTAGACTTG TCAGTCGTTC GTCGAACAAA CGGGTCGCAT CAAAGCCCAT TTCCTTGGCA 5760 

CGATAATTCA TGGCAGCTGC CTCAATCACA ACAGAGATAT TACGACCTGT TTTAACTGGA 5820 

ATACGAATAC GAGGAATGtA CGCCAGAAAC TTCAAGTTCC TCTGCATTAT TTCCAAGACG 5880 

ATCAAAGGTC TTATGCGTAT CGTAATTTTC CAAATAGACA GCAAGCTGAA CCTGTGAAGA 5940 

ATCCTTGACA GCACTCGCAC CGTAGAGACT CATAACATCG ATAATACCAA CCCCACGAAT 6000 

TTCAATCAAG TGTTTCAAAA TTTCAGCTGG TTCACCCCAG AGAGTAATCT CATCCTTGGC 6060 

AAAGATATCG ACACGGTCAT CGGCTACCAA ACGGTGACCA CGTTTGACAA GCTCAAGACC 6120 

TGTCTCGCTC TTACCAATTC CACTATCTCC CTGAATCAAG ACGCCCATCC CATAAATATC 6180 

CATCAA 6186 
(2) INFORMATION FOR SEQ ID NO: 132: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9541 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

GAAAATCACA ACCCTTTTTG CAAAATTTTT GAGATTATTT TCACAAACTT GATTTTTCAA 60 

AGTATACTCA ATAAAAATTA AAAAAATCCA CTACGTCAAG GCGAGGCTAA TGTGGTTTGA 120 

AGAAATTTTC GAAGAGCGTG AATGAGTATC ATCTATAGTA AAATAAAAAA ACTGAACAAT 180 

TTGGTTGGGG ACAGCCAAAC CAATTTCTCA CAATGTTTCA GAAACAAGGG TGTGCTATTC 240 

CAATTTCAGC CTACTATAAC TGTCATAGAT TGCTGAAACA AAGTCTAGGT AAAAGTCTTC 300 

ATAATAAAAA GACCTCCTAT CAAGTGTTCA AAAACTTTGA TAGGAGGTCT TGTTTTGTGA 360 

AAATATTTAT CAAATTTTCT ATACAAGTGA GCTGTTAGCC AGGTTCTTTC TATTCTTTCA 420 

ATTTCAATGA ATGGATTTTT TACTAATACT CATAACTGGG AATTTGTCTG TGTAAAAATA 480 
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GCGAGATAGA TGGTATTTAT AAAACACTCA AGACAGCTAG ACTAATATCA TTTAAAACAT 540 

TATCTTCTTT TGAGCGACTG TTGGTTACCA ACATAGCTAA ATTTCCTGCA TTTTCAAATT 600 

GATAGGGTTC TGATTTAGCA TTCACAACCA CCAAGAGGTG TTCTTTGCCG TGAACTTCAT 660 

AGATAAGGTA GCCGCTATGT TCAATCGCAG AATGCACAAA GACATGATGG TAAATTTCAT 720 

CATAGCTAGA GTAAGAAAAG GCACCAGTTT TTGTCTTCAA TCGGATGACT TGACGGATAA 780 

ACTCAATACT GTCTTGACGC TCATTAATCA AGTTCCAGTT CACTTGGTTC ACACTGTCAG 840 

GAGCATTATA GCTATTCATC GCACGCTCTC TATCATCATG GGTCAACTCA CCATTTTCAC 900 

CAGTCGCAAC CAGTTTGGTA CGACCAAATT CTTGACCGAT TTCCATAAAG GCCATCCCCT 960 

GCATGAGCAG ATTCATGGCT GTGGCAGTTT CGACCTTGCG CATGATTTGC TCTGAACTTT 1020 

GGTCTGGATG AAGGGTTGCC AATAAATCGT GAAGATTGTA ATTGTCATGG GCTTCTACAT 1080 

AGTTAAGCAC CTGATTTGGA TGTGTATAGC TTCCTAATTC ACGACTTCCT AGGATTGCTT 1140 

TAGCTAGAAT TGGCTCTGTC GCAGCACCAC TGACAAAACC TGACTTGATA GCACCATAAA 1200 

CTTCTCCCCC TTTGACAGCA TCGCGCTGAT TGTCATTAAA GAAACCAATA TTTGGCATCT 1260 

GGTAGGCATT GTCCTTCTTG GCCTTATCAT AAGGGGCAAG ACCTGTTCCC ATATCCCATC 1320 

CTTCTCCATA GAGGATAATG TTGGAGTCGA TTTCATCCAA GCTTTGACGA ATCATCTGCA 1380 

TGGTCTTGAC ATCATGAATC CCCATCAAGT CAAAACGGAA GCCGTCAATA TTATATTCCT 1440 

GCACCCAGTA TAGAAGAGAA TCAATCATAT ACTTGCGAAA CATTTCGTGT TCACTGGCTG 1500 

TTTCATTTCC AACACCCGTT CCATTCTGGA AGGTACCATC TGGATTCATA CGATAATAGT 1560 

AATCAGGGAC TGTTGTTTGG AATGGTGCAT CAACAACTGA GAAGGTATGG TTATAGACTA 1620 

CATCCATAAT GACTCCAATA CCCGCATCGT GATAAGCTTG AACCATCACC TTCAAATCAC 1680 

GAATGACCTG AGCTGGATCA TCTGGATTAG TTGAAAAACT AGTTTCTGGC GCGTTATAGT 1740 

TTTGTGGATC ATAACCCCAG TTGTAGGTTA CATTTCCATC CTCATCGTAT TCTTTATGAC 1800 

GGTCTGCAAT TGGTTGCAAT TGAACATAAT TGTAGCCCAG CTTCTTGATG TAATCAAAAG 1860 

CAGTTGACTG GCCGTATTGG TTAACTGTTC CAGCCTGAGC AGCACCCAAG AAAGTTCCTC 1920 

GAAGATGTTC ATCTACACCC GATGTAGGTG ATTTAGTCAA ATCACGAATG TGCATTTCAC 1980 

AGATAACTGC CTTACATGGA TTTTCCAAGC GCCAAGTAGC CTCCGAACCG TGCTTAACCT 2040 

CGAAGTTTTC AACTTGCTTT TCTACATGGC TCAGAATAGC TGAACGTTTG CCATCAGGGC 2100 

TGGTCGCGAT TGTATAAGGA TCACGTGTCA GTGTTTGGTG ATGAGGGAAT TGGACTTGAT 2160 

ACTGATAAGT CTTACCTACC AAATCTTCTT CAACATCCAA ACTCCAGACA CCGATTGTAT 2220 
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TGTCCTTATG ATTATAAGAG TAGCTATTGC CTCTTTTCAT CTCAAAAGTC TTCCAAACGG 2280 

GTGCATCATT AGCAGCTGAT TCATAAACGA CAACTTGCAC TTCTGTCGCT GTAGGTGACC 2340 

AGAGAGAAAA ATGAGCCTGA TTGTCCTCTA CACGGCAACC CAATTCTCCT TGGTAACCCC 2400 

AATGATGATC AAAACTAGCA CTGTTAATGG CCTTATCAAA GGCAAAAGGA TTTTGATTTT 24 60 

TATAGAAAGG ACTGGCAATA GCAGGATTTT CAGAGTAATA AATCCTATCA TCGCCTTCCA 2520 

AAATCCAGAC CTCTGTTAAT AGGGGATAGT GATTAAAACG GATAGAATAT TCTTTACTAG . 2580 

TTTGACCTGT ATGAACCACA AAATTCAAGC TTTCTATAAC ATGTGAACTT GGGTGTTCAA 2640 

AGCTAAATAA AGCTCCAAAA TAATCTTCTT TGTAGGTTAG CAAATCAATT CGTTGATCCT 2700 

GACTTTTTAC AAAGGAGCAA GTGTCATATT CTCCATTCTT ACGATGGTAA TGAATGCGCA 2760 

TAGGGTAGTT ATACATTTTT TATTTTTCCT TTTTACTTTG TTTCTATTTC ACTAATAAAT 2820 

TTTTGTCAAT CTCGTCTCAA TTAACAGACA TAGTCATATT CTCTAAACTC TGTTTTTAAA 2880 

CGATCCATTA CAAACTTTCT AGCCATGCCT CATCTCTGAC CTGGATACCA AGTTCTTGTG 2940 

CTTTTTGCAG TTTACTTCCA GCGTCTGCAC CTACCACGAC GAGGTCGGTC TTTTTAGAAA 3000 

TACTACCTGT CACTTTGGCA CCCAGACTTT CGAGTTTACT TTTAGCTTCT GAGCGCTTGA 3060 

GTCGTTCCAA TTTTCCTGTC AATACCACGG TCAAACCTGA CAAGGCCGCA TCCGCTACTA 3120 

CCGTCTGTCC TTTATAGTCC AGATTGACCC CAGTTTCTTT CAATTCTCTG AGCAGAATTT 3180 

CAGAGCCTTC TGTCGCAAAA TAAGTCTGAA GACTTTTGGC AATCACGCCA CCTAGACTTT 3240 

CAATACTAGC CACTTCCTCT GAATCTGCCT GAG AC AG ATT TTCAATTGAA TGGAAATATT 3300 

GAAGTAAAAG CTGACTAACC TTGCTTCCGA CATGACGAAT TCCCAAACCA AATAAGAGCT 3360 

TCTCGGCAGA ATTTTCCTTT GATGCTTGGA TAGCCTGATA CAGTTTAGCA GCGGACTTTT 3420 

CCTTAACTCC CTCTAAAAGG AGGAAATCCT CTTCTTGCAA ACGATAAATA TCCGCCACAT 3480 

CCTTGACTAA ATTAGCAGCA AAAAGCTTCT CAACAATAGA TGGACCAAGG CCTGTAATAT 3540 

TCATAGCATC ACGAGAAGCA AAGTGAATCA AGCCTTCCAT GATTTGAGCA GGGCAACGCG 3600 

GATTGATACA ACGTAGGGCC ACTTCATCTT CAAAGTGCAA CAAGTCAGAG TTACAACTTG 3660 

GACAGTTTGT AGGGATATCT AGTTTTTCTT CAGAAACCCG TTTGGACTCT ACCACACGTA 3720 

AAACGGCAGG GATGATGTCA CCAGCCTTAT ATACAATGAC CGTATCGTCT TTTCGGATAT 3780 

CTTTTTCAGC AATATAATCT ACATTGTGCA GGGTCGCACG GCTAACAGTC GTACCGGCAA 3840 

GTTGTACTGG TGTTAGATTA GCAGTTGGAG TTACAACACC GGTACGGCCA ACTGTCCAGT 3900 

CAACTGATAA GAGTTGAGCT TCTTTTTCTT CGGCAGGGAA CTTGTAGGCT ACTGCCCACT 3960 

TTGGAGCCTT AACTGTAAAA CCAAGTTCTT CTTGACTTGC TAGGTCGTTG ACCTTGATTA 4020 
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CCACTCCATC AATATCGTAA GGCAGATTTT CCCGTTCCTG TCCTACTTCT TGGATAAAAT 4080 

TCCAGATTTC ATCTATGTTT TCAGCCAAGA TTCGCTTAGG ATTGACCACA AAACCTAGTT 4140 

GTTCTAGGTA CTTCAAACCC TTTTCTTGGC TATCACGAGT TGAAGGGCTG GCTTCTTGAT 4200 

AGAGAAACGT TGCAAGATTA CGCTTGGCAA CTACTGCTGT ATCCAACTGA CGCAGAGTTC 4260 

CTGCTGCCGC ATTACGAGGA TTAGCAAATT CAGGCTCTCC ATTTTCTTGG CGCGCTTGGT 4320 

TAACTTGGTC AAAGGAAGCG CGTGGCATGT AACATTCCCC ACGAACTGTG ATATCTAGTT 4380 

CTTCTGGCAA AGTCAAAGGG ATGTCCTTAA CACGCTTGAG GTTTTCTGTG ATATTTTCAC 4440 

CAATTGAACC ATCTCCACGT GTTACCCCAG CAACCAAAAT CCCCTTTTCA TAAGTCAGCG 4500 

AGATAGATAA GCCATCGATT TTCAGCTCAC AAATATAGGT CGGATGAGCC ACTTCCTTAC 4560 

GAACACGCGC ATCAAAAGCA TCTAGCTCCT CACATGAAAA AGCATCCTGC AAACTATAAA 4 620 

GAGGATACTG ATGACTGTAT TTTTCAAAAC CATCTAAAAC CTTGCCACCA ACACGATGAG 4 680 

TCGGACTGTC TGCTAGCACT TGCTCTGGAT AAGCAGTTTC TAACTCGACC AACTCACGGT 4740 

AAAGGCGGTC ATACTCACTG TCTGAAACCG AGGGATTATC GCTGGTATAG TACTCAGTCG 4800 

CATAGCGATT GAGCAAAGCG ACTAACTCAT TCATTCTTTT ATTCATAAGA CCATTTTACC 4860 

ATAAAACAAG CCCTCCTCAC AAACGAGAAG GGCGGAAAAA ACACTTAGTT TGAAATTATT 4920 

TTTGAAACTC AAGCAACCTT ATATCAATTT TTCAAAATGA GTTCGAACAT ATCCGAGAGC 4980 

TAAGAAATAT AAGGCTACAA CTCCAAGTCC AATAATCAAG AAAGAATAAA GATGGACACT 5040 

TGGCAAGACT GTCATAAATC CTTTTGCAAT AGGCATAAAT AGAATAGCTA AGGTAAAAAT 5100 

TGTACTCAGT ACTCTTCCAA GAAATTCGCT CTCAACCTTG GTTTGTACTT GAGTAAAAAA 5160 

GTGAATATTA AAAATCGTCA TAAACAATTC ACAAACTAAA TTTCCAGAAA AGGAAAGAAA 5220 

AGTTGGAAGT GGTAATCCCA TCATAAAAAC TCCGACACCT GTCAAAGCCA GTAAAATCAA 5280 

AAGATTATAA ATATTAGCTT TAATTTTACT AGCTAGAAGA GCCCCAATGA TGGAACCAAT 5340 

AGCCCCCATA GTTAAAATAC TTGCATAGGC TCCTTCTGAC CCGTAAAGCT GATTCGAAAA 5400 

GGGAAGTAGA AATTCAAAAG CTGCAAAAAA GAAATTAACG CTGGAAGCTA CCAGCAAAAG 5460 

GAAGAAAATT TCTTGCTGAT GCCAGATATA GTGTAACCCA TCCTTGATAT CTACAAAAAT 5520 

ATCTCTCCCA GTAAAAGCCT TTTTCTCTTG AACTTTTGCT TCCTCTTTTG GAAGGAAAGC 5580 

CACTAGAACA AAAGCAATGA AAAAAGTCAG CGAGTCTAGC AGTAGCGTCA TATGGAGACT 5640 

TGCAAACTGT AAAACAAGGA AGGAAAGAAC AGGAGAGCTA ACACCTACAA CCTGCAAAAC 5700 

CAGCTCTAAG CGAGAATTAT AGATCACAAT CTCATCTTTC TCCACCACTT CAGTTATGAT 5760 
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AGCTTTATTG GCTGTGCGAG AAAAGGCAAA AGCAATAGCC TGCACAATGT TAGCAACAAT 5820 

CAAAGCGCCA ATCATCCAGC TATCATTCCT TATGAAAGAA ATAGCCAGAC AAAGAATCCC 5880 

ACAAACAAGA TCTGCCGTCA TTAAAATCTT ACGACGAGAA AAACGGTCTG AAATAACTCC 5940 

GCCAAAGGGA TTGACGAGAA TAGATGTGAC GAGCTCAGAA ATCTGATACA TTCCTAAAAC 6000 

TGTCTGTCCT ATAGTCCCCA TAGAAGCCAA CCAGACACTA TTTCCATAAT CATAGAGCAT 6060 

ATTTCCCATT TTATTGATAG CCCCACGGCT AATCAACTGC ACTGCATAGC. GATTCATATT 6120 

AAAGCTCCTC TCAAATTTTG AAACTATTGT ATCAAAACCG AAAGGAGCTT TTTATTTTTT 6180 

CCCTTATTTG GGAAAATTAA CTTTTGACAA ATTTTTCGTA GTGTTCCTGA TAATAGGCTA 6240 

CTTGCTCTGG AAGACCTAAC ACATCAAAAA TATGCATGGC CTCTTGCATC TGCTTACAGC 6300 

CTTCTTTACA CTGTCCTTTT TGATATAAGG CAAAACCTTT TAAATAATGG AAAACATTAC 6360 

GCTCATAAAG CTTAATACCT TTGTCAATAA TCTTCTCTGT ATAAGCCTCA AAATAGTTGG 6420 

CATTATAAAA AGAAGAATGC TCTAAACAAT GCTGGTAACA ATTGAGGGCC AAAATCAACA 6480 

CTAATCTCTT ATGGCGACTA ATCTCTTGGT AAAATTCCTC CCTCTCCATA ACTTCTCTAC 6540 

CAATCCGAGT GACATAGTCT ACATCGTAGA AACTATAGAG GTTACCGAAA AGAATCAACT 6600 

CATACATGGT CCATTCTTCT GTTTTGAAGA GATAATCTGC TACCTTACCC AAATCATCCT 6660 

GCTTCATATC ATAACTCGCA TCTCTTTGAC AAATCAGACC TTGTAGCAAA ATCCAGTTCA 6720 

GCTCAAAATA AAGGGGAGTC GTCGAACTCT TAGACTTTTC AAGTTGTTCT CTTTGAAGCT 6780 

TTTGAAAACC TGCAATATCG TTTGAATAGT AAAGTGGGAT AATCTGTGCC ATCATAGACA . 6840 

CATGTTCATG ATTATGAAAA TTCCTTGCCT TATCCATGAA ATTTTCGATT GTTACATGAA 6900 

TGTTATCCAA AATCTCAAAG AAACGGGAGA CTGCCAGGTC AGACTCCCCA AGCTCAAAGC 6960 

GAGATAACTG AGAGGTAGAG CAGGATTCGC CTGCTGCTTC CTTTAAAGAA TAATTTCCAC 7020 

TTGTTCGAAA TTCACGAAAT ACTTTTCCAA GATGTTCCAT CTTTACACCT GCTCTGATAA 7080 

TTCTTCCCAC TCAAGCATAG CTTCTTCCTG ACGATGGCTG ATTTTGTCCA GCTCAGCCTG 7140 

TAATTCCATG AGTTTGTCGG CATCGTTTGT TTCCAACATT TGTTCAGAAA TGGCTTGGCT 7200 

TTGACTTTCT AGCTCTTCAA TTTCAGCTTC TAGACTTTCG ATTTGTCGCA TGAGTTTGCG 7260 

AACTTCTTTT TGACTTTCTT TCTGGGCCTG ATAGTCATTG ACTGGACTTG CTTCCTTTGC 7320 

TTGATTGCTA GTTGAAGCTT CCTCAGTCTG ACTCATTTCT GCTGTTGCTT TCTTCTCAAC 7380 

ATAGTAGTCG TAATCTCCAA GGTAGAGAGT TGAACCATTC TCAGACAATT CCAAAACATG 7440 

AGTTGCCACA CGATTGATAA AGTAACGATC ATGACTGACA AACAGCAAGG TTCCATCAAA 7500 

GTCAATCAAG GCATTTTCTA GCACTTCCTT ACTATCAATA TCCAAGTGGT TGGTCGGCTC 7560 
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ATCCAGAATC AAAAAGTTAT TGTTTTCCAT AGACAATTTA GCTAAAAGCA AACGAGCTTT 7620 

TTCGCCACCA GATAGCATGC CGACTGATTT TTTAACATCA TCTCCTGAGA AAAGGAAGGC 7680 

TCCAAGACGG TTGCGGATTT CAACTTCTGG TGTCAGTTTG AAATCATTCC AGAGTTCATC 7740 

CAGCACCGTA TTACTTGGTG TCAGCTTGCT TTGGGTTTGG TCATAGTAAC CAACCTCAAC 7800 

ATTAGCGCCA AAGCGCTTTT CTCCCTTGAT AAAAGGAATC TGGTCCACAA TAGACTTGAT 7860 

AAAGGTTGAC TTGCCGATAC CATTTGGACC AACGATAGCG ACAGCATTCA TCTTACGAAG 7920 

ATCTAGGTTA ATCGGTTGTG ACAAGACTTC CCCGTCATAG CCAACAGCTG CATTTTCAAC 7980 

AGTCAAAACA ACATTGCCCG ACGTTTTTTC AGACTGGAAG GTCATGTTGG CTGATTTCTT 8040 

GCCAGCTTCA GGCTTGTCCA AACGTTCCAT TTTTTCCAGT TGTTTACGGC GAGATTGAGC 8100 

ACGTTTAGTC GTTGAAGCAC GAACTAGATT GCGATTGACA AAGTCTTCCA GAGCAGCGAT 8160 

TTCCTTCTGT TGCTTTTCAT AGTTTTTTGC CTCAGTAACT AGCTTTTGCT CCTTCAATTC 8220 

GACAAAACGA GAGTAATTCC CCACATAGCG ATCCAAGGAA TGCTTGGTCA AATCTAGCGT 8280 

AATTGTCGCA ACCTTGTCCA AGAAATAACG GTCGTGGCTG ACGATAATGA GGGCACCGCT 8340 

ATAGTTTACC AAGTAATTCT CTAGCCAGGC GATGGTTTCA ATATCCAAGT GGTTAGTTGG 8400 

CTCGTCCAAG ACCAAGAGAT TGGGCTTTTC AAGGAGCATT TTGGCAAGTG CCAAACGAGT 8460 

ATTTTGACCA CCAGAAAGCT CAGCAATTTT CATCTGCCAC ATAGACTCGT CAAACTTGAA 8520 

TCCATTCAAA ATCGCTCGAA TATCAGCTTC ATAGGTAAAG CCACCTGCTT GGCGAAAATT 8580 

CTCAGATAAG CGGTCATAAT CTGACATCAG TTTATCCAAA TCCTCACCAG ACTTTTCACC 8640 

CATCTCCAGC TCCATCTGAC GCAGTTGTCT CTCCGTCCGA CGCAAATCAT TAAAGACATG 8700 

AAGCATTTCA TCGTAGATGG TATTTTCAGA CTCAAAACGG CTATCTTGGG CTAGGTAAGA 8760 

CAGAGAAATA TCTTTTTTCT TATTGATTTC TCCGCTAGTT GGCTCCTCTT CTCCAACTAA 8820 

AATCTTCAAA AGAGTAGACT TACCTGCACC ATTTTTCCCA ACAAGAGCAA TCCGATCTCG 8880 

TTCATCAACC TGCAGGTTGA TATTATCGAA AAGAACCTCT CCTGCAAAAG AACGTTCAAT 8940 

TTTATTAGCT TGTAAAATAA TCATACAAGT AGTATAGCAT GTTTCCCTAA GGCATTCAAG 9000 

ATAATCGTAA GTCTTTTAGT ACAACTTTTA TAACATAAAA TAAACTAAAT TATGTATATT 9060 

TTATATTAGA TTACTTCACT ATCTTGTTGG ATTTTCTAAC CAGCTAATCT TGTTTCAAAT 9120 

AGTTATCGCA CAAGTCTATT ATTTAATTCT TTTCATCATT TACGTACGTA TAGCAGATTG 9180 

AAATAAGATG AGAACAAATC GATTGGGAAA GTAAAATTAA TTTCTATAAA TGTTTTAGCA 9240 

ATTGTTTCGT ACT ATT TT AG ATTCAGTCTA CTATATACAA TATTTTCGGA ACATTCAACT 9300 
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TTTTAACTCT ATTTATTACT AGATTTCATA ATTAAAAAAC CTACTGACCA AGCTAGAAAG 9360 

CTTGATACAA TAGGCTTTTT AAAGACTGAT TATTTAACAG CGTCTTTAAG AGCTTTACCA 9420 

GCTTTGAATG CTGGTACTTT AGAAGCTGCA ATTGTCATTT CTTTACCAGT TTGTGGGTTG 9480 

CGACCTTTAC GTTCTGCGCG CTCACGAACT TCAAAGTTAC CAAAACCGAT CAATTGAACT 9540 

T 9541 



(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

TTGACTATCC TATCATGCTT TCTAAGGTCT ACTCAAGAAA ATCATTTTCA AGTTTTCACA 60 

CCTTTCTCAA AAAAGTTAAA AAATTTTCTC AAAAACGCTT GACTCTGACC TAAGGCGAAG 120 

GGTTATACTA TCATTGTAAG GAGGAAATCA TGTACCATAT AAAAGAAGCT GCGCAGCTTT 180 
CAGGTGTCTC TGTCAAGACC CTGCATCACT ATGACAAGAT AGGACTCTTG GTCCCCTTAA . 240 

AGTCGGAAAA CGGCTATCGA ACCTACAGTC AAGAGGATTT GGAACGCCTT CAGGTCATTC 300 

TTTACTACAA ATATCTAGGC TTTTCTTTAG AGAAAATAGC AGAGCTGTTA AAGGAAGAAA 360 

GGACAGATTT ATTGCCCCAT TTGACTAGGC AGTTGGACTA TCTAACTCGC GAAAGGCAAC 420 

ATCTGGATAC CTTGATTTCC ACCTTGCAAA AAACTATTCA AGAACAAAAA GGAGAAAGAA 480 

AAATGACCAT TGAGGAAAAA TTCACGGGAT TTAGCTATCA AGACAATCAA AAATACCACC 540 

AAGAAGCGGT AGAGAAATAT GGTCAAGAAG TCATGGGACA AGCGCTCGAA CGCCAAAAAG 600 

GTCACGAAGA CGAGGCTACG GCCGCCTTTA ACCAAGTCTT TCAAACTTTG GCACAAAATC 660 

TTCAAGTTGG TTTACCTGCA ACAGCAACCG AAAACCAGGA GCAAGCAGCC AAGCTCTTGC 720 

AAGCCATTCG CACTTATGGA TTTGACTGCT CTATTGAGGT ATTCGGTCAT ATCGGTAAAG 780 

GTTACGTCTA CAACCCAGAG TTTAAGGAAA ACATTGACAA GTTTGGTTCT GAAACAGCCC 840 

AGTACACGTC AGATGCCATT GCGGTTTACG TTCAGACAAA TGCAGAATAA ATAGGCTAGG 900 

AATTTCCTAG CCTATTTTTT ACTTCAAATC ATAAAGCCAG TCGTCACCGT TTTTGTAGTA 960 

AAAGAATTCA CTGAGATCTT CTTCTAGAAA CACACGAAGC ATATCAGACA TATCATCGGT 1020 

TGCAAGTTTT AGATGAGAAA GATTTTCAAA GTCCTCCCAC CAAACTTTCC CTTCGTCTGA 1080 

AGACTGGAGT TCACCAGTAA AGTGTTCTGT CTTGTAAAAA AGGACGACAT AACGATAATC 1140 
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CTTGTCGTCA TACCAGTTTT TGATACCACA GAGTTGGGGT TTGGAAATGA TCAGACCAGT 1200 

TTCTTCTTTC ACTTCACGAA TGACAGCATC GACAAAGGAT TCGCCACGTT CAACATGACC 1260 

ACCAGGAAAA GTAATGCCAG ACCAGTCGGG ATTAACTCGG TCTTGGACCA GGACCTTATC 1320 

TCCGTTTTTA ATCATACACA TGTTAACAAA TTCGACTGCC TCTCTTCTGT TCATTCTTCA 1380 

CAACCTTTAA TCTTTAATCA TAATGCAGAC TTCCCGCCAC CCAGCCGGTA CAGAGGGCAG 1440 

AAGTGATGTT AAAGCCACCC GTGTGGGCAT TGATATCCAT AACTTCGCCT GCAAAGTGGA 1500 

GGCCAGGTAC CAGCTTACTT TCAAGGGTTT TAGGATTGAT TTCCTTGAGA CTGACTCCAC 1560 

CCTTGGTAAC AAAGGACTTT GCAAGGGACA TTTTTCCAGT TACAGGAATT TTAAGTTCTT 1620 

TAATGGACTG GACAAGTTGT TCTCGTTCCT TTTCAGTCAG TTGTTTGACT TTTTCAGGAT 1680 

ATCCTTGTAC AAAAAATTCG GCCAAGCGTT CTGGTAACAA GGTTTTTAAA GCGTTTTTCA 1740 

AGGATTTTTC CCGATTTTCT TCTAGAAATG TAACCAAGTC CTTCTCAGAA AGTTGAGGCA 1800 

AAACATCGAG TGAGAGAACC TCCCCACCTT TGACAAAGCT AGACATGCGT AGGGCAGCAG 1860 

GACCTGACAA ACCAAAGTGG GTAAAGAGTA AATCATGAGT GATGACATGC TTACCATAAC 1920 

TTAGGGTCAC ATCGTCCAGA GAAATACCTT GTAAGGCTTT ATGTGGAAAA TCTGTTAATA 1980 

AAGGACTTTC AGCAGCCTCA AGATCGGTGA TGGTATGCTT AAAATGGCGA GCAATCTCGT 2040 

GACCAAAACC AGTCGAACCA GTCGAAGGAT AAGACTTACC ACCTGTTGTG ACAATGAGTT 2100 

TCTCACAAGT GAAGGTTTGA TCCGCTGACT TAAGGACAAA CTGGTCATCT ACTTTTTTAA 2160 

CAGAAACGAT TTCTATTTGA GTAGCAACTT GACCACCTAG TTCGGTGATT TTCTTTTCCA 2220 

AAGCTTCGAT AATAGTCCGA GACTTGTCAC TGGCTGGAAA GACGCGTCCG TGGTCTTCGA 2280 

CCTTAAGTTT AACACCATTT TCTGTAAAAA AGTTGATGAT GTCATGATTA TCGAACTGGG 2340 

AGAAAACACT GTAAAGAAAG CGTCCGTTTC CAGGAATTCC AGCTAGCAGG TTGTCTAAGC 2400 

TACCATTGTT GGTCACATTG CAACGTCCCC CACCAGTCCC AGCTAATTTT TTTCCAAGTT 2460 

TCCGATTTTT TTCGATGAGG AGGGTTTTCT GTCCATAAAA GCTACTGGAA ATCGTAGCCA 2520 

TCATACCAGC AGGTCCCCCA CCGATGACAA TAGTATCAAA ATGTTTCATA GCTCTATTGT 2580 

ACCACAAAAA AACAAGAGAT GATGGTCACC TCTTGTCAAG AATGCAATTA ATCAATTTCA 2640 

TAGCCCATCA GCAAACCGCC CTCTTCTGCA TAGAAACTGC AGAGACCAGA GGTTGGTAGA 2700 

ATTTTAATAT CCGCTTGTGG GAAGGTTTCA CGGATTCGCT CTGAGAGCTG TTGACAACAT 2760 

TTTTCGTTAT TGCGTTGGGC CATGACAATA CGGCCACCAG CATATCCAGC TTTTACTAAC 2820 

TCATCATAGG CAGCTTGAAC TGATTTCTTT GATCCCCTTG CTTTTTGTAG CAATTCGAGA 2880 
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GTCCCAGTTT CACTAGCTTT TCCGACCATA CGAATGTTGA GAAGGCCAAC GACCGTACCG 2940 

ATAAGCTTGC TCAAACGGCC GTTCTTCACC AAGTTATCGA CTTTGGCTAG GACAAAGAGC 3000 

AACTTAGTTT TTTCTTGATA GGCGGTGATA GCTTCAACCA CTTCTTCAAA AGACAAGCCC 3060 
TGGTCAATCA AGTCATTCAA TTTTTCTACG AGTAGGTCAA CTTCACCACC AGCAGATAAA * 3120 

CTATCAATCA CATGAATCTT AGTGTCAGGA TGGTCTTCCA GATAAATATT CTTTGCTAGT 3180 

TGAGCACTAT TGTGACTGCC AGAAAGGGTA CCTGTGATGG TTACTAGGAA AATGTTTTTG 3240 

GCACCTTCAA ATGCTCGCAA ATAGTCATCT GGGCTTGGAC AAGCCGATTT TGAAGCTTCT 3300 

GCAGTTGCAT ACATGGTTTC CATCATTTGG TCAATATCGA GACTGGCGTC ATCAACAAAG 3360 

ACCTGATCAG CTACTTGAAT GGTTAAGGGG ACACTTACAA AGGTTGTGTT AATAGCTGGT 3420 

GTTGGCAGTT GACGATAATC ACAACCAGAG TCAGCAATAA TCTTCCAAGT CATAGAAATT 3480 

CTCCATCTTT GTCAGGAACG AT 3502 



(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12665 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

CGATTGATTT TTTTAAAGCG TTCGATAGAG AATGAGAAAC GAATCCTTAG CAATGGCGGG 60 

AAAGAATTTG GAGTTGAGAA TACAAAACGA TTAACTATGG CTCATATTGT TTTTTATCTC 120 

TCTTGCTTGG TTGAGGCAAT GGTGCACAAG ACAATTTTTG ATGGCATGGG CATGGTTGGT 180 

TTAGTCTTGC TTATTTTTTC TATGCTGATG TTGATGTTGG TGATTCACTT GTTGGGAGAT 240 

ATTTGGACAG TGAAGCTTAT GCTTGTCAAT AATCACAAAT ATGTAGATCA TATCTTGTTT 300 

AGGACAGTAA AACACCCTAA TTACTTTTTA AATATTCTTC CTGAGTTGAT TGGCTTGACC 360 

TTGTTGAGTC ATGCTTATGT GACTTTTGTT TTAGTTTTTC CAGTTTATGC AGTTATTTTG 420 

TATCGACGAA TAGCTGAAGA GGAAAAGCTA TTACATGAAG TTATAATCCC AAATGGAAGC 480 

ATAAAGAGAT AAATACAAAA TTCGATTTAT ATACAGTTCA TATTGAAGTG ATATAGTAAG 540 

GTTAAAGAAA AAATATAGAA GGAAATAAAC ATGTTTGCAT CAAAAAGCGA AAGAAAAGTA 600 

CATTATTCAA TTCGTAAATT TAGTGTTGGA GTAGCTAGTG TAGTTGTTGC CAGTCTTGTT 660 

ATGGGAAGTG TGGTTCATGC GACAGAGAAC GAGGGAGCTA CCCAAGTACC CACTTCTTCT 720 

AATAGGGCAA ATGAAAGTCA GGCAGAACAA GGAGAACAAC CTAAAAAACT CGATTCAGAA 780 
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CGAGATAAGG CAAGGAAAGA GGTCGAGGAA TATGTAAAAA AAATAGTGGG TGAGAGCTAT 840 

GCAAAATCAA CTAAAAAGCG ACATACAATT ACTGTAGCTC TAGTTAACGA GTTGAACAAC 900 

ATTAAGAACG AGTATTTGAA TAAAATAGTT GAATCAACCT CAGAAAGCCA ACTACAGATA 960 

CTGATGATGG AGAGTCGATC AAAAGTAGAT GAAGCTGTGT CTAAGTTTGA AAAGGACTCA 1020 

TCTTCTTCGT CAAGTTCAGA CTCTTCCACT AAACCGGAAG CTTCAGATAC AGCGAAGCCA 1080 

AACAAGCCGA CAGAACCAGG AGAAAAGGTA GCAGAAGCTA AGAAGAAGGT TGAAGAAGCT 1140 

GAGAAAAAAG CCAAGGATCA AAAAGAAGAA GATCGTCGTA ACTACCCAAC CATTACTTAC 1200 

AAAACGCTTG AACTTGAAAT TGCTGAGTCC GATGTGGAAG TTAAAAAAGC GGAGCTTGAA 1260 

CTAGTAAAAG TGAAAGCTAA CGAACCTCGA GACGAGCAAA AAATTAAGCA AGCAGAAGCG 1320 

GAAGTTGAGA GTAAACAAGC TGAGGCTACA AGGTTAAAAA AAATCAAGAC AGATCGTGAA 1380 

GAAGCAGAAG AAGAAGCTAA ACGAAGAGCA GATGCTAAAG AGCAAGGTAA ACCAAAGGGG 1440 

CGGGCAAAAC GAGGAGTTCC TGGAGAGCTA GCAACACCTG ATAAAAAAGA AAATGATGCG 1500 

AAGTCTTCAG ATTCTAGCGT AGGTGAAGAA ACTCTTCCAA GCCCATCCCT GAAACCAGAA 1560 

AAAAAGGTAG CAGAAGCTGA GAAGAAGGTT GAAGAAGCTA AGAAAAAAGC CGAGGATCAA 1620 

AAAGAAGAAG ATCGCCGTAA CTACCCAACC AATACTTACA AAACGCTTGA ACTTGAAATT 1680 

GCTGAGTCCG ATGTGGAAGT TAAAAAAGCG GAGCTTGAAC TAGTAAAAGA GGAAGCTAAG 1740 

GAACCTCGAA ACGAGGAAAA AGTTAAGCAA GCAAAAGCGG AAGTTGAGAG TAAAAAAGCT 1800 

GAGGCTACAA GGTTAGAAAA AATCAAGACA GATCGTAAAA AAGCAGAAGA AG AAG CT AAA 1860 

CGAAAAGCAG CAGAAGAAGA TAAAGTTAAA GAAAAACCAG CTGAACAACC ACAACCAGCG 1920 

CCGGCTCCAA AAGCAGAAAA ACCAGCTCCA GCTCCAAAAC CAGAGAATCC AGCTGAACAA 1980 

CCAAAAGCAG AAAAACCAGC TGATCAACAA GCTGAAGAAG ACTATGCTCG TAGATCAGAA 2040 

GAAGAATATA ATCGCTTGAC TCAACAGCAA CCGCCAAAAA CTGAAAAACC AGCACAACCA 2100 

TCTACTCCAA AAACAGGCTG GAAACAAGAA AACGGTATGT GGTACTTCTA CAATACTGAT 2160 

GGTTCAATGG CGACAGGATG GCTCCAAAAC AATGGCTCAT GGTACTACCT CAACAGCAAT 2220 

GGCGCTATGG CGACAGGATG GCTCCAAAAC AATGGTTCAT GGTACTATCT AAACGCTAAT 2280 

GGTTCAATGG CAACAGGATG GCTCCAAAAC AATGGTTCAT GGTACTACCT AAACGCTAAT 2340 

GGTTCAATGG CGACAGGATG GCTCCAATAC AATGGCTCAT GGTACTACCT AAACGCTAAT 2400 

GGTTCAATGG CGACAGGATG GCTCCAATAC AATGGCTCAT GGTACTACCT AAACGCTAAT 2460 

GGTGATATGG CGACAGGTTG GGTGAAAGAT GGAGATACCT GGTACTATCT TGAAGCATCA 2520 
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GGTGCTATGA AAGCAAGCCA ATGGTTCAAA GTATCAGATA AATGGTACTA TGTCAATGGC 2580 

TCAGGTGCCC TTGCAGTCAA CACAACTGTA GATGGCTATG GAGTCAATGC CAATGGTGAA 2640 

TGGGTAAACT AAACCTAATA TAACTAGTTA ATACTGACTT CCTGTAAGAA CTCTTTAAAG 2700 

TATTCCCTAC AAATACCATA TCCTTTCAGT AGATAATATA CCCTTGTAGG AAGTTTAGAT 2760 

TAAAAAATAA CTCTGTAATC TCTAGCCGGA TTTATAGCGC TAGAGACTAC GGAGTTTTTT 2820 

TGATGAGGAA AGAATGGCGG CATTCAAGAG GCTCTTTAAG AGAGTTACGG GTTTTAAACT 2880 

ATTAAGCCTT CTCCAATTGC AAGAGGGTTT CAATCTCTGC CAGGGTGCTG GCTTGCGAAA 2940 

TGGCTCCACG GAGTTTGGCA GCGCCAGATG TTCCACGGAG ATAGTGAGGA GCGAGACCGC 3000 

GGAATTCACG AACTGCGACG TTTTCTCCTT TGAGGTTAAT CAATCGTTTC AAGTGTTCGT 3060 

AGGCGATCTT CATCTTGTCT TCAAAGGTCA AATCAGGTAG GATTTCTCCT GTTTCAAAGT 3120 

AATGGTTGAT TTGGTTGAAG AGGTAAGGAT TTCCCATGGC AGCTCGGCCA ATCATGACTG 3180 

CGTCAGCACC AACTTCTTCG ATGCGTTGCT TGGCTTCTTG GACAGTACGG ATATCACCGT 3240 

TGGCGATGAA TGGAATCTTG GTTAGAGCTT GGGCAACCTT GTAAAGGGTC TCAAGGTCTG 3300 

CGTGGCCAGT ATACATTTGT TCACGGGTAC GGCCATGCAT GGCGAGGGCA GAAACACCTG 3360 

CAGCTTCAGC AGCGAGAGCA TTTTCTACTG CAAGAGATGG GTCCGCCCAG CCGGTACGCA 3420 

TTTTGACAGT AAGTGGGATA TCAAGGACAG ACTGGACCTT GTTGATGATG GAGTAAATCT 3480 

TGTCTGGATC CTTGAGCCAC ATAGCACCAG CTTCGTTCTT CACGATTTTG TTGACAGGGC 3540 

AGCCCATGTT GATATCGACG ATATCGGTCT TGGTGTTTTC TTGGATGAAT TCTGCTGCGC 3600 

GTGCTAGGCT GTCTTCATCG CTACCAAAAA GTTGGATAGA GACAGGGTTT TCGCCCTCAT 3660 

CGATATGAAG CATGTGCAGG GTTTTTTCGT TGTTGTATTG GATTCCCTTG TCAGAGACCA 3720 

TTTCCATTAC AACGAGTCCA GCTCCGAGCT CCTTTGCGAT AGTACGAAAG GCTGAGTTGG 3780 

TCACGCCAGC CATAGGCGCT AAAACGGTAC GATTGGGAAT CTCAATATTG CCAATCATAA 3840 

AAGGTGTATT AAGATTTGTC ACGAATGAGT TCCTCCAGGT CCTTTTCATC AAAGTTGTAA 3900 

GTAGTTTGGC AGAATTGACA AGTGATTTCT GCCCCGTGGT CTTCCTCTTT CATTTCCTGT 3960 

AAGTCTGAGC TTGGAAGGCT GGCAAGAGCG TTCATAAAGC GTTCATGGCT ACAGTCACAT 4020 

TGGAAACGGA TTTCTTCTTC AGAAAGACGC TTGTAGGCTT CGTCCCCGTA GATAGCCTTG 4080 

AGGAGGGCTT CGATATGGTC GTCGCTTTCG AGAAGAGTAG AGATAGCTGG CATTTCTTGG 4140 

ATGCGTTTTT CAAAGCGAGC AATCTCTTCT TTCTTGGCTC CTGGCAAGAC TTGAACTAGG 4200 

AAACCACCTG CAACCTTGAC CTTGTCTTCC TCGTCCAAAA GGACATTGAG GCCGACCGCT 4260 

GAAGGCGTTT GTTGGCTTTC AGTAAGGTAA AAGGCAAGGT CTTCACCGAT TTCTCCAGAG 4320 
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ATGAGGGGAG TTATAGAGTT GTAAGGATTT CCAGTACCGT AGTCTGTGAT AACGAGGAAT 4380 

TGACCATTTC CAACAAAAGG TCCGACTAGG ACTTCACCAG TCGCAGTCTT TTTGATGTCA 4440 

ACACCAGGAT TTTGAACATA GCCTTTGACG TTCCCCTTGG TATCAGCGAC GGTGATAATA 4500 

GCACCTAGAG AGCTAGATCC CAACACCTTA ACTGTAAGTT TGGTATTTCC TTTTTCATTG 4560 

GCTGCGAGAA TCTGGCTAGC GATAAGAGTT CGACCAAGCG CTACAGTTGA GCTAGCTTGG 4620 

GTTTGATGTT TTTCTTGAGC AGTGCGGACG GTTTCAGTGC TATCAAGGAC AAAAGCACGA 4680 

AAGGcTCCGC TTTCTGATAT AGTTTTAATA ATTTTATCCA TAGCTACTAT TTTAGCATAA 4740 

AAATGCCCAA AGGGGGAGCC GTGTGTTTAC TGATTTTCAG GATAATGGAC CAGGAAATCA 4800 

GCATGAAAAT AAAAAGAGAA ACAGATTATT TTAGCATTTG TCAGATTTAT GCTATGCTTA 4860 

AGGTAGAAAA TGAAAGGGAT AACAAATGTA TTTAGGAGAT TTGATGGAGA AAGCCGAGTG 4920 

TGGTCAATTT TCAATACTTT CCTTTCTATT ACAAGAGTCT CAGACGACCG TCAAGGCTGT 4980 

AATGGAAGAA ACAGGATTTT CAAAAGCAAC CCTAACCAAA TATGTCACCC TGCTCAATGA 5040 

CAAGGCTTTG GATAGTGGCT TAGAGCTGGC TATTCACTCA GAAGATGAAA ATCTGCGTCT 5100 

GTCTATCGGT GCAGCTACCA AGGGGAGAGA TATTCGGAGC TTGTTTTTGG AGAGTGCTGT 5160 

TAAATACCAG ATTTTGGTTT ATCTTCTCTA CCACCAACAG TTTTTAGCCC ATCAGCTGGC 5220 

TCAAGAATTG GTGATTAGCG AGGCTACGCT TGGTCGTCAC TTGGCTGGTT TAAATCAGAT 5280 

TTTGTCAGAA TTTGATTTAT CCATCCAAAA TGGCCGTTGG CGAGGTCCAG AGCATCAGAT 5340 

TCACTATTTC TATTTCTGTC TTTTCCGAAA GGTCTGGTCG AGTCAGGAAT GGGAAGGTCA 5400 

CATGCAGAAA CCAGAGAGAA AACAGGAGAT TGCCAATTTA GAGGAAATCT GCGGTGCAAG 5460 

TTTGTCTGCG GGGCAGAAAT TGGACTTGGT TCTCTGGGCT CACATCAGTC AACAACGTCT 5520 

TCGGGTCAAT GCTTGTCAGT TTCAAGTCAT AGAAGAGAAA ATGCGAGGGT ATTTTGACAA 5580 

TATCTTTTAT CTTCGTTTGC TGAGAAAGGT TCCGTCCTTT TTTGCTGGGC AACATATTCC 5640 

ACTAGGAGTT GAGGATGGTG AGATGATGAT ATTCTTCTCT TTTCTCCTAT CTCATCGCAT 5700 

TCTTCCTCTT CATACTATGG AGTATATTCT TGGTTTTGGA GGGCAGTTGG CAGATTTACT 5760 

GACGCAATTG ATTCAAGAAA TGAAGAAGGA GGAACTATTG GGGGATTATA CAGAGGACCA 5820 

TGTCACCTAT GAACTCAGTC AGCTTTGTGC TCAAGTCTAT CTCTATAAGG GCTATATTTT 5880 

ACAGGATCGC TACAAGTACC AGTTAGAGAA TCGTCATCCA TATTTACTGA TGGAACATGA 5940 

TTTTAAAGAG ACAGCAGAGG AGATTTTTCA TGCTCTACCT GCTTTTCAAC AGGGGACAGA 6000 

TTTAGATAAG AAGATTCTCT GGGAATGGCT CCAGTTAATC GAATATATGG CTGAAAACGG 6060 
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TGGCCAGCAT ATGCGGATTG GTCTGGATTT GACATCTGGT TTTCTTGTCT TTTCAAGGAT 6120 

GGCAGCCATT TTGAAACGGT ATTTGGAATA CAATCGTTTT ATTACCATTG AAGCTTATGA 6180 

CCCTAGTCGG CATTATGATT TGCTGGTTAC CAATAACCCG ATTCATAAGA AGGAACAGAC 6240 

ACCAGTCTAT TATTTAAAAA ATGACTTGGA TATGGAGGAT TTGGTAGCGA TTCGCCAGTT 6300 

ATTATTCACT TAAAAGGCTT GGTTAATCCA GGTCTTTTTT GTGAAATTCA CACAATCTCC 6360 

TCACATTTTT TTAAAAATTA AAAAAAGTTG ATAAACAAGA AAGCGCTTTA TTTTGTATAC 6420 

TAGTAAGTGT AAAGAGGAAA CACCTCAAGA TCTTTATCAG GAGGACAGTA CATGTCACAA 6480 

GAAAAATACA TCATGGCCAT TGACCAGGGA ACTACAAGTT CTCGTGCCAT CATTTTCAAC 6540 

AAAAAAGGGG AAAAGGTTAG CTCGAGTCAA AAAGAGTTTA CCCAGATTTT CCCTCAGGCA 6600 

GGTTGGGTTG AGCACAATGC CAATGAAATT TGGAACTCTG TTCAGTCAGT TATTGCGGGT 6660 

GCTTTCATCG AAAGTGGTGT CAAGCCAAAT CAAATCGAGG CAATCGGGAT TACCAACCAA 6720 

CGTGAAACAA CGGTTGTCTG GGATAAGAAA ACAGGACTTC CTATCTACAA TGCTATCGTT 6780 

TGGCAGTCAC GCCAGACAGC ACCTTTGGCT GAGCAACTAA AAAGCCAAGG TTATGTGGAA 6B40 

AAATTCCATG AAAAGACTGG TTTGATTATT GATGCTTACT TCTCTGCTAC CAAGGTTCGT 6900 

TGGATTTTGG ATCATGTAGA AGGTGCTCAA GAGCGAGCAG AAAAAGGGGA ATTGCTCTTT 6960 

GGTACTATCG ATACTTGGTT GGTTTGGAAA TTGACTGACG GTGCGGCTCA CGTGACTGAC 7020 

TACTCAAATG CAGCTCGTAC CATGCTTTAT AACATTAAAG AACTCAAATG GGATGATGAG 7080 

ATTTTGGAAA TCCTTAACAT TCCGAAGgCT ATACTTCCAG AAGTTCGTTC TAACTCCGAA 7140 

ATCTACGGCA AGACAGCTCC ATTCCATTTC TACGGTGGAG AGGTGCCAAT CTCAGGTATG 7200 

GCTGGGGACC AACAAGCAGC CCTCTTTGGA CAGTTGGCTT TTGAGCCAGG TATGGTTAAG 72 60 

AATACTTATG GAACAGGCTC TTTCATCATC ATGAATACTG GGGAAGAGAT GCAGTTGTCT 7320 

GAAAACAACC TCTTGACAAC CATTGGTTAC GGAATCAACG GTAAGGTTTA TTATGCCTTG 7380 

GAAGGTTCTA TCTTCATCGC AGGAAGTGCT ATTCAGTGGC TTCGTGACGG TCTTCGCATG 7440 

GTTGAAAATT CACCAGAATC TGAAAAATAC GCTCGTGATT CTCACAACAA CGATGAAGTT 7500 

TATGTCGTTC CAGCCTTTAC AGGTCTAGGC GCTCCATACT GGAACCAAAA TGCTCGTGGT 7560 

TCCGTCTTTG GTTTGACTCG TGGAACAAGC AAAGAAGACT TTATCAAGGC GACTTTGCAA 7620 

TCTATTGCTT ATCAAGTGCG TGATATCATC GACACCATGC AAGTGGATAC TCAGACCGCC 7680 

ATTCAAGTAC TGAAGGTGGA TGGTGGTGCA GCCATGAACA ACTTCCTCAT GCAGTTCCAG 7740 

GCGGATATTT TAGGCATTGA CATTGCACGT GCTAAAAACC TGGAAACAAC AGCTCTAGGA 7800 

GCGGCCTTCC TAGCAGGTTT GTCAGTAGGG TACTGGAAAG ACTTGGACGA GTTGAAACTC 7860 
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TTGAACGAGA CAGGAGAACT CTTTGAGCCA TCTATGAACG AATCTCGCAA GGAACAACTC 7920 

TACAAGGGCT GGAAGAAGGC TGTGAAAGCA ACTCAAGTCT TTGCGGAAGT AGACGACTAA 7980 

TACTGGCAGA ATAAAGCGAT TTATTTAGAA AGTGTGTAAA TATGGAATTT TCAAAGAAAA 8040 

CACGTGAATT GTCAATTAAA AAAATGCAGG AACGTACCCT GGACCTCTTG ATTATCGGTG 8100 

GAGGAATCAC AGGAGCTGGT GTAGCCTTGC AGGCGGCAGC TAGCGGTCTT GAGACTGGTT 8160 

TGATTGAAAT GCAAGACTTT GCAGAAGGAA CATCTAGTCG TTCAACAAAA TTGGTTCACG 8220 

GAGGACTTCG TTACCTCAAA CAATTTGACG TAGAAGTGGT CTCAGATACG GTTTCTGAAC 8280 

GTGCAGTGGT TCAACAAATC GCTCCACACA TTCCAAAATC AGATCCAATG CTCTTACCAG 8340 

TTTACGATGA AGATGGAGCA ACCTTTAGCC TCTTCCGTCT TAAAGTAGCC ATGGACTTGT 8400 

ACGACCTCTT GGCAGGTGTT AGCAACACAC CAGCTGCGAA CAAGGTTTTG AGCAAGGATC 8460 

AAGTCTTGGA ACGCCAGCCA AACTTGAAGA AGGAAGGCTT GGTAGGAGGT GGAGTGTATC 8520 

TTGACTTCCG TAACAACGAT GCGCGTCTCG TGATTGAAAA CATCAAACGT GCCAACCAAG 8580 

ACGGTGCCCT CATTGCCAAC CACGTGAAGG CAGAAGGCTT CCTCTTTGAC GAAAGTGGCA 8640 

AGATTACAGG TGTTGTAGCT CGTGATCTCT TGACAGACCA AGTGTTTGAA ATCAAGGCCC 8700 

GTCTGGTTAT TAATACAACA GGTCCTTGGA GTGATAAAGT ACGTAATTTG TCTAATAAGG 8760 

GAACGCAATT CTCACAAATG CGCCCAACTA AGGGAGTTCA CTTGGTAGTA GATTCAAGCA 8820 

AAATCAAGGT TTCACAGCCA GTTTACTTCG ACACAGGTTT GGGTGACGGT CGTATGGTCT 8880 

TTGTTCTCCC ACGTGAAAAC AAGACTTACT TTGGTACAAC TGATACAGAC TACACAGGTG 8940 

ATTTGGAGCA TCCAAAAGTA ACTCAAGAAG ATGTAGATTA TCTACTTGGC ATTGTCAACA 9000 

ACCGCTTCCC AGAATCCAAC ATCACCATTG ATGATATCGA AAGCAGCTGG GCAGGTCTTC 9060 

GTCCATTGAT TGCAGGGAAC AGTGCCTCTG ACTATAATGG TGGAAATAAC GGTACCATCA 9120 

GTGATGAAAG CTTTGACAAC TTGATTGCGA CTGTTGAATC TTATCTCTCC AAAGAAAAAA 9180 

CACGTGAAGA TGTTGAGTCT GCTGTCAGCA AGCTTGAAAG TAGCACATCT GAGAAACATT 9240 

TGGATCCATC TGCAGTTTCT CGTGGGTCTA GCTTGGACCG TGATGACAAT GGTCTCTTGA 9300 

CTCTTGCTGG TGGTAAAATC ACAGACTACC GTAAGATGGC TGAAGGAGCT ATGGAGCGCG 9360 

TGGTTGACAT CCTCAAAGCA GAATTTGACC GTAGCTTTAA ATTGATCAAT TCTAAAACTT 9420 

ACCCTGTTTC AGGTGGAGAA TTGAACCCAG CAAATGTGGA TTCAGAAATC GAAGCCTTTG 9480 

CGCAACTTGG AGTATCACGT GGTTTGGATA GCAAGGAAGC TCACTATCTG GCAAATCTTT 9540 

ACGGTTCAAA TGCACCGAAA GTCTTTGCAC TTGCTCACAG CTTGGAACAA GCGCCAGGAC 9600 



WO 98/18931 



PCT/US97/19588 



914 

TCAGCTTGGC AGATACTTTG TCCCTTCACT ATGCAATGCG CAATGAGTTG ACTCTTAGCC 9660 

CAGTTGACTT CCTTCTTCGT CGTACCAATC ACATGCTCTT TATGCGTGAT AGCTTGGATA 9720 

GTATCGTTGA GCCAATTTTG GATGAAATGG GACGATTCTA TGACTGGACA GAAGAAGAAA 9780 

AAGCAACTTA CCGTGCTGAT GTCGAAGCAG CTCTCGCTAA CAACGATTTA GCAGAATTAA 9840 

AAAATTAAGA AAAAATAAAA GAGGTGGAGG GCAGCATTCC TTGTCGCCCG TCCCTTCTTT 9900 

TTAATGGAGA CAGAAAGATG ATGAATGAAT TATTTGGAGA ATTTCTAGGG ACTTTAATCC 9960 

TGATTCTTCT AGGAAATGGT GTTGTTGCAG GTGTGGTTCT TCCTAAAACC AAGAGCAATA 10020 

GCTCAGGTTG GATTGTGATT ACTATGGGTT GGGGGATTGC AGTTGCGGTT GCAGTCTTTG 10080 

TATCTGGCAA GCTCAGTCCA GCTTATTTAA ACCCAGCTGT GACCATCGGT GTGGCCTTAA 10140 

AAGGTGGTTT GCCTTGGGCT TCCGTTTTGC CTTATATCTT AGCCCAGTTC GCAGGGGCCA 10200 

TGCTGGGTCA GATTTTGGTT TGGTTGCAAT TCAAACCTCA CTATGAGGCA GAAGAAAATG 10260 

CAGGCAATAT CCTGGCAACC TTCAGTACTG GACCAGCCAT CAAGGATACT GTATCAAACT 10320 

TGATTAGCGA AATCCTTGGA ACTTTTGTTT TGGTGTTGAC AATCTTTGCT TTGGGTCTTT 10380 

ACGACTTTCA GGCAGGTATC GGAACCTTTG CAGTGGGAAC TTTGATTGTC GGTATCGGTC 10440 

TATCACTAGG TGGGACAACA GGTTATGCCT TGAACCCAGC TCGTGACCTT GGACCTCGTA 10500 

TCATGCACAG CATCTTGCCA ATTCCAAACA AGGGAGACGG AGACTGGTCT TACGCTTGGA 10560 

TTCCTGTTGT AGGCCCTGTT ATCGGAGCAG CCTTGGCAGT GCTTGTATTC TCACTTTTCT 10620 

AGTTTATACT CTTCGAAAAT CAAATTCAAA CCACGTCAGC GTCGCCTTAC CGTACTCAAG 10680 

TACAGCTTGC GGCTAGCTTC CTAGTTTGCT CTTTGATTTT CATTGAGTAT TAGAAAACAA 10740 

TTATGTTGAT AGAGCTTGGG CAAGAGCCCA ATTTCAGCAA AAAATGAAGT AAATCTTCTC 10800 

ATAATAAAAC GCATCATATC AAGCACGAAA ATTCCACGAG GTCAACTACA GTCAGAAAGC 10860 

TGAACAACAA GCCAAAACGC CCAAAAAAGG CGGCAAAAAG CAAGCACCTG CAAGCAACGT 10920 

GCCGAAATGG TCAAATCCTG ATTATGTCAA CGAATTAGAC CCAAAAATCG TTGATATGCT 10980 

AGTAGAATTT CACAAGTCAC AAGGCACTTT GGAAACTCCC GAGGCGCAAG CAGAAATCGC 11040 

CCAAAAACGT GAAGAAATCG AGCAAAGGAG AGCTGAGCTT GAGGGTAAAA AACAAGAGCT 11100 

TTTGAACCGC TTGAACAAAT AGAGTTTCGC AAGTATTATG CTTACAAATT ACTTGAGCAA 11160 

TTAACTAAAA TATAAACCCT GCCTTTATAT CTAGGCAGGG TTTATATTTT AGAAATTCAC 11220 

GTAGGTTGTT, ACGGTTTTTA CATACCCAGT ATAGTTTGAG TTTCTATAGT ATTCAGTGAT 11280 

AAACTTCCAT TTTCTTTGAG CAACATGGAT ATAAGTACTT GTTATGTAGT ATGGATATGG 11340 

GCTTTGTGAA TCCAAGTAAG ACTGATAAGC TTGTATACCA AAATATGCTC CACCAATTAT 11400 
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TGCACCCCAT GGACCCCCCA ATAAAGCACC TATCCTACCA ATCATATAAC TGATTCCAGC. 11460 

ACCAGTCATG AAGTTAGCGA ATGTGTTAGC TTGTTTATTC CCATGTATTG TGTTGACGTA 11520 

ATTCCAAACA TTAGGATCGT ATGATCTAAA AGATATATTT AGGTCGATTT CATTCTTTTG 11580 

ATAAGCCATA TAAAATGCCC CATTGATATA GACGCCGTCA GCACGTCGTT CAATAGTGTC 11640 

TACACTTCCA TCTGGATTGA CAACCTCAAG AACTTCATCG CTTAAAATAT TTACTTGCGT 11700 

ATCTCCGAAC CGCACTGATG AGCCATTCTC AAACTGAGCC TCACCAGATA CAACTTTAGA 11760 

GTTTGCCGAT AAGCTATCAT CAGCAAAAAC AAACAAGCGA CGGGGAAATG CTAGACATAC 11820 

AGAAAACAGA CATAACTAGC AAACACATGC ATTTAAACAT CTTAGACATA ACGGAAACTC 11880 

CTTTGTATTT TTGATTTTTT TCAACTTTTA TTATACAATA AAACCAAATA AAAAGAAAGC 11940 

GGTAACAATA TGCTTAATGC GAAAATTTTT TATATATTTT TATGTTTGAT CGTTATCGAA 12000 

ACTACAGGCT TGTTGTTGTT GAAAAGAGGT CTCGAAATGG GTTATTTAGA CACAGAAGCT 12060 

ATTATCCTCG CAGTTTTTTC ATTTGCTTTT TACAACCTAT GTTCATTCGC TTGGGTCTGC 12120 

TCTACAATAA AAAACAATAA AAAATAAATA GACGTATTTT CAAAAAAAAC maAATGCATA 12180 

TTTATATTAG CAAAACGACG ATTTAAATCG TCGTTTTTTT GTAGTACGAC GGGCATGTCG 12240 

TATATCTGAG GTGTAAGTCC TCAGCCTGAC TATCGTGAGG TAGCAGGGAG AGGAAGGGAT 12300 

AGCGAAATCG TGGCTCTACG AACAGGAACG TGATAGTAAG GCGTATATAG CGGATAAGGA 12360 

GGCTTCAAAC TCTAAAGTCC AAAAAGGTAG TCGTAACCTA TATGTGTAAA TCACGAGAGT 12420 

AATTGAATTC GGACTAAGGT TTGTGTGAAA AAGATAAATC TTTCTAGAGT CTAAAGACTC 12480 

TGCGTCAGAT TTCCTATTTT CACTGTAACC TTTTAACGTC CTCATATCTT GTATAAACGA 12540 

GGAAAGATGT ACGACTTATC CCGTGAGGTT TCATGAGCGT GAAAGCGTAG TAACAACGAA 12600 

TCATGAGAAG TCAGCCGAGC CCATAGTAGT GAGGAAACTT CCGTAATGGA AGTGGAGCGA 12660 

AGGGG 12665 
(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5305 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(xi) . SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

CGCTAATCAC TACAATCATT TTATTGTACT TTTTCACTCT CAAGAAAAGC AAGAAGTATT 60 
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CATTTTAGTT TCATTTAGTA 
TTGGTATGCT CCTGCTTTCA 
AAAGCATTAT ATAATAGTGA 
AAATTCGTTT AAAAGATCTT 
GAAGTTTATT TCAAACAACC 
AATTAATGGG GTTATAATAA 
AGATGAGTAT TAAAACTATA 
CTCGGGTAGA GCGTGATTTA 
AGACTATTGG TTTAATAGAG 
TTTGCAGATG TTTAGGGAAA 
AGATTTTATT ATTCTCAATT 
TCTGAAAGTT TCTATATCTG 
GCACCAAGCC TTTTTAATTC 
TTTTTCAATC GTGCCCGTTA 
TTGGGTTGTT TTTTCCTAAC 
TTCAACATAG AAATTT ATC A 
ATTACTTATA TCATTTACCT 
GGCGAATGGG CTCAGAAAAA 
CTTGTTACTC TTTTCTCAAT 
ACATGAAAGT TCTTGAAAAT 
TCCAGATGAT AGTTTGACAT 
CTTTGACCAA GGGATTCACA 
TTTTAAGAAC GGCTATGCGG 
CTTGCGTTTT TCAGATAGTG 
CTTGGATTAC CTTCGCAATT 
TTTGGTTTTT GCTAATGAAC 
GGTCGAAACG GCTCTTTTGA 
TCGTATTCGT TCGGTTATCG 
AGAAATGGAT GCGGCCATCT 
CAGCAACGTG CGTGCGGGTA 
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TTATTTTGCA TACCTAAAAT ACAGTAAAAA ATCAGTCATC 120 

CTATTCAACA CGTTTTTGAC TTATACTAGG CTCATTTCCA 180 

TATGAAACCA ACTAAACTAA ACAAGAAATA TAAGCAATAA 240 

ACTAAAGCTA ATACTAAATA AAAATAAAAG AGTAAACTAG 300 

TAAAATACTG ATTTTCGGCT GAAGATAATA CTGGAGTGCA 360 

ATAGCTGATA GCTTGTGTTG GTTTTGGATT TTTTAAGAGT 420 

AGGAGGACGA AGGTGGCTAA AAATTT AAAA TTAAAATTAG 480 

ACACAAGGTC AACTGGCAGA GGCTGTCGGG GTGACACGCC 540 

GCGGGAAAAT ACAATCCCAG TCTCTCGCTC TGCCAGTCTA 600 

ACCCTAGACC AACTATTTTG GGAGGAAGAA GATGAAAAAT 660 

ACTAGACGAA AGAGAAGAAC AACTGTTCAA TAAAGCGGGC 720 

CATTGCTTTG TCGCTCCTAT CTTATATCAT TTCAGTATTA 780 

TAATATGCTG CTAATCGTTA TCATCATAGG GACATTTTAC 840 

TCTGGGAGTG ACCTACTATG GTCGTTTTCA TTTTACGATT 900 

CTTGGCTATT ACGGCTCTTT TGATGTTGCA GAATTATCAA 960 

GCACAATCCT TTGAATTTTA AATACCTGTC TGCTTGGGTC 1020 

TCCGTGGATC TTTATTGGCA ATCTTGGTCT TAAGAGCTAT 1080 

ATTTGAACAA GATATGGATG AATTGGAGAG TGGAGAATAG 1140 

CCAGCTAAAA TGTGATATAA TAGTACTAAT TTATTGGAAT 1200 

TTTCATGGGT TTCTAGCTAA GGAAGTAGGA AAAGTATGTA 1260 

TGCACACGGA CTTGTACCAG ATCAACATGA TGCAGGTTTA 1320 

ATAAGAAGGC GGTCTTTGAG GTGTATTTCC GCCAACAGCC 1380 

TTTTTGCAGG TTTAGAAAGA ATTGTGAACT ATCTTGAAGA 1440 
ATATAGCCTA TTTGGAGTCG CTTGGTTATC ATGGGGCGTT . 1500 

TCAAGTTGGA GTTGACCGTT CGTTCTGCCC AAGAAGGGGA 1560 

CGATTGTGCA GGTGGAAGGA CCTCTAGCCC AATGTCAGTT 1620 

ACATCGTCAA CTACCAGACT TTGGTGGCGA CGAAGGCAGC 1680 

AAGATGAACC CTTGATGGAG TTTGGGACAC GTCGGGCTCA 1740 

GGGGAACACG CGCAGCTGTG ATTGGTGGCG CCAATGGAAC 1800 

AGCTCTTTGA CATTCCTGTT TTGGGAACCC ATGCCCATGC 1860 
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CTTGGTACAG GTTTATGGCA ATGACTATGA AGCTTTCAAG GCTTACGCTG CGACCCACAA 1920 

AAATTGTGTC TTTCTTGTGG ATACCTATGA CACCCTTCGC ATCGGTGTAC CAGCTGCCAT 1980 

TCAGGTGGCG CGTGAGCTGG GTGATCAGAT TAACTTTATG GGTGTGCGGA TTGACTCTGG 2040 

GGATATTGCC TACATTTCTA AGAAAGTCCG TCAGCAACTG GATGAGGCTG GATTTACAGA 2100 

GGCTAAGATT TATGCTTCTA ATGATCTAGA TGAAAATACC ATCCTTAACC TCAAGATGCA 2160 

AAAGGCCAAG ATTGATGTCT GGGGTGTGGG TACCAAGCTG ATTACAGCCT ATGACCAGCC 2220 

GGCTCTTGGG GCGGTTTACA AGATTGTTGC AATCGAAGAT GAAACTGGTC AGATGCGCAA 2280 

TACGATTAAG CTGTCTAATA ATGCTGAAAA AGTTTCTACG CCAGGTAAGA AGCAGGTGTG 2340 

GCGCATTACC AGTCGTGAAA AAGGCAAGTC AGAAGGCGAC TATATCACTT ATGATGGTGT 2400 

GGATATTAGC GACATGACAG AAATCAAGAT GTTCCATCCG ACCTATACAT ACATCAAGAA 2460 

GACGGTTCGT AATTTTGATG CCGTTCCTCT CTTGGTGGAT ATCTTCAAAG AAGGAATATT 2520 

AGTTTACAAC TTGCCTAGTT TGACTGACAT TCAGGATTAT GCCCGTAAGG AATTTGACAA 2580 

GTTGTGGGAT GAGTATAAGC GTGTGCTCAA TCCGCAGCAC TATCCAGTGG ATTTGGCGCG 2640 

TGATGTATGG CAAGATAAGA TGGACTTGAT TGATAAGATG CGCAAGGAAG CCCTTGGTGA 2700 

AGGAGAAGAA GAATGAGTTT GCAAGAAACG ATTATCCAAG AGCTGGGTGT CAAACCAGTG 2760 

ATTGATGCCC AGGAAGAAAT CCGTCGTTCT ATTGATTTCT TAAAAAGATA TCTGAAAAAA 2820 

CATCCCTTCC TAAAAACCTT TGTACTAGGG ATTTCTGGGG GACAAGACTC AACCTTGGCA 2880 

GGACGTTTGG CGCAATTAGC TATGGAAGAA CTGCGAGCTG AAACGGGAGA CGATAGCTAC 2940 

AAATTTATCG CTGTCCGCCT GCCATACGGA GTGCAAGCTG ATGAAGCAGA TGCTCAAAAA 3000 

GCCCTAGCCT TCATCCAGCC AGATGTCAGC TTGGTTGTGA ATATCAAGGA ATCAGCTGAT 3060 

GCCATGACAG CTGCAGTTGA AGCGACAGGT AGTCCTGTTT CAGACTTCAA CAAGGGGAAT 312 0 

ATCAAGGCAC GTTGCCGTAT GATTGCTCAG TATGCCCTTG CTGGTTCCCA TAGCGGAGCG 3180 

GTCATTGGAA CAGACCACGC CGCGGAAAAT ATCACAGGTT TCTTTACCAA GTTTGGTGAC 3240 

GGCGGTGCGG ATATTCTCCC TCTTTACCGC CTCAATAAAC GCCAAGGAAA ACAGCTCTTG 3300 

CAGAAACTTG GCGCAGAGCC AGCCCTTTAT GAAAAAATCC CAACGGCAGA CCTAGAAGAA 3360 

GATAAACCAG GCCTAGCTGA CGAAGTCGCA CTTGGAGTCA CCTACGCAGA GATTGACGAC 3420 

TACCTAGAAG GCAAAACAAT CAGCCCAGAA GCTCAAGCGA CCATTGAAAA CTGGTGGCAC 3480 

AAAGGCCAAC ACAAACGCCA CTTACCCATC ACCGTATTTG ATGACTTTTG GGAGTAAAAA 3540 

GGTCCGGGGG ACCTTTTTAG CTTCTTGCCC TGAAATTAAA AAGCAAGAAA AACCTCCACT 3600 
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GGAGGTTTTC AGCCTCTCAT CTTGAAATAA GAAAGTGAGA GAAGGTCTGG GGGATCTTGA 3660 

ACCCCGAGTT TAGAAATAAG AAAATGAGGC AGATTCAGTA ACTCGAAGAG TTCGATTTCA 3720 

TCGTCTTACC CCTGCAACGA TGACTAGGTT TGAAAAAGCT TGCTAGAGCG CATTTCAAAC 3780 

CAGGCAGCAA CTGCGTCAAG AAATTAGAAG ACAAACTCGT TTTCTAGCTG TTACTGAGTT 3840 

GAGCCTTTTT ACTACGAGTA TAGAAATAAG GAAGTGAGGT AGCATCATGA AATCTATCGG 3900 

TACGCAAATA TTACAGACAG AACGTTTGAT TTTAAGAAGA TTTGTGGAGA GTGATGCAGA 3960 

AGCCATGTTT CAAAATTGGG CTTCATCCGC TGAGAATCTG ACCTATGTTA CCTGGGATCC 4020 

CCATCCTGAT GTCGAAATCA CTCGAAACTC GATTTGCAAT TGGGTTGCTT CCTATACTAA 4080 

TCTCAACTAT TATAAATGGG CCATTTGTCT AAAAGAAAAC CCAGAGCAAG TAATAGGAGA 4140 

TATCAGCATT GTTAAGATAG ACGAGGCTGA TTTAAGCTGT GAAATTGGCT ATGTGTTAGG 4200 

CAAGGCTTAC TGGGGAAATG GTATGATGAC AGAGACTTTG AAAGCTATCT TGGACTTTTG 4260 

TTTTACTCAA GCAGGTTTTC AAAAGGTCAG AGCACGTTAT GCCAGTCTCA ACCCAGCTTC 4320 

AGGTCGTGTC ATGGAAAAGG CTGGAATGTC CTATCTACAA ACCATTGTTA ATGGTGTAGA 4380 

GAGAAAAGGC TATCTTGCGG ATCTTATTTA TTATGGTATA AGTAGGGAAG AATGTTGAAT 4440 

TCTATTTTCT GTTTCTATCG AAGTCAACTA TTTATTGTAA ATATAATAAT TAGCATTCCA 4500 

AGTTTATTTG AAACTTTAAA ATAGCATATT GATTAGTACA AGACAGATGT TCTAGTTCCT 4560 

TCTTTAATCT GGTTTAGTGT TAGTTAAAAA ATCGCTTTAA GCTTGTAACT AAGAGGGAGC 4 620 

TAATCGACTA GATTCTCCAG CCGAACAGG1 GGTAATGTAC TTTTTATAGT GTAATCCTAG 4680 

CTGTTGTTAA ATTTAAAATA GAATCCTCTA TCGAGTTAGG GAATTAAATT CAACCAATTT 4740 

TATTCATGTT TTTTCTATCA AATTATCTAA TATTAAAATA GTCTCATTCT GATGAGAAAA 4800 

CTATTCCCAA ATCATTCATA CCTCTCTCAA CTAGATGTAA CTTACAAAAC CCCTGACCTC 4860 

ATGAGCCACT TTCTTCCTCC TCATGAGGTC AGTTTTACTT TCTGCTGTTC CAGTATCGTT 4920 

TTTCCTCGCT AGATTTCCTC AAAAGGGCAG ACTCCTCCCT TGGTGCGTCA CACGATTTTT 4980 

TCATCTCGAC TGTTCTTTAA TGCATCATTA ACGACGCTTT TCTTCTAGGT GGTTCATAAG 5040 

GAACAGGAAG ATTCAGGTTG ACTTTTCTAA TCCTAGAATA AAGTGCTGAA AACAATTCGG 5100 

AATAGGCATA GAGACTAGAC AATTTGAGGA GCTGCTTGCG TCCTGTTCGA ACACATTTTC 5160 

CCACCACGTG AAGAAAAAGA TGGCGGAAGC GTTTGATTGT TAAAGTTTGG AAGTCACCTC 5220 

CAGCTAGATG TTTGAGAAAA AGATAGAGAT TGTAGGCGAT ACAGCTCATC ATCATACGAA 5280 

CTTCGTTTTT GATTAAGGTT GAACT 5305 
* (2) INFORMATION FOR SEQ ID NO: 136: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3964 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

TGGCAGCTCG TCGTCGTAAA GGACGCAAAG TTTTGGCTGC ATAATCCAAA CGAATTCTAT 60 

CAAAAATCAG TAGGAACTCG AGTCTACTGA TTTTTATTTT TGTAAAAAAG TTCAGTAGAT 120 

GCAAATGGAT TCGGAAGCGA TGTTACAGTA GATTGAAACT AGAATAGTAC ACCTCTGTTT 180 

CTAAAACATT GTTAGAAATC GATTTGACTG TCCTGATCGA TTTGTCCTGT TATTATTTTA 240 

TTTTACTATA AAGTTGAAGT AGGTGGAGAT GGTACAGCAA CAATCGTCTT TAAAGATGGT 300 

TCAGCTATTA CAATTCCAGG AAATCAATTG GTAGCACAAG ATCCAAAAGC ACAAGATAGC 360 

ACTAAACTGA CTGCTGAAAA ATCAACTGTT AAAGCACCTG CTCAAAGAGT AGATGTAAAA 420 

GATATAACTC ATTTAACAGA TGAAGAAAAA GTTAAGGTTG CTATTTTACA AGCAAATGGT 480 

TCAGCATTAG ACGGAGCGAC AATCAATGTA GCTGGAGATG GTACAGCAAC AATCACATTC 540 

CCAGATGGTT CAGTAGTGAC GATTCTAGGA AAAGATACAG TTCAACAATC TGCGAAAGGT 600 

GAATCTGTAA CTCAAGAAGC TACACCAGAG TATAAGCTAG AAAATACACC AGGTGGAGAT 660 

AAGGGAGGCA ATACTGGAAG CTCAGATGCT AATGCGAATG AAGGCGGTGG TAGCCAGGCG 720 

GGTGGATCAG CTCACACAGG TTCACAAAAC TCAGCTCAAT CACAAGCTTC TAAGCAATTA 780 

GCTACTGAAA AAGAATCAGC TAAAAATGCC ATTGAAAAAG CAGCCAAGGA CAAGCAGGAT 840 

GAAATCAAAG GCGCACCGCT TTCTGATAAA GAAAAAGCAG AACTTTTAGC AAGAGTGGAA 900 

GCAGAAAAAC AAGCAGCTCT CAAAGAGATT GAAAATGCGA AAACTATGGA AGATGTGAAG 960 

GAAGCAGAAA CGATTGGAGT GCAAGCCATT GCCATGGTTA CAGTTCCTAA GAGACCAGTG 1020 

GCTCCTAATG CTGCTCCTAA GACAACAAGT GCACCGCAAG CAACTGCAGG AACAATGCAA 1080 

GATGTTACCT ACCAGTCACC TGCTGGCAAA CAATTACCTA ACACAGGTTC AGCATCAAGT 1140 

GCAGCACTTG CTAGTCTTGG TCTAGTGGTG GCAACAAGTG GTTTTGCTTT GCTAGGAAGA 1200 

AAGACTAGAC GTAGAAAATA GAACAGCTAG AAAATTCTAT TCTCTACTTA AAGTTAGATT 1260 

ATAAGGGGGA TTTTGAGAAG TCATCAATCC TAGTGATGGG TGAGAAAAGT GAGAACCCAA 1320 

GATAATCACA TACTTTAGCT GAATAGGAAT ATTCTATCAA TGTAGCCAAT CTCTTCTGTC 1380 

TCTAACTGTG GAATAGGAGA TGGGCAATAT CGGATAGAAA AGATAGCAGA ATAGCTCTCT 1440 
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ATTGAAGAGA GGAGGGGAAA CCGAAAAATT 
AAGATAGAAA ACGAGGTTAG AAGAGATGAT 
AGAATTTGCA GGTCGTGAGG CAGAAGAAAT 
GATGAATATT GTTGGTTTTG ATAAACCGAC 
GTATGAGCAG CTCTATGCGA CTATTGGTTG 
GGAAGTTGAG GCTTACTTGT TGGATAAGTT 
AATTGGCTTA GATTACCATT GGATGACAGC 
CCGTCAGATT CAGCTATCTA AGGACTTGGA 
GCTGGAAGAT ACCTATGAGA TTATCAAGAG 
GCATTCATTT TCAGGGACGC TTGAGTGGGC 
TTCCTTCTCA GGAGTGGTGA CTTTTAAGAA 
GTTACCTTTG GACAAGATGT TGGTGGAAAC 
GCGTGGTCGT GAAAATAAAA CAGCCTATAC 
GCGTGGTATG ACGACAGAAG AGCTGGCGGT 
TGGACTGGAC AGCAAGTAAT GAAAGAGAAA 
GATGATACGG TCAATCTCAA ACGTTATTTC 
GCCATCAATG CTCAGGATAT AGAGCGGATT 
GTCTTTACAG ACCCAGATTT TAATGGGGAA 
CCAACAGTTC AGCATGCCTT TCTCAAGCGA 
GGGCGTTCTC TGGGAATTGA GCATGCCAGC 
GTGACAGAAC AATTTGAACA TGAGAGTCAG 
CTTGGTTTTC TAGCAGGGGC AGACAGCCGT 
CGAATCGGCT ATTCCAACGG CAAGCAACTC 
TTGGCAGAAG TGGAAGAAGC TATGAAATCT 
TTTTTAAGTT TCACAGTATT TTTCGAAGCA 
GTCAAAAAAT TAAAGAGATT CGGATAGAAA 
ATGAGCAAGA ACTGACAGTT CGTCAACTGT 
GTTTGCCCAA GTTAGACTAT ATTGCTCGCC 
CGGATTTTTC AGCTCTTCCT TCTGCTTATT 
CAATCTATGG TAAAGAAGAG GAGTACGATA 
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AGGTGCCCCT CCTCTTTTTT 
TTTTGATACA CATACACACT 
TGCCTTGGCT GCTGAGATGG 
GATTGAGCAT GCCTTGGAGT 
GCATCCTACA GAAGCTGGTA 
AAAACATTCC AAGGTTGTGG 
GCCCAAAGAG GTGCAGGAGC 
TTTGCCTTTT GTTGTCCATA 
TGAGGGCGTT GGTCCTCGTG 
AGAGAAGTTT GTGGATCTTG 
GGCAACTGAC CTCCAAGAAG 
AGATGCGCCT TACTTAGCAC 
TCGCTATGTG GTCGACTTTA 
AGCAACGACT GCAAATGCAG 
ATTTCTCAAG TTATCGTGGT 
GATGTGGAGA CCTATGAGAC 
CAGCGCCTGC ACCAACGTCA 
CGGATTCGGC GCATGATCAT 
GATGAAGCTG TTCCCAAGTC 
TATGAAGACC TGAAAACGGC 
TTTGACATTA GTCGTAGCGA 
AAGCGTAGAG AATATCTCGG 
CTCAAACGCC TAGAGTTGTT 
TATGAGTAGG AAAGATGTAG 
GGTAGAAGAG GAGGCGTCTG 
AAGGAATTAG TCGTCCAGAT 
CGCGAATTGA AAGTGGAGCT 
GGCTAGGAGT TCCAGTTTAT 
TAGAATTGAA ATACCAGATT 
AGAAGGAAGC GTGTTTGGAA 



GGTATAATAG 1500 

TGAATGTAGA 1560 

GTGTGACACA 1620 

TGGTAGATGA 1680 

CTTATACAGA 1740 

CTTTAGGTGA 1800 

AGGTTTTTCG 1860 

CCCGTGATGC 1920 

GTGGTATCAT 1980 

GTATGACCAT 2040 

CAGCTAAAGA 2100 

CTGTACCCAA 2160 

TCGCTGACTT 2220 

AACGAATTTT 2280 

TGAAGGGCGT 2340 

TCGAGGTTCT 2400 

TGGAGTCATT 2460 

GATGGTCATT 2520 

CAAGACCAAG 2580 

TCTAGCTCAA 2640 

TTTGATTCGC 2700 

AGAGACTCTC 2760 

TGGGGTTACT 2820 

CCGTTACAAT 2880 

ATGTTAATTG 2940 

TTTTGTGGAG 3000 

TCGCAACCGA 3060 

AGCCTTATGC 3120 

TTACGTGAAC 3180 

GAGATTTATA 3240 
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AAACATACTT TGATAATCTT CCTAAAGAAG AACAATTAGC ATGTGAAGTA TTGCAGGCGT 3300 

GTTTGGATAC TTCTAGAACT AGAAGGCCTG AATATGCAGA GTTAATACTT GAGGAACATA 3360 

TGCCTCAGAT TATAGAAAAA GAAGCTTATT CAATAAATGA TATGTTGTTG ATTCGTTTGT 3420 

TTTTTTATCA AATGCTCATT AGAAAAGATC TTGCCAAATT TATAAATCAA ATCGAAAAGC 3480 

TAATGCTCTT TCTTTTGGAA CAGAAGAAGG TAACTCAAAT AGAGAATTAC TTTATAATTA 3540 

GAG AT ACT CT TATTTCAGGA ATGTGTTGTC TTGAAAAGGT AGGAGTAACT GATTGTTTTA 3600 

ATGATTATCT ATCGTGTTTA CAAGAAATTA TGGATAAAAC TCAAGATTAT CAAAAGAAAC 3660 

CTCTTGTATT TATGTTTTTG TGGAAGCAAG CATTAAGAGA AGAAAGAGAT TTTAGTTTAG 3720 

CTGAATCATT TTATCAGTCT TCTAAAACAT TTGCGCAGCT AATTGGAGAT GAATTTCTAG 3780 

TAAAGAAATT GACAGAGGAA TGGCAAGAGG ATGTCAAAAA ATATTTATAA ACATAGTGAA 3840 

TCAGTGACAA AGATGTCCTT GTCCTCGTAT CAAAACAGTT CTAAAGTTCG TCTTTAGGGA 3900 

TGTTTTTTTA GATATAAGCT AAAAATGACA CGAAATGGTT AGATTTTAAG GACATTGATG 3960 

TCCG 3964 

(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12666 base pairs 
" (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 



TGAGACCGTT 


ATTTGTATTA 


GGGAAATGGG 


TATCTATTTT 


TAATGCTGTG 


GGGATTTTGA 


60 


TTGTTTCTAT 


TATTCAAACC 


AAAAGCTTGT 


CAGGTATTGG 


AGCAGGATTG 


TTTAATCTAT 


120 


ATAACATTTC 


ATCTTATATA 


GGTGATTTAG 


TTAGTTTCAC 


TCGATTGATG 


GCATTAGGAT 


180 


TATCTGGAGC 


AAGTATAGCA 


TCAGCTTTCA 


ATTTAATTGT 


TGGTTTGTTT 


CCGGGAATAT 


240 


TGGCTAAACT 


GACAATTGGA 


TTAGTATTAT 


TCATTCTTTT 


ACATGCGATC 


AATATTTTTC 


300 


TATCGTTACT 


ATCAGGATAT 


GTTCATGGAG 


CACGTCTGAT 


ATTTGTTGAA 


TTTTTTGGTA 


360 


AGTTTTATGA 


GGGTGGAGGA 


AAACCATTTC 


AACCTTTGAA 


GGCTTCTGAG 


AAATATATTA 


420 


AGGTTATTAC 


AAAGAATTAA 


TGGAGGATAT 


ATATAATGGA 


ACATTTAGCA 


ACTTATTTTT 


460 


CAACCTATGG 


AGGAGCTTTC 


TTCGCTGCAT 


TGGGAATTGT 


ATTGGCGGTT 


GGATTAAGCG 


540 


GTATGGGGTC 


TGCTTATGGA 


GTTGGTAAGG 


CTGGGCAATC 


TGCCGCAGCT 


TTACTGAAAG 


600 



WO 98/18931 



PCT/US97/19588 



922 

AACAGCCTGA AAAGTTTGCC TCAGCTTTGA TATTGCAATT ATTGCCCGGA ACACAAGGAT 660 

TATATGGTTT TGTTATTGGA ATTTTAATTT GGTTGCAATT AACTCCAGAA CTTCCTTTAG 720 

AAAAAGGCGT TGCTTATTTC TTTGTAGCTC TTCCAATTGC TATTGTAGGA TACTTTTCAG 780 

CTAAGCATCA AGGAAATGTA GCAGTAGCGG GAATGCAAAT CTTGGCTAAA AGACCAAAAG 840 

AATTCATGAA GGGAGCAATT TTAGCTGCCA TGGTAGAAAC CTATGCAATT CTTGCTTTTG 900 

TCGTATCATT CATTTTGACC CTTCGTGTAT AAGAAATAAA TTTGCAATTC AAAGGAGGTG 960 

TCTAAATGAG CAATTTAGAA AACTTACGAG AGTCTGTTAT TGAACAAGCT CATGAAAAAG 1020 

GGCGTATGAA ATTATTGGAT TCCAAAAAGA AGATTGATGA TGAATTTGAA ATGCAAAAGT 1080 

CGCTCATTAT AAAGAAAAAA GAAGCTGAAC ATGAACGAAA GTTAAAAGAA TTGCAACAGA 1140 

AATATCAAAT AATTTTTCAA CAATTAAAAA ATAAGGAACG CCAATCAACG TTAGTATCAA 1200 

AACAGAAAAT ATTAAAAGAA CTTTTTCAAT CTGCTTTACT AGAAATGGAA TCTTGGAGTG 1260 

CAGATAAAGA AATGGAGTTC ATCTATCGAA TTCTGGAACG ATATTCACAA CAAGAGGTCA 1320 

TAGTAACCTT TGGGGAACGG ACTTTAGCTA AATTCAATTT GGAACAATTA GAGAAATTGA 1380 

AATTCTCTTT TCCAAATTAT TTATTTAGTG AACAACCTAT CTCAAATGAA TCAGGCTTAC 1440 

TTATTTCAAT AGGTAAAATT GATGATAACT ATTTGTATAA AACATTAATT GGATCGATTT 1500 

CTAAGGAAGA AAGTTCAAGT ATCGCAAATC AAATTTTTAT CAATTAAGGA TGAAATTGGT 1560 

TAATCCTTCT TAGAAATTTG GAGTATTCCA ATAAAATTAG AAAGGTATTT TATGGATACT 1620 

AATCTTTTTT CAAAAATAAA TACGACGATT TCGGTAAAAG AAAACGATTT TATTACAGAA 1680 

GAAAAATTTC AAAAAATTAT ACAATCCAAA GATACGGAGA CATTGGCATT TATCTTAGAA 1740 

TCAACTCCCT ATCATTTATC GATTGACATC TTAGAAGATC CTAGTCAGAC AGAGATTTCG 1800 

CTAATGACAA AATTAGTCAA TGATTATAGA TGGGCCTATG CTGAAAGTCC GTCTGATATA 1860 

ATTGTGACTT TATTTGCTTT ACGATATGTT TATCATAATA TCAAAGTTTT ATTAAAATCT 1920 

AAGGCGGCAA TTAAGAAAGA TTTTTCTAAA TTATTAATTC CAATAGGGAT TTTTGATATA 1980 

GAAAGTTTAA AACATTTAGT TTCTTCCTTA CATTCAGATA CACTTCCTGA TTTTATGGTT 2040 

CGTGAAGTAG AATCAATTTG GAATGAGTAT GAAACTTTTA ATAATATTCG TGTACTTGAT 2100 

GTCGGAGCTG ATCTAGCATA TTTTAAACAT CTGAAACTTT TATCTAATGA GTTAGATGAG 2160 

GTACTGTCTC AGGTTATTGT CGAAATGATT GACTTTTATA ATATTATTAC TGTAAAACGT 2220 

GGTTTATCTC AAAATAAGAG TCATGGGGAT ATTTTACAAT TACTTTCAGA TGAAGGAAGT 2280 

ATTTCTGCTA AAGAATTTAT AT AC ATT GT A GAAAATCAAG AAATATTTGT GTGGTTCAAT 2340 

AAAATAAATC CAAGCTTAGA TTCAATCTTT TCAACTTATG AATTGAAGAT GCAGGACGCA 2400 
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ACAATTTCAT CTTCTGAGTT AGAATTTTTA 
CAAGGAAGGT ACAATGTAGA GGGGCCGTTA 
TTTGAAGTAA AGAATCTCAG AATGATCATA 
TCAATAAAAG AAAGGATACG CCCACATTAT 
TGGTAGCCGT GATATTATTT TACCATTTAG 
CCAAGAACAA GAAGCTATAA ATACACTAAG 
TTATATCACT GAAGACATTG CTTCAATGAT 
AGTTGTGCCT GCTATTATTT TATTACCGAC 
ACGTATAGAG GATAATGTAG AGAAAGCAGT 
ATTGTCTGTA ATATTATTCT ATAATTTTTG 
GGGAAGATTA TAAAAGTATC GGGACCTCTA 
ATTCAAGATA TTTGCCGTGT AGGTAAGCTA 
AGAGATCAGG CATCTATCCA AGTCTATGAA 
GTTGTTACAA CTGGAGAACC TCTCTCGGTT 
TTTGATGGCA TACAACGCCC ATTAGATCGA 
GTTCGTGGGG TAGAAGTTCC AAGTTTGGAT 
ATAGCAATTG GTCAAAAAGT GAGTACGGGT 
GTAGTTAATC ATAAAATTAT GGTTCCTTAT 
TCTGGCGATT TTACAATTGA TGAAGTTGTA 
TATAAAGGAA CGCTTATGCA AAAATGGCCT 
TTAATTCCAG AAGAACCATT AATCACAGGT 
ACCAAAGGGG GAGCTGCAGC AGTTCCTGGA 
CACCAAGTAG CTAAATTTGC CAATGTTGAT 
GGAAATGAAA TGACGGATGT ACTGAATGAG 
CAATCAATTA TGCAACGGAC AGTTCTGATT 
CGTGAGGCTT CAATTTATAC AGGAATTACC 
TCTGTCGCCA TTATGGCTGA TTCAACTTCA 
GGACGTCTAG AAGAAATGCC TGGTGATGAG 
GCTGAATATT ATGAAAGAGC AGGACGTTCT 



923 

TGTGATTTAC TATTGTATAA 
GTTCTTGCTA GATATTTATT 
TCAGCTCTTC AAAATACAAT 
GGAAGCTAAT AAGTATAAAA 
CATGATTGGG TTTGATATAT 
AAAATTAGCT CAATCTGATT 
ATTAGATACA ATTCGCCATT 
TCATAAACAA GGTTTAAATT 
AGGACACAAT ATTTTATAAT 
GACTTAGTAA GGAGAATAAC 
GTTATTGCAT CAGGTATGCA 
GGGTTAATCG GTGAAATTAT 
GAAACATCTG GTCTTGGTCC 
GAATTAGGGC CAGGATTGAT 
TTTAAATTGG CTACTCATAA 
AGAGATATTA AGTGGCATTT 
GATATTCTTG GAACTGTCAA ' 
GGAGTATCTG GAGAAGTCGT 
TATGAAATAA AAAAATTGGA 
GTCCGCAAGG CGCGTCCTGT 
CAACGAGTTA TTGATGCATT 
CCGTTTGGAG CAGGAAAGAC 
ATTGTTATTT ATGTCGGTTG 
TTTCCTGAGT TGATTGACCC 
GCTAATACTT CAAATATGCC 
ATGGCTGAGT ATTTTCGTGA 
CGTTGGGCAG AAGCGCTACG 
GGTTATCCTG CTTATCTGGG 
CAGGTTCTAG GGCTTCCAGA 



AACTTTAGAT 2460 

GGGATGTGAG 2520 

TCCCTTTGAA 2580 

TTGGCATAAT 2640 

TTCCTGCCTA 2700 

ATGGTGTCAT 2760 

ATGATTCCCA 2820 

TAGGATTAAA 2880 

AATGTACAAA 2940 

TTTGACTCAA 3000 

GGAGGCTAAT 3060 

TGAAATGAGA 3120 

GGGAGAACCT 3180 

TTCTCAAATG 3240 

TGATTTTCTA 3300 

TGATTCCACT 3360 

GGAAACCGAG 3420 

TTCTATTGCA 34 80 

CGGTAGTTTC 3540 

TTCTAAACGT 3600 

CTTTCCAGTA 3660 

AGTTGTACAA 3720 

TGGAGAACGT 3780 

TAATACCGGA 3840 

TGTTGCTGCT 3900 

TATGGGCTAC 3960 

TGAAATGTCA 4020 

AAGTCGTATC 4080 

ACGTGAAGGA 4140 
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ACGATTACTG CTATTGGAGC TGTATCGCCA CCTGGTGGAG ATATTTCAGA ACCAGTTACT 4200 

CAAAACACTT TACGGATTGT GAAAGTTTTT TGGGGGCTTG ATGCTCCGTT GGCACAGCGA 4260 

CGTCATTTTC CTGCAATTAA CTGGCTTACA TCTTATTCAC TATATAAAGA CAGTGTGGGC 4320 

ACTTATATAG ATGGTAAAGA GAAGACAGAT TGGAATAGTA AAATAACTCG TGCGATGAAC 4380 

TACTTACAAC GGGAATCTAG TTTAGAGGAA ATTGTTCGTC TTGTTGGAAT TGATTCTCTG 4440 

TCTGATAATG AACGACTAAC GATGGAAATT GCTAAACAAA TTCGAGAAGA TTATTTGCAA 4500 

CAGAACGCTT TTGATTCGGT AG AT AC ATT C ACTTCGTTTG CAAAACAAGA AGCAATGCTA 4560 

AGTAATATTC TCACTTTTGC TGATCAGGCA AATCATGCTT TAGAGTTGGG TTCTTACTTT 4620 

ACAGAGATTA TGGAAGGTAC CGTGGCAGTT CGAGACCGTA TGGCGAGAAG TAAATATGTT 4680 

TCAGAAGATA GATTAGATGA AATCAAAATT ATATCAAATG AGATTACACA TCAAATTCAT 4740 

TTGATATTAG AAACAGGAGG TCTATAAATG AGTGTTATAA AAGAATACAG AACTGCTAGT 4800 

GAAGTTGTTG GGCCTCTTAT GATTGTTGAA CAAGTAAATA ATGTGTCTTA CAATGAGTTA 4860 

GTTGAAATTC AACTTCATAA TGGAGAAATT CGTCGTGGAC AAGTTTTAGA GATCCACGAA 4920 

GATAAAGCAA TGGTTCAGCT TTTTGAAGGA TCTAGTGGAA TAAATTTAGA AAAGTCTAAA 4980 

ATTCGTTTTG CTGGTCATGC ATTAGAATTG GCTGTATCTG AGGATATGGT TGGTCGTATT 5040 

TTTAATGGGA TGGGAAAACC AATTGATGGT GGACCAGATT TAATTCCAGA GAAATATTTA 5100 

GATATTGATG GTCAAGCTAT TAATCCTGTA TCTAGAGATT ATCCAGATGA ATTTATTCAG 5160 

ACAGGGATCT CCTCTATTGA TCATTTGAAT ACTCTTGTAC GTGGTCAAAA ATTACCAGTA 5220 

TTTTCAGGTT CGGGCTTACC TCATAATGAA TTAGCTGCTC AGATAGCAAG ACAAGCGACT 5280 

GTTTTAAATT CTGATGAAAA TTTTGCGGTT GTATTTGCAG CAATGGGTAT TACTTTTGAA 5340 

GAAGCTGAGT TTTTTATGGA AGAACTCAGA AAAACAGGAG CGATCGATCG TTCGGTTTTA 5400 

TTTATGAACT TGGCAAATGA TCCTGCAATT GAGCGTATTG CAACTCCCCG CATTGCTTTA 54 60 

ACTGCGGCAG AGTATCTAGC TTTTGAAAAA GATATGCACG TTCTAGTTAT CATGACGGAT 5520 

ATGACTAACT ATTGTGAAGC GTTACGTGAA GTCTCGGCAG CTCGCCGTGA AGTTCCAGGG 5580 

AGACGAGGCT ATCCGGGATA TTTATATACA AATTTATCAA CTCTATACGA AAGGGCTGGT 5640 

CGCTTAGTTG GTAAAAAAGG TTCGGTGACA CAGATTCCTA TTTTAACAAT GCCAGAAGAT 5700 

GACATAACAC ATCCAATTCC TGATTTAACT GGATACATTA CTGAAGGGCA AATTATTTTG 5760 

TCGCATGAGT TGTATAATCA AGGTTATCGT CCACCAATCA ATGTTTTACC TTCTCTCTCT 5820 

CGATTAAAAG ATAAGGGATC TGGAGAAGGT AAAACTCGTG GAGATCATGC TCCAACTATG 5880 

AATCAACTGT TTGCAGCCTA TGCCCAAGGG AAAAAGGTTG AAGAGTTAGC AGTAGTATTA 5940 



WO 98/18931 



PCT/US97/19588 



925 

GGAGAATCGG CTTTATCTGA TGTAGATAAA TTGTATGTGA GGTTTACAAA GCGTTTTGAA 6000 

GAAGAGTACA TAAACCAAGG ATTTTATAAA AATCGAAATA TAGAAGATAC GTTGAATCTT 6060 

GGGTGGGAAT TACTATCAAT TCTTCCTAGA ACAGAGTTAA AACGTATCAA AGATGATTTG 6120 

CTTGATAAAT ACTTACCTTT GGTAGAAGTT TAATCCGGAA ATGGAGTGAT TATCTATGGT 6180 

ACGTTTGAAT GTAAAACCAA CTCGTATGGA ATTGAATAAC TTAAAGGAAC GTTTGACAAC 6240 

AGCTGAACGT GGACATAAGT TATTAAAGGA TAAAAGAGAT GAATTGATGA GGCGATTTAT 6300 

TTCTTTGATT CGTGAGAATA ATCAACTTCG GAAAGAAGTG GAAAGTTATC TAATTGATAA 6360 

TCTAAAATCC TTTGCAGTTG CTAAATCATT AAAGAATTCT CAAATGGTGG AGGAATTATT 6420 

TTCAATTCCA TCGAAAGAAA TTGAATTATT TGTTGAGAAA GAAAATATCA TGAGTGTAAC 6480 

AGTTCCTAGA ATGCATATGA ATATTACTTC TCAAAATGAG AACAGTGAAT ACAGCTATTT 6540 

ATCTTCTAAT AGTGAAATGG ATGATGTATT TGCTACAATG AATAGTTTAA TTTATAAATT 6600 

ACTAAGACTG GCAGAAGTTG AAAAAACGTG TCAGTTAATG GCTGATGAAA TAGAAAAAAC 6660 

ACGTAGACGT GTAAATGGTT TAGAATACTC GATTATTCCA AACTTGTCGG AAACTATTCA 6720 

TTATATAGAA TTGAAACTAG AGGAGGCAGA AAGAGCCAAT TTAGTTCGTA TTATGAAAGT 6780 

GAAGTAGATC CTTTATTTAG ATTATTAATT AGATGAACAA ATATCAGCTT GGATAAGGCT 6840 

TTAAGCCTTT CTAAGCTTTT TTTATTGACA GTATCAGGAT ATCTTTTTCA AAATTTTGGT 6900 

TTGTTAGATA ATGAAAATGT TTCTACTAAT CTAGATTTAG GATTAGTAAA TCGTAAATGT 6960 

AATTATATAG AAAGTAAGCG CGTCATAACA AGGTATCTAT CATTCATGGA GCTCCTCCTG 7020 

TATACTATTA GTAAAGTAAA ACTATTGGAG GATATTTTAA TGCCACAACC TATTGTTCCT 7080 

GTAGAGATTC CACAATCTCG TCGTTTTGAT TCTAAAAAGA GAAATGATAT TCTGCTTAAA 7140 

ATTCGTATTG GCAAGCTTGA AGTAAGTTTT TTTCAATCTC TCAATCTCGA AATGGTAGAA 7200 

CAGCTTTTGG ATAAGGTGTT GCTCTATGAC AATTCATCTA TCTAGCCTAG GGGAGGTCTA 7260 

TCTCGTGTGT GGGAAAACTG ATATGAGACA AGGAATCGAT TCACTGGCTT ATCTGGTTAA 7320 

AACCCACTTT GAATTGGATC CTTTCTCCGG TCAAGTCTTT CTCTTTTGTG GTGGACGTAA 7380 

AGACCGCTTT AAAGTCCTTT ACTGGGATGG TCAAGGATTT TGGCTACTAT ATAAACGCTT 7440 

TGAGAACGGC AGATTGATTT GGCTAAGTAC AGAAAAGGAT GTCAAAGCTC TCACACCAGA 7500 

ACAAGTAGAC TGGCTTATGA AGGGCTTTTC TATCACTCCA AAAATATAGT AGATTGAAAC 7560 

TAGAATAGTA CACCTCTGCT TCTAAAACAT TGTTAGAAAT CGATTTTACT GTCCTGATCG 7620 

ATTTGTCCTG TTCTTATTTC ATTTTACTAT AAATCCATCA GAAAGTCGTG ATTTCTATTG 7680 
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AAATGAGGAC TTTCTTTTTA TACTCATCTG CTTTCAAAAA GCATTCTAGT CCATCTCCGA 7740 

TTAACGATGG ACTTTATCAC CTCCTTCTCC AGTCCTTGTA TAACATCTTG GAGTTGATTC 7800 

ATGACATCTT CCAAAGTTTA AAAGGCTTTA TTCTTAAATC CACGTTTACG AATCTCTTTC 7860 

CACACTTGTT CAATGGGGTT CATCTCTGGT GTGTATGGAG GAATAAATGC AAAGCCAATA 7920 

TTAGTCGGAA TCTTTAAGGT ACTTGATTTA TGCCATATAG CATTGTCCAT AACGAGTAAA 7980 

AGATAATCAT CTGGATAAGC TTGTGAAATC TCCTATTCCT AAAGCCCCTT TAGCGCATAA 8040 

CTTTGGCTCA GCTTCTATTA TCGCTCACAC CATCCATCAG AAGTTTAATC TGAAGGTACC 8100 

CAATTATCGC CAAGAAGAAG ATTGGGCTAG GATGGGTTTA CCAATCACAC GTAAGGAAAT 8160 

CTCTAATTGG CATATCAAGG CGAGTCAATA CTATTTGGAG CCCCTTTATA ACCTCTTGCG 8220 

AGAGAGACTA TTGACTCAGC CCTTAGTTCA TGCGGATGAA ACTTCTTATA GGGTGCTAGA 8280 

GAGTGATAGT CAGCTGACTT ACTATTGGAC TTTTTTGTCA GGTAAAGCAG AGAAACAAGG 8340 

GATTACGCTT TACCACCATG ATCAGTGTCG AAGTGGTTCA GTAGTACAAG AATTCCTAGG 8400 

AGATTATTCT GGCTATGTGC ATTGTGATAT TTTGCGGCAG TAACTTAGGA CTTTAGTCCT 8460 

CTAGTTCTGC CTATGCGATA GCAGTCCAAG GTTTAGGAGC AAGGCGACGC TAAGCTTGGT 8520 

AAACTTCGAA CCGCTCGTCT GCTTATCGTC AACTGGAAGA AGCTGAACTT GTTGGATGTT 8580 

GGGCGCATGT GAGAAGGAAG TTTTTTGAAG CGCCCCCCCA AGCAAGCGGA TAAATCATCC 8640 

TTAGGAGCTA AAGGTTTAGC TTATTGTGAT CAGTTATTTT CCTTGGAAAG AGACTGGGAG 8700 

GCTTTGCCAG CTGATGAACG ACTACAGAAA CGTCAAGAAC ATCTCCAGCC CTTAATGGAA 8760 

GACTTCTTTG CTTAGTGCCG GCGTCAGTCA GTTTTAGCAG GTTCAAAACT AGGAAGGGCA 8820 

ATTGAATACA GCCTCAAGTA TGAAGAAACC TTTAAGACCA TTTTGAAAGA CGGACATCTG 8880 

GTCCTTTCCA ATAATCTAGC TGAACGCGCC ATTAAATCAT TGGTTATGGG ACGGAGTAAA 8940 

AGAGTCCAGT GGACTCTTTT AGCCTAAGCT CAGTTTAAAA AAGCGAGGGT GGTTATTTTC 9000 

TCAAAGTTTT GAAGGAGCTA AAGCAAGAGC T ATT ATT AT G AGTTTGTTGG AAACAGCTAA 9060 

ACGTCATCAA TTAAATAGCG AGAAATATCT ATCCTATCTT CTAGAATGTC TTCCAAACGA 9120 

GGAAACTCTC GTAAACAAAG AGGTTTTAGA GGCTTATTTA CCATGGACTA AAGTTGTAC A 9180 

AGAAAAGTGC AAATAAGAAA TCTCCAGATT AGGAACTATC CGTGAGTTCT CCAGTCTGGA 9240 

GATTTTTCAA TAGACTTCCT GCGAAACAAA ATATGGTATA ATAGTTCTAT GAATGATGAA 93 00 

GCAAGTAAAC AACTAACCGA TGCACGATTT AAGCGTCTTG TTGGTGTTCA ACGCACGACT 9360 

TTTGAAGAGA TGTTAGCTGT ATTAAAAACA GCTTATCAAC TTAAACACGC AAAAGGTGGA 9420 

CGAAAACCTA AATTAAGTCT AGAAGACCTT CTTATGGCCA CTCTTCAATA TGTGCGAGAA 9480 
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TATCGAACTT ATGAACAAAT TGCGGCTGTT TTTGGTATTC ACGAAAGCAA CTTAATCCGT 9540 

CGGAGCCAAT GGGTTGAAGT AACTCTTGTT CAAAGTGGTG TTACGATTTC AAGAACTCCT 9600 

CTCAGTTCTG AGGACACGGT AATGATTGAT GCGACGGAAG TAAAAATCAA TCGCCCTAAA 9660 

AAAAGAATTA GCGAATTATT CTGGTAAAAA GAAATTTCAC GCTATGAAGG CTCAAGCGAT 9720 

TGTCACAAGT CAAGGGAGAA TTGTTTCTTT GGATATCACT GTGAACTATT GTCATGATAT 9780 

GAAGTTGTTC AAAATGAGTC GCAGAAATAT CAGACAAGCT GGTAAAATCT TGGCTGACAG 9840 

TGGTTATCAA GGGCTCATGA AGATATATCC TCAAGCACAA ACTTCACGTA AATCCAGCAA 9900 

ACTCAAACCG CTAACAATTG AAGATAAAGT CTATAACCAT GCGCTATCTA AGGAGAGAAG 9960 

CAAGGTTGAG AACATCTTTG CCAAAGTAAA AACGTTTAAA ATGATTTCAA CAACCTATCG 10020 

AAATCATCTA AACGCTTCGG ATTACGAATG AATTTGATTG CTGGTATTAT CAATCATGAA 10080 

CTAGGATTCT AGTTTTGCAG GAAGTCTATT ATCAAAAATA CCATCAAGAT TATATAAGAT 10140 

TGATACAGGA AAAGTTTTAT TTGATGGTGT AAATATTAAT CAAATAGATA AAAAAATATT 10200 

AAGTCAAAAT TTAGGAGTAG TTCCACAGGA TTCATTTTTA TTGAACCGAA GTATTCTTGA 10260 

TAATATAACT TTAAAGCACG AAGTTACTTC ACAAAAGATA GAGGAAGTTT GTAAAGCAGT 10320 

TCAAATCTAT GATGAAATCA TGGCTATGCC GATGAAATTT AATACTATCA TCTCAGAGAT 10380 

GGGGTCAAAT ATTTCAGGTG GGCAAAGGCA ACGGATAGCA CTGGCACGTG CATTAATAAA 10440 

TAATCCTAGT ATTGTAATTT TAGATGAAGC AACTAGTGCA TTAGACACTA TTAATGAGGA 10500 

AAGAATAACA AAGTATATAC AAAGTCAGGG CTGTACTCAA ATAATTGTAG CTCATAGATT 10560 

GTCAACGATT AAGGATGCGG ATGTTATTTT TGTAATGAAA GGTGGTAAGA TTGTTGAGTC 10620 

AGGAAATCAT AAGTACTTAA TGGATCTTGG TGGAGAGTAC TACAGCTTAT ATACAAAAAG 10680 

GAAATGAGGT GTAAAGAAAA TGAAGAAAGA AAATGAATAT GTAATTTTAA CAACAGCCTC 10740 

ACTAGGGGTG ATGATTGGAA TAGTGTTTGC AATTTTTTTA GATTTTCCAG TTGAATATGG 10800 

TATTTCTTTA GGCTTGTTGA ATGGAATAGT ATTGGGTTCG CTGATTGTTT ACAAAAACAA 10860 

TAAGAATTAA GCATAATTTT TTGCTGTAAA CTAAGGAGTA GAGATGGCTA TAGTTGAAAT 10920 

TATAAATCTA ACAAAAAGCT TTAAAGATAT TGAAGTTATT CATAACACTT AAATAATAGA 10980 

GCAACTACAG TAGTAGCTTA AAAACATGAT TAAATCGCTA TTCTTAGGAG TAGCGGTTTT 11040 

TCTTTTTGTT TAATACTCTT TGAAAATCTC TTCAAACCAC GTCAGCTTTG CTTTACCGTA 11100 

CTCAAGTACA GCCTGCGGCT CGCTTCCTAG TTTGCTCTTT GATTTTCATT GAGTATAAAA 11160 

AGGGTCAAGT AAGTATAGTA AATTGAAATA AGATATGAAC AAATCGATTA GAAAAGTCAA 11220 
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ATTAATTTCT 


AGAAATATGT 


TAGAAATTGG 


TTTGAATTCC 


GCAATCAATT 


TGTTCAGTTT 


11280 


TTATTTCATT 


TCATTTTATT 


TAATTAGATT 


TTCCAATTTT 


TTAATTCAAG 


CTAAAAATCC 


11340 


CCAATCGTAG 


TGATTGAGGA 


TTGAGTAAAT 


AAATCTTAAA 


CAATACCTTG 


TGCAATCATG 


11400 


GCATTTGCTA 


CATTTTCAAA 


GGCAGCAATG 


TTAGCTCCTG 


CAAGGTAGTC 


TTTATCAAGA 


11460 


CCGTATGTTT 


CTGAAGTCGT 


TTTAGCTGTG 


TTGAAGATGT 


TTGTCATGAT 


GTCTTTGAGA 


11520 


CGGCCATCAA 


CTTCTTCACG 


AGTCCATGAG 


AGGCGAAGAC 


TGTTTTGGCT 


CATTTCAAGA 


11580 


GCTGAAACGG 


CTACACCACC 


AGCGTTGGCA 


GCTTTTGCAG 


GTCCGTAGAA 


GATACCATTT 


11640 


TCTTTGTAAA 


CTTTGATGGC 


ATCAAGGTCG 


CTCGGCATGT 


TGGCACCTTC 


AGATACACAG 


11700 


ATAACGCCTT 


GAGCAACCAA 


ACGTTTAGCT 


GCTTCACCGT 


TGATTTCGTT 


TTGAGTGGCA 


11760 


CATGGAAGAG 


CAATGTCATA 


GTTTCCAGCG 


TAAGTCCATA 


CAGTACCTTC 


GTGGTAGGTT 


11820 


GCAGTTGCTT 


TTTCAGCTGC 


ATACTCAGTC 


AAACGAGCAC 


GACGTTTTTC 


TTTAACATCA 


11880 


ACCAAAAGAT 


CGAAGTCGAT 


ACCATTTTCA 


TCGATGACAT 


AACCATTTGA 


GTCAGAAACA 


11940 


GAAATAACAG 


TTGCACCGAG 


TTCAGTTGCT 


TTTTGAAGAG 


CATATTGAGC 


AACGTTACCA 


12000 


GAACCTGAAA 


TAACGACTTT 


CTTACCAGCA 


AAGCTGTTAC 


CGTTAGCTTT 


GAGCATTTCT 


12060 


TCAGTATAGT 


AAACCAAACC 


GTAACCAGTT 


GCTTCTGGAC 


GAATCAAGCT 


ACCACCAAAT 


12120 


CCAAGAGGTT 


TACCAGTCAA 


GACACCAGCA 


TCAAATTGGT 


TAAGACGTTT 


GTATTGACCG 


12180 


TAAAGGTAAC 


CAATTTCACG 


TCCACCAACA 


CCGATATCAC 


CAGCAGGTAC 


GTCAAGTGAT 


12240 


GGTCCGATGT 


GTTTTTGCAA 


TTCAGTCATG 


AAGCTTTGGC 


AGAAGCGCAT 


1 1 UAuLA 


1«JUU 


TCTGTTTTAC 


CTTTAGGATC 


GAAGTCTGAT 


CCACCTTTAC 


CTCCACCGAT 


AGGAAGTCCA 


12360 


GTCAAGACAT 


TTTTAAAGAT 


TTGTTCAAAT 


CCGAGGAATT 


TCAAGATCCC 


TTGGTTTACA 


12420 


GTTGGGTGGA 


AACGAAGTCC 


ACCTTTGTAT 


GGTCCAACAG 


CTGAGTTGAA 


TTGAACACGG 


12480 


TAACCACGGT 


TTACTTGAAT TTTTCCATCA CGGTCAACCC AAGGAACACG 


GAAAGAAACC 


12540 


ACGCGCTCAG 


GCTCAGTAAT 


ACGTGCCAAG 


ATATTTTCTT 


CGATATACTC 


AGGGTGTTTT 


12600 


TCAAATACAG 


GTTCTAAAGT 


GTTGAAAAAT 


TCTTCAACAG 


CTTGGAGGAA 


TTCAGCCTCG 


12660 


TGCCGG 












12666 


(2) INFORMATION FOR SEQ ID NO: 138: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3083 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

AGCAACTGTT GTGAACCAAT TCCGATAAAT TCCAAGAATT GGTTAATAGA GCCATTTTGA 60 

CCAAAAATCC CGATAAAAGC ATAGGCTTTA AGGAGCAAAT TGATCCAGGT AGGAAGGATA 120 

ATCAGCATGA GCCAGAGTTG ACGGTGTTTG AGACGGGTCA AAAAGAGGGC CGTCGGATAA 180 

CTGATAAGCA GTGCCACAAA GGTCACAATG CCTGCATAAA GCACTGAGTT GAAACTCATT 240 

TTAAGATAGG TCAAGTTTTG TGACGCAAAG TAAGATTTGT AATTTTCTAA ACTGAACTGG 300 

CCTTCGATGT TGAAAAAGGA TTGACCGAAA ATCAAGACCA AGGGTGCCAA TACAAAGAGC 360 

GCAATCCAAA GCATGTAGGG TACTACAAAG AGTTTAGAGC TTGTTTTCTT CATCTCTTTC 420 

CTCCTCGATT GCATTGATCA AACCTGCTTC TTGCTCTTCG ATTTCTACGT ACTCCTCAAT 480 

ACGAGCATCG AACTCTTCTT CGGTTTCATT GAGACGCATG ATGTGGATGT CTTCTGGTTC 540 

AAAGTCCAGA CCGATTTCCT CACCCACGAT AGCCTTACGG GTTGAGTGGA TCATCCATTC 600 

ATTTCCAAGT TCGTCATAGG CGATAATTTC ATAATGAACT CCACGGAAAA GCTGGGTATC 660 

GACCTTAACT TGGAGCTTGC CTTCTTCAGG AAGGGTAATG CGCAAGTCCT CTGGACGAAT 720 

AACGACCTCA ACAGGTTCAT TTGGCTTCAT CCCACCATCA ACCGCTTCAA AGCGTTTGCC 780 

GTTAAATTCG ACCAAGTAGT CCTCAATCAT GGTACCTGGC AAGATGTTTG ACTCCCCGAT 840 

AAAGGTGG CA ACAAAGTGGT TGATTGGCTC ATCGTAGATG TCCACAGGGG TTCCAGACTG 900 

GACAATCTCG CCATCATTCA TAACGAAAAT CCAGTCACTC ATGGCAAGAG CTTCTTCCTG 960 

ATCGTGAGTG ACAAAGACAA AGGTAATGCC CAATCGTTGT TGTAATTCAC GCAATTCGTA 1020 

CTGCATGTCT GTTCTCAATT TCAAGTCCAG CGCTGATAAA GGCTCGTCCA ACAAGACCAC 1080 

ACGGGGTTGG TTGATGATAG CACGGGCGAT GGCCACACGC TGACGTTGTC CTCCAGAAAG 1140 

TTTGCGGATG GAACGTTTTT CATAACCTTC CAACTGAACC ATCTTGAGAA CTTCCGCTAC 1200 

ACGCTGCTCG ATTTCTTTCT TATCAATTTT ACGCAAGCGA AGTGGAAAGG CAACATTTTC 1260 

AAACACATTC ATATGTGGGA ACAAGGCATA GGATTGGAAG ACGGTATGTA CGTCGCGCTT 1320 

GTTGGTTGGA ATATCATTGA TACGAACACC GTCTAGCATG ATATCTCCTG TCGTCGCATC 1380 

CAGTAAACCT GCAATAATGT TTAGGATAGT TGATTTCCCC GAACCAGATG CACCTAGAAG 1440 

GGTGTAGAAT TTCCCTTCTT CCAACTCAAA GTTGATGTCT TTGAGAACCT TGGTGTTGCT 1500 

GTCTTCAAAA ACTTTAGAGA CGTTTTTGAA TTCGATAATT GGCTTTTTCA ATTGGCATAA 1560 

ATTCCTTCTT TTTCATAGAT TAACCGATCG GGGCTCTGTC AGGTCCCCAC TACCTCTTGC 1620 

AGGGAGTAAA ACCACCTGCA TACATCTTCG CTACCGATAG GCTTTCACCC AAGATCCGGA 1680 
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CTTCTCTTTC AAGCGTAATA CCTGAGTGTT CCTTGACTTT TTCGATAACC GATTGGATCA 1740 

AGTCCTCGTA GTCTTTGGCC GTTCCATCTG CGACATTGAT CATAAATCCT GCATGCTTTT 1800 

CTGACACTTC TACGCCACCG ATACGATAGC CTTTCAAGCC AGCTTCTGAA ATTAACTGAC 1860 

CTGCAAAATG CCCGACTGGA CGCTTAAAGA CCGAGCCACA AGATGGGTAT TCCAAAGGTT 1920 

GCTTGAGTTC ACGTAGGTGC GTCAAGCGGT CCATTTCCTG CTTGATAACC TGATGGGTTC 1980 

CTGGAGCTAG GGCAAATTTA ACTGACAAGA CAACTGCACC AGACTCCTGA ATAGCTGAAT 2040 

GACGGTAACC AAAAGCCAAG TCTTTAGCAG ACAGGGTTTC GATTTCTCCA TCCTTGGTCA 2100 

AGACCTTACA AGACTGCAAG ATGTGAGCAA TCTCGCCACC ATAGGCACCC GCATTCATAA 2160 

AGACAGCACC GCCAACGCTT CCTGGAATAC CACAAGCAAA CTCAAAGCCA GTTAAACTAT 2220 

GACGGAGGGC AATGCGAGTT GTTTCAATCA AGTTAGCCCC AGCTTCTGCT TCAATGGTAT 2280 

AGCCATCAAC AGAAACGTTA TTGAGCTTGT CACACAAGAT GACAAATCCA CGAATCCCAC 2 340 

CATCACGAAC GATGATATTG CTTGCATTGC CAAGAACCAT CCAAGGGATA TTTTGTTGGT 2400 

TGGCAAATTT CACAACGCGA GCCAACTCAA AACGATTTCG TGGAAAGACC AAATAATCAG 2460 

CCTCTCCACC TACTTTTGTA TAACTATAGC TATGCAAGGG TTCCTTAAAA CGGATATCAA 2520 

TTCCTTCTAA GATTTCAAGC ATTTTTTCTC TTACAGACAT GTCACTCTTC CTTTTACAAA 2580 

ATTCATTCCA TTATACCATT TTTAGAGACA TTTGACGACC ATAAAAATAC CTTGTTTGGA 2640 

TTTTGCATAA GAAAAAGAGG TTCCCCCCTT TTTATGATTT TTTACAAAAG ATTTCCTTGG 2700 

TTCCATAGGC GACCAGAACG AGCTCCAGTG CTAGAATCAC TTCAACCAAG ACTGGATTTG 2760 

TCAACCAGCC TACTTGGAAA AGAGATGGTG CCAGATCAAA GAAGGCATGC AAGCCATAGG 2820 

CTGCTAGGAG ATAAATCCAT TTCTTCTGGC GAACAGCTTG GTAAACCCAA ACTGTCAAAA 2830 

GTAATTGGAA ACCAAGCGCC AAGATTCGCT CAAAACCAAG CAAATAAATC TGCCAGACCG 2940 

AAAGTGACTG AATGGTTTTT AACATATTTT CAGACAGTAA TTGCATAACC TGTGGATTC* ">CC0 

GAGTTTGAAC TGCCGAAAGA ACAATGTAAA GATTGAGTAA ACTAGTAAGG CCTAGAAAAA 3060 

TCAACTCCAA GCCACCATGC CCC 3083 
(2) INFORMATION FOR SEQ ID NO: 139: 

(ij SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 15363 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xli SEQUENCE DESCRIPTION: SEQ ID NO: 139: 
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CCGGAGGATA TTGACCACCA CCAAAAGCAG GGGGAAAATC GAAATCAACC AATAGTAGGC 60 

TACTGCGACA CTGGTCAACT CACTATCTGA TGCTTGATAA TAATGCAAAA AAGCTTTTAA 120 

TAAAGGTTTG TCTATCAGCT CTTTCCACCA CTTTTTCATG TCATACTCCT TCACTTATAA 180 

TCTTATACTC AATGAAAATC AAAGAGCAAA CTAGAAAGCT AGCCGCAAGC TGCTCAAAAC 240 

ACTGTTTTGA GGTTGTAGAT AAGACTGACG AAGTCGATCA CATACATACG GTAAGGCGAC 300 

GCTGACGTGG TTTGAAGAGA TTTTCGAAGA GTATTAACTA ATTTCTTCTT ACCAATTCCA 360 

CCATATCATA CGGTAGGGTA TTGGCAGCTT CCTTCAAGGA ATAGTTCTCT AAGTTATTTA 420 

CATTTTGTCG TAATTTCTTG GCATACTTAG TCGTAATCAA TCGTTTTTCT TCGTATTCGA 480 

AAATCAACTT GCGCTCCAGA TAATAGCCTC TCAGCATTTC ATCGATATTG TTGGGTTTGA 540 

CACGATTGAT AACCCGTTCG ACAAAGGCAC CACTGCTGAT AATAGCTGTT TCTCGAAGAC 600 

GAGACTCCTG CATAAAACTA ATCAAAGAGC GTCTGTAGAC TCCCTTCAGG TTTTCCAAAC 660 

TTTCAATAAT CATCTCTGTA TTGGCAAGAT AGAGCTCTGC AATTTGGTCA TAATCAAGAG 720 

CACGGAGACG GCTTTGCTCC TTGTTCTTCC AGCTACGGAA GGTCTTTCCG AGAGTAAAAA 780 

CTTCATGAAG GAGAAAACGT AAAATCCTCA AGGAAACAAG AAAATAATAG GTCAGTCTTG 840 

AGGCAAGTTT ACGATTGATT CCTTGTTCTA TATTTTTCAG ATAACGTTGG TAAACTCGGT 900 

AAGCACGATT GCTAATGTTC CCCTCTTCAT AGGCCTGTTC CAAACCATCA CTTTCAATAC 960 

TAAGAATCAA GAGTTTCAAA GCAGCCCAGT CTTCTTGATC ATCCTGGTTT TCTTGGCTTA 1020 

AAATGAGATT TTCAATACGT CCATGATAAT TGTCAATAGC CGCATAGAGG GGAAGTTTAT 1080 

TTCTGGTGTC TTCCAACTCT TTTTCCAACT CTAGCGTTAC TTCATTCAAA ATGGCGATAT 1140 

GCATAAGATA ATCCTTGCTT TCTTCCTCTT CATCAGAAAG ATGAGGCAAG ACCAAGAGAC 1200 

CTGTTAAAAA GCTAACAAGC GTCACACCTG CAACAAGGAA AAGCAAAAGA GGATACTCCT 1260 

GTTCTAGATT ACTTGGTATC AAGAGAATCG TAGCAATCGA CACCGTTCCC TTAACACCTG 1320 

AAAAGGTCAA GAGAAACATG TCCTTCATAT ACTTATTTAG CTTTTTCTTG AGGCGTCGGG 1380 

TTCTATAGGC ATAATAGCCA TAGATCATAA TAAAACGAAT GACAAAAAGG ACAAAGGTAA 1440 

GGGCGATAAG AGATAGCAAT AAAAGTAGAG GATTATAGAT TGGATTGGTC AAGATAGGTT 1500 

CTGCTATCAT TTCCAACTCC ATCCCTAAAA TCACAAAGAC AGAACCGTTG AGCATAAAGG 1560 

TCACTGTATG CCAGACCGTC TCGGTCACCG TATCCACTTG GGCTTCGAGG AGCGTGATTT 1620 

TCTTGAAGCG ACTTGCCTTT AAAATTCCAG CAACTACGAC GGCAATAATA CCTGAAACAT 1680 

GAACTTCTTC TGCCAGAAAG AAGGTCACTA GAGGCAAACT CAATTCTAAT AAAAGTTCAC 1740 
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TGGCAATATC CGTTGCGCGC ACACTTAGCA AGAAGGTATG GAGGAAGCGG TTGGTCATGG 1800 

CTGTTAAAAA TCCAATTAAA AAACCGCCTA GGATTGAAAA GATGAGCGAA CTGCTAGCTT 1860 

GCCCCAGAGA AAAAGCTCCA GTTGTCCAAG CTGTCAAAGC TACCTGAAAA GCCACCAAAC 1920 

CAGAAGCATC ATTCAAGAGT CCTTCGCCCT TAAGAATATT GGACACGCGC TTAGGAAAGC 1980 

TAAAACGCTC CGAAAGAGAG GCAAAGGCCA CCAAGTCCGT AGGACCAAGG GCTGCCCCAA 2040 

CAGCCAAGCA AGCTGCCAAG GGAAGGCTGA ACCAAAGAAG ATGGGCCAAG CCACCCAAAC 2100 

TCAGGGTCGA GATAAAAATC ACTGGAAATA TGAGATAAAC AATGATTCGC CAGTGTTTTA 2160 

AAATAGCCGT AACATCTGCT TCTTCAGCCT CTCGGAAAAG CAAGGGTCCG ATAACCAGTG 2220 

CCAAAAACAA CTCCGTATTA AGGTGAAAGT CAGTATTGGG TAAAAAGAGA CCAATCACAA 2280 

TTCCCAAAAG AATTTGCACC AAAGGGAGAG GCAAAAAGGG CAGGAGCTTA TTGGTTGTAC 2340 

TTGAGACAAT CAAAACCAGT AAAAATAGGA TGAGGTAAAT CAGTAATTCC ACGCACGTCC 2400 

TCCTTAATCT TTTTTACAAC AGGATTCAAA TATCTCCTTC TGCTCTTTGA TTTTTTGGTC 2460 

AATCTTGGAA CAGTCTTTGT GCTCAATTTT TCTCTGGCAC CGTTCCATTT CAAGAGCAAC 2S20 

TAATTTTTTC TTGATTTTAA GCATTTTTTT GCTCATATGC GCTTGGTCTA GCACGCCCAT 2580 

CGCTCGTTCG TGGTGGGTTG ATTCAACAAA ATTCTGGCGC ATGGCATCCA GCTTTTCGTG 2640 

TAAGTATTGT TTATCCATGT CTGTATCTCT CTAATTTTTC AATCATCACT AAAAACGGCG 2700 

GGTTGTTGAC TTGGTTTAAA GTTCGGTAAA TGGCAGCTGT GTACTCTTGT TGGTTCAACT 2760 

GGATCACAAA ATCCAAGACA GCATCTCTCT CGAGATCGCC TCCTTCATGA CCATAGTAAA 2820 

TCATAATAGC AATTCGTCCA CCTTTGACAA GTAAGCCACA TAGCTTTTCT AATGCCTCAA 2880 

TCGTTGTCTG CGGTCGGGTG ATGACAGACT TATCAGCTGC CGGCAAATAG CCCAGATTAA 2940 

AAATCCCTGC CTTAGCTTTT ATCACAAACT GGTCCAGTGT CTCATGGCCT TGCAAGATTA 3000 

ACTGGGCATT TGTCAAGTCA GCCTGATGCA AACGCTCTTG GGTCTTTTCC AAGGCTTGCT 3060 

TCTGAATATC AAAGGCATAG ACTTGCTTGG CTAGCTTGGC TAAAAAAAGC GTGTCATGAC 3120 

CATTTCCCAT AGTCGCATCC ACTACGACAT CCTCTTTTGT CACGACCTCA GCCAAAAAAT 3180 

CATGTGCCAT CTCAAGTGGT CTTTTCATTT TCAAACTCCT GTTTTACAGC CTTGCATCCT 3240 

TGAACACTTC CACGACGTCG CATCTCCATC TCAATGCTGT TGAGGACTTC CCATTTATTG 3300 

AGGCTCCACA TAGGACCAAG CAGCATATCC CTAGGCGCAT CTCCTGTAAT TCGATGGATG 3360 

ACGATATGTT TGGGAATAAT TTCCAGTTGG TCACAGATGA CCCTGACATA TTCGTCCTGA 3420 

CTCATCAATT GTAAACGCCC CTCATGGTAA TCTCGTTGCA TACGAGTATT TGTCATAAGA 3480 

TGGAGCAAAT GCAGTTTAAT CCCTTGAATA TCGTTATCCG TGACACAACG GCGGACATTT 3540 
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TCAACCATCA TCTCATGGGT TTCACCAGGC AAAGCATTGA TCAAATGGGA AACAATCTCA 3600 

ATTTTTGGAT ACTTTCTCAA ACGCTTGACC GTTTCCACCT ACAATTCATA AGAATGCGCA 3660 

CGGTTAATCA GGTCAGAGGT TGCTTCATAA GTAGTTTGCA AGCCCAATTC AACCGTCACA 3720 

TGCATGCACT CCGATAACTC AGCCAAATAT TCGATGGTTT CGTCTGGTAA ACAGTCTGGG 3780 

CGCGTTCCAA TATTGATTCC TACCACACCT GGCTCATTGA TAGCCTGTTC ATAACGCTCT 3840 

CGAATAACTT CCACCTTTTC ATGGGTGTTG GTAAAATTTT GAAAATAAAC CAGATACTTC 3900 

CGAACATCCG GCCACTTGCG GTGCATAAAG TCAATTTCCT TATAAAATTG CTCACGGATA 3960 

GGCGCATCCG GTGCCACAAT GGCATCTCCA GAACCAGAAA CCGTACAAAA AGTACAGCCC 4020 

CCATGAGCCA CAGTCCCATC ACGATTGGGA CAATCAAATC CCGCATCAAT AGGGACTTTA 4080 

AAAGTCTTTT CTCCAAAGAG TTTTCGATAA TAATCATTCA AGGTATTATA AGATTTCATG 4140 

ACTTTCATTA TAACAAAAAT CACCCACAAT CTCAAAAGCC TGACTTTCCT ATAAATTCCT 4200 

CTGTTTCTCG TTTCCATTAG CCTTTTTTTA TGATACAATA TGGGTATGAT TTTAATGAAA 4260 

TTAGCATCTA TTTTATTATT GATACTGACC TTAGTCGTCT GCATTATCCT AACCAAACTT 4320 

TTTAGATTAA AAAAACTAGG ACGAAACTTT GCGGATTTGG CTTTTCCAGT CTTGGTATTT 4380 

GAGTATTACT TGATTACAGC TAAAACCTTT ACCCATAATT TCCTCCCTAG ACTGGGGCTA 4440 

GCCCTCTCGA TCCTAGCCAT TATTCTCGTC TTTTTCTTCC TTTTGAAAAA ACGCAGCTTT 4500 

TACTACCCTA AATTTATCAA ATTCTTCTGG CGTGCAGGAT TCTTATTAAC CCTTATCATG 4560 

TATATAGAAA TGATTGTTGA ATTGTTCTTA ATGAAATAGT CGAATCCCTA AGCATTTTCT 462 0 

AGGGATTTTT GCTTTCTCTA CAAAATAGTA TAGACAATAA CACTATACAA TTTTATACAA 4680 

AGAAAAGAGT CTGGGACAAT AGTCTCTTAT ATCCAAAAAG GCAACGGATT TGCCGTTGCT 4740 

TTTTTGGATG GTTACGATAG TCTTGGTAAA ATAGAATTGC CCAATAAACC ATTTAGAAAG 4800 

GCTATCCCAT GCATATTCAC TATAACACAA ATCAAACAAC TTTACCACTA GAAATCAGTT 4860 

CCTTCTTACC ACAAGATCAT CTCGTTTTTA CTATTGAAAA AGTGGTGAAT ACCTTGGAGG 4920 

AACGTCACTT CTACACCTCC TATCATGCCT TTGATCGCCC GTCTTATCAC CCTAAAATGC 4980 

TTGTATCTAC TCTTCTATTT GCCTATTCAC AAGGGATTTT CTCTGGTCGA AAAATTGAAA 5040 

AATGGAAGAG TTAGTGACCT TAGATTGTTT GTTTATTGAC AGAACTAAGA TTGAAGCCAA 5100 

TGCCAACAAG TATAGTTTTG TGTGGAAGAA AACGACAGAG AAATTCTCCG CCAAACTTCA 5160 

AGAACAGATA CAGGTCTATT TTCAAGAAGA AATCACTCCC CTTCTGATTA AATATGCCAT 5220 

GTTTGATAAG AAACAAAAGA GAGGGTATAA AGAGTCAGCT AAAAACTTAG CGAATTGGCA 5280 
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CTATAATGAC AAGGAGGATA GCTACACACA TCCTGATGGC TGGTATTATC GTTTTCACCA '5340 

TACCAAATAT CAGAAAACAC AGACAGACTT TCAACAAGAA ATCAAGGTTT ACTACGCCGA 5400 

CGAACCTGAA TCAGCCCCTC AAAAGGGACT GTATATGAAC GAACGCTATC AAAACTTGAA 54 60 

AGCTAAAGAA TGTCAGGCGC TTTTATCTCC CCAAGGTAGA CAGATTTTCG CTCAACGCAA 5520 

GATTGATGTG GAACCTGTCT TTGGGCAGAT AAAGGCTTCT TTGGGTTACA AGAGATGTAA 5580 

TCTGAGAGGG AAGCGTCAAG TGAGAATTGA CATGGGATTG GTACTTATGG CCAATAACCT 5640 

CCTAAAATAT AGTAAAATGA AATAAGAACA GGACAAATCG ATAAGGACAA TCAAATCGAT 5700 

TTCTAACAAT GTTTTAGAAG TAAAAGTGTA CTATTCTAGT TTCAATCTAC TATACAATAA 5760 

GAGAATGACT CAAAATTAAA AAGCTAGAGT TCCACAATTG GAAATATCTA GCTTTTTTGT 5820 

GGTTGAGAAC TATTTTGTCT CAGGCTCTTT ATCTTCTATT TAGGACAAGA GTTTTTCTTT 5880 

GGTCTTTAAT GATAAAGAAG GTATCAAAAT TTCTAGTCTT CTTTTTTACC TTTAGTAACT 5940 

ACTAATCCTG CACTCAAACC TAGAAGAGTT AAACCTGCTG CTACTGCTGC TTGGCTTGCC 6000 

GCACTACCTG TACTTGGTAA CTGGGCTTTA TTAGTTTGAC TAGCTTCACT TGAATCAATT 6060 

GGTTTTGTAT CTGCTTTTTC TGACACTTGT GGTTTTTTAG CTTCTTGAGC TACTGGTTTG 6120 

GTTCCAACCA AGACGATGCG GTCTGTCGGA ACTTCTACCA CTTCACGGAG TTTTTCTTCC 6180 

TTACTTCCAT CAGGATTAAT CGCTGTAAAG ATACGTTCTT TTCCAACTTT TCCTTCTTGT 6240 

TCTACACGAG TTTCACCTAG ATACAGTGTT GAATCTTTTT TCTCAACTGT CTTGTATGCC 6300 

AAATCTTTTT CAACAAATTC GATTTTTGGA AGATCTTCTT GTACAGCAGC AACTGTCTTC 63 60 

TCAGAAACTG GTTTTTCCTT AGTCAAGTGG ATACGGTATT CCTTGACTTG TTTTCCACTT 6420 

TCTGAAACGA GGCGAACAAG TACTGGAAAG CTATCTTCTC CACTATCTAC CACAGTTGAA 6480 

GCTACTTGAT TGTTTTCTTC AACTGAGACT TTTGGCCGTT GACCTTTATA GGTAATTTGA 6540 

TAGTCTTGAC GATTTTCAGC GAAATCAGCA AGTTCTTTTC CATCTACAAG AATCTTTGAT 6600 

TGAGTGCTTT CTTGAGGCAA TTCACTTGGT GCAAGGAAGG TCATCTCAAT CATCGCAACA 6660 

CCGCTCTTAT CTGCTTTACG CTCCATACGC CATCTCATAG CTTTGGCTTT GATAGCTTTA 6720 

AATGTTACGT TGATTTCATC ACCAGCTGCA ATGTCTTTAT CCGCACGATA AGGAACAGCT 6780 

TCCCAATTTT CTGGATTGTT GAATGGATGG TCTGCGTCGT AGGCTTGGTA GTTTGAATAG 6840 

TAGGTTGGCA CTTCAAACTC TGGACCGACA TAGCGTTCTA AAACGAGTTT AGATGGTGCA 6900 

TCCGTACCAC TATCTGCAAA GAACTGAACT TTTCCTTGTG TAACAGTCCG TTCTACAATC 6960 

TTACCATTTT CACGGAAAAT CACACCCGCT GATACTTCTG GATTAGAAGA TGGTGTTGGT 7020 

GACCAGTTTG TCCAACGACG ATTTTCTGAA TGATCTCCGT CATTGAGATA GTCAACGCGG 7080 
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TCATGAGAGT TTTTGTCAAT ATCATTGGTT GCTGAAGCAA AGGCCTGGTT ACTGTTTTCA 7140 

TCATAGTTAG GGTTATCTGA AAGAGTCTCA CCAAGTTTGT CTGTCACTCG TACAGTGATC 7200 

TCAGCAACAA GGTTACTACC AAGGACACGG CCTCGAACAG TAAATTGACC TGCTTTTGTC 7260 

AGATTTTCCG CTGGAACTTC TTCCCATTCA ACTGTCAGGT CTTTTGTTTC GTAGCCGTCT 7320 

TTACCTGTGA AGTAAACTGG AACCTTAGTC GGCAATTCAA GTGCTTGACC TACTTGTAGC 7380 

AAGCGAGCTT GTTTAACCGC AGCAACTGGT TTATGAGAAA GTAAGCTCTT ATCCTTAGTG 7440 

AAGTGCAGAC GGTATTCTCC TAAGATGTCG CCATTTTCAG CTTTCGCGAT GACACGAACT 7500 

GGCTCACCTT CACGAACGCT TGGAACGACG GTAGCGAGAC CATTGTTGCT AACACTTGCT 7560 

GTGACTGCCG GAACTTTTCC ATCTACAGAC TCAAGGTAGT AGTCTGTCAA ATCAGGGTTG 7620 

AAGTTTGCTA AGTCTTTGCC GTCAACTTGG ATTCTTGTTT GTCCTTGCTT GGCTGCCGCA 7680 

ACTTGTTTCG CAAAGATTTG TACCTCTGTG ATAGACGTTC CACGCTTGTT ATCTGCTTTA 7740 

ACCATGCGAA TACGAACAGC ATAGGTTTCA ACTTTATCAA AGCTAAAGTG GTTCATTTCT 7800 

CCAGCCTTGA GTTGAGCAGG GGCTTTTAGA TTAGTAACTG GTTTCCAGTT GGCAGAATCA 7860 

TTAAAGACAT GGTCCTCATT ACCAACAAAA CTAGGGTTTT TAGGAGCTGT TGGGACAGTC 7920 

TTACCAACAT AATACTCAAT CACATAAGAC TTCGGTACAC CAACTCCATG GTCTTCATGG 7980 

AATCCGACAC TTAGATTATC AACGGAGCGT TTGCTCAAGA TACCTGAATC TCCAAACAGA 8040 

ACACCGACTG AAGCTTCTGG ATTAGTACGA TTCCAGTTTG TCCAACGATT GGCTGGTTGG 8100 

TTATTGTAGG AAATGAGCTT GTCATTAACA TTTGAAACTG GGTCGCTTGG ATTTGAGTCT 8160 

GAAGCAAAGG CAAGTGGCAA TTCTGAACCG GTCCATTGGT CAGAAATGTT TGCACCTTGC 8220 

TCAGTTTGAG CAGATACGCG AACATGAAGT TTAGTTGTTA ATTGCGTACC TTCTAAGCGA 8280 

CCATTAACTG TAAAGACACC TTCCTTAGCG TATTGCTCTG GACGAATCGC ATCCCATGCA 8340 

ACCTTAGCTG ATGAAACGTG ACCATTTGAA TCATATGTCC GAACACTTTC TGGTAATTGT 8400 

GGTGCTTCTG CGATTGGAGT TGTCACACTG ACTTCTTCAA CTGAAACGAT ACCTTCTACA 8460 

GAGACTTTTG CACGCGCTTC AAGGTCAATT CCTTCAACTT TACCTAGTAC TTCAAATGTT 8520 

TGATAGGAGT CTAGTTTTTC TTTCGGAATA GCTTGCCAAG TGACTTTATG AGTTTTAGGG 8580 

AAACCTTTGT CATACTCAAC TGTTACTGTT GCTGGAAGAC TTGGTTCCTG ATGCAAATCT 8640 

GTCACTACAT TTACAGGACG GATGGATTGC GCAATCTTCT TCTCAGTATT GGCTTGGATA 8700 

GTGAGTTCAA CTTGGTCTTT AGCTCCCTCA TATTCAGCGT TCAGAGTGAC TGCTCCTGGC 8760 

TTATGCAACT CAAGCATTCC TTTACGAATT GCGACTTCCC CTTCACCACT TGTAGAGAAG 8820 
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GTTACTTTAT CAGCTGGTAA TACAGCTTGC GTTCCATCTT GATAGTGAGC TCGAACCGAC 8880 

AATTTGACAG TTTGGTCTTC TTTGAGACTG TCAGCTTTTT CCACTTGCAA GCTCAAGTGA 8940 

GCAATTTTTG GCGCTTCTTC AAGGAATTGA ATTGCATAGG TTTGAAGAGG GCCACCATCT 9000 

TTAGGCTGAA TAAAGATGCT CGCACGCATG CCGTTTGCTG CGCTTGCTTG AAGAACTGTA 9060 

ACAGCTGCAT TTTTAGCACT TGCTGTGACT TCTGGCAACT TAGCTCCATA AGCAAGAGTG 9120 

CGGTATTGCA TTGGTTTTTG ACTAGTAAGA CCTGTTACTG CCTCACCACC AACCGTTACA 9180 

GTTGGTACTG CAGGTGCCGC AGGATTGCCT TCTTCTACCA CAAGGGTTGC ATGAATTGGT 9240 

TGACCTTCTA AATAACCGGT CGCTTGAATA CGAGAACCTG GAATTGCTAA CTTAGCTTTA 9300 

TCTTCTTCGG CAATCTCCCA CTTGTCCACT TCATACTCTT CAACACTTCC ATCAATCAAA 9360 

ACATAGGAAA CAGATTTGTC TACAGAATTC AAGTCAGTAT TTGGAGCAAT ACGTTTCACA 9420 

ACTGGTAGCT CTGATTTAAG AGCAATCACT TCTACACGAG CTTCTACTTC TCGTCCGTCA 9480 

GCCATACCTT TCACCGTTAC AATACCAGGC TTGCTCACAT CTACTGAAGA CCAGGTTACA 9540 

GGACGTTCTG CACGGCTACC ATCACTGTAT ACAAACGGAA CAGTGGTAGG CATTTCAGGT 9600 

GCCTCTCCAA TAATGGTCTG TACTTTTGGC ACTTCTGTCC CCAAAACAGT CTTCTCTTGT 9660 

CCTTCTTTCT TACCAGTAAA GACAGTGACT TGGTTCGATT TCAAGAGATC AGAGTGGGCA 9720 

GTCAGGGTGA ATTTCCCTGC TTGTTCAGTT GATTTGACAA TGGCAACACC TTTACCATTA 9780 

AATGCTTTAC GAATCCAAGA ACCATCTGCT TGCGCCTTAT AGCGTTCACG GCTGGCTTGT 9840 

TCTCCGTTAT CTACACCGAC CAGTTGACCT TGGCCATGCA ATTGGAAGCG AACCAGATTA 9900 x 

TTAGCAGTTG GAACCACATT CCCCTGGCTG TCAACAATTT CATAGTAGAT GTAAGTCAAG 99 60 

TCTTTTCCAT CTGCTGCAAT CGCATGGTCT TCCTTAATAA GACGAACTGC CGCTGGCTTA 10020 

CCAGCAGTCG TAATCTTATC TCGAGCAATT TCCTTGCCAG ATTCATCACG AGCAATTGCT 10080 

TCCAAGGTAC CTGGTTGATA GGCAACTTTC CATTCAAGAT AAAGTTCATT AGCATTTGCA 10140 

CCTTCTTGGT AAGTCCGCCC ATCGCTGGTT TGTTTTTTAT TGAAAGTCTT AAGACCAAGA 10200 

GATTTTCCAT TCAAGAACAA TTCTACACTA GAAGCATTCG AATAAGCACG AACTGGAATC 10260 

TTACCTTCTG AGTCAGCTAC TTTGGATGCT AATTCTTTGT TTTCCCAGTT CCAGTGAGGA 10320 

AGAAGGTGTA CCATCGGTTT CTTCTTAACA GAAACCCATT GGCTTTGGTA GAGATAGAAG 10380 

TCATGTTTTG GAATGCCGGC TGTATCTACG ATACCAAAGT AAGAGCTCTT AACAGGAGTT 10440 

TGATTTTGGT TGTGCCATGG TGTAGGTTCA CCAATATAGT CCGTACCTGT CCAGATAAAC 10500 

TGTCCAGCAT AGCCAGCGTT GTCACGGTCA AAAGTCCATG AAGCGGTTGC TGTTTTCCCC 10560 

CAACCCACAC GATCATTTCC ATAATCTGAC TGTTCATAAT TACGCTCAGG TCCATTGCTA 10620 
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TGTTTCAATT 
TCTGATCCAT 
GAATAGTTAA 
TTACCGAAAC 
ATAACCTTAA 
ATTTCATTAC 
GTACGTAGGT 
TTTTCAAAGA 
TCTTCCTGAA 
TTGTGGGTTG 
TCTGCTTTAT 
TGGAATTTAA 
CGGTAACCAA 
ATCAATTCGT 
TCTAAAATCG 
ACTAAGCCTG 
TGGTCTTTGT 
TGTTGTTCTT 
TCTGTCACTT 
GGCTGTTTGT 
TTGGTGATAT 
TTGACATAAA 
AGGTCTTTTT 
CCACCTTCAT 
GGTAAATCTA 
TTTGCATTGA 
TGATTCACTT 
GAAGCTTGTG 
TTTGGAGTTA 



CACGTTCAGG 
AAATCAACCA 
ATCCAACAGC 
GGAATTTATC 
CCAAACGTTT 
CAATTGACCA 
CAAAATCAGA 
AACGTCCATA 
CGAGTAAACC 
TACGGATGGA 
AGTTTTCTTC 
TACGTTCACC 
ACAAATCCTT 
ACAAGGCAGG 
CATCTAGGCT 
TTACAGCATG 
CGTCCGTATT 
CAAGTTTTGG 
GTAAAGTCAC 
TGACTGCATG 
CATATGAGAA 
CTTGAGAATC 
CATCTAGTTT 
TTTGTGCAGG 
ATTTTTTCCA 
GTTTAAAGTA 
CTTCATTTGT 
ATTCTATCCT 
CGGCTTCATC 



GCGATAGTAA 
TTTTGGATGC 
ATCGAGTTCA 
TGCTCCCATG 
AACAGTTGCT 
CATGAAGATA 
CCATTTTTCA 
GTCATAAGGT 
TAGTTCTGCT 
GTTAACTCCC 
TGCTCCAAGC 
ATTCAAAGAG 
CTTAGCATCA 
TTTGTCATTT 
TGTTGATTCA 
ACCACCTCGT 
GACGATTTTG 
TGTTAAAATA 
ATCACGATAG 
GACAGCAATC 
CTGGTTATAA 
CATGTAGACG 
GAAAGTCTTG 
AGATTCATGA 
CGTAGATACG 
CCAATTTTGA 
TACAGCTTTA 
TGGAGCTTTT 
TTCTTTCTTC 



937 
CTTCCACGTG 
TTAGCTCTAA 
TCAGCAATTT 
GTAACATAGC 
AAAGAGTGGG 
GCAGGGTTGT 
CCTTTTCGAG 
TTCTTGCCAC 
GCGATTTGCA 
ATCTCCTTCA 
GCCCCATGGT 
AAACCTTCAT 
ACCAATTGAC 
AAAACAGTCC 
TGTGCTTTTA 
TCAACGATTT 
CTGGTCACAT 
GTTGTCCCAT 
ATACCACTTC 
ACATTCTCAC 
CCATTTGGAT 
CCATCAAAAG 
CGATACCAAG 
TCGAAATCGT 
TCTGCATCAG 
TTAAAATCCA 
GCATCTTCCT 
TCTTCCGGTT 
TCAGATGCAA 



TACGGGTAGC 
GGGCTTTGTA 
TCTCATGCCC 
GAGTCTTATC 
CATCACCATT 
TTTTGCCTCT 
CTTCTGGGTG 
CATACCACGT 
AGGTTTGCTC 
TTTGTTTGAG 
CGTGGTGCAA 
TTGGAGTCCA 
CGTCACGGTA 
AGAGTTTTGG 
AGGTACGACT 
GATATTCGGC 
GAGTTTCAAC 
TTTTCTCAAC 
CTGAATACCA 
GACCATCTTT 
AATGCCCCAC 
TAAGGCGAAC 
CTTCCCCACC 
TAAAGATACT 
GTTTAATGGC 
CTTTCCTGTC 
TGAGCGGTTT 
TAGCAGACAC 
TAGCCTCAGT 



TGAAGATGTT 
ATTATCTTCA 
TCCGCTACCA 
AACATCCTTG 
AGCTTCACCT 
TTCGACCATG 
AGTGGCATCT 
ATCAAAGGCC 
ACTAGCAGGG 
ACGGCGATAT 
GGATACTCCA 
GTGATAGTAA 
AACACGCGTA 
TCTTTCAACT 
CGCTGTACGA 
TACAAGTTCA 
CTTGCCATGT 
ATGCACCTTA 
ACGGCTACTT 
TTGAAGGTAT 
TAACTGACCA 
ATTTTTCTTG 
GTTGAGCTGT 
CCAGTCATAC 
TTCCTTAGAA 
TTCAATCATT 
TTCTTGATTT 
TTTTTCCTCT 
TGAACTAGGT 



10680 
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
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TCACTTTGTT CTGTCCTTTC AACTATATTT TTAGTTTCCA AAGCTTTATC AGCCTTTTCT 12420 

TCTACTATCA TTTTTTCCTC TTTAGGTTTC TCAGCAGTAT GAGTAATAAG TGTTTCATCC 12480 

GCATAAACTA CAGATTCTCC AGCTATATTT CCTCCTAATA AAACTGCACA AGTCCCAATC 12540 

ATTACTGAGC AAGCTCCCAC AGCAAACTTA CGAATGCTAT AAACTCTTTT CCGATTCCAA 1260O 

TGGCCTTTCC CCATAAAACC CTCCTTATAT TATATTTAGT GCAGTTAGCT ACTACCAAAG 12660 

CCCAAGTGGT ATACATGGTA TGACAACCTA GTTTCAACAA TTTACACTCT GCGAAAATCC 12720 

AATTCAAACT TCGTCAGTGT CGCCTTGCCG TAGATATGAT TACTGACTTC GTCAGTTTCA 12780 

TCTACAACCT CAAAACCATG TTTTGAGCTG ACTTCGTCAG TTTCATCTAC AACCTCAAAA 12840 

CCATGTTTTG AGCTGACTTC GTCAGTTTCA TCTACAACCT CAAAACCATG TTTTGAGCTG 12900 

ACTTCGTCAG TCTTATCTAC AACCTCAAAA CTGTGTTTTG AGCAACCTGC GGCTAGCTTC 12960 

CTAGTTTGCT CTTTGATTTT CATTGAGTTT ATATTTTATA GGAGCGCATT ATTTTGCTTT 13020 

TGCTGCGTAC TCTTCGTTAC GTTTGATCAT TTGTTTTCTG TACCAAGCAA AGATACCGAT 13080 

ATAGAATACA AGGAAGACTA CTGCACCAAG GATTGCTTTG ATATCACCAG TTGTAGTGTT 13140 

ACCAATTGTC CAACCAAGAA GTTTTTCGAT TGGTCCTTCA AGAGTAGAGT GAGTAATCAA 13200 

TTGAGTTTGG CTCACACCTT CTGGGAAGGC ACCTACACCT TTAGCAAGTT CTGTTGCAAA 13260 

TGGTGCAATA AGTGTACCTG AAAGAAGGAA GAGTGGCAAC AAGAGTGTTC CGAAGATAAT 13320 

CATACGGAGC AATTTACCAC GAGTTACAAC CAAGAGAGCT GGAGTAACAC CCATAGCGAT 13380 

GATACCTGCA AGTGGCAAGA TACCATTTCC AACTTTTGAA AGAAGCACTG CTTCAATCAA 13440 

CATGATTGGT GCAAGTACGT TGGCACAAGC CCAGATTTCA GCACGACCAG CGATGAATGG 13500 

CCAGTCAAGA CCGATATTGA ATTTACGTCC TTGAAGACGT TTAGTAGCAA CGTTTGTAAT 13560 

ACCTTGTGAT AGTGGTTCTA CGGCTGCGAT GAACCATGAA CCGATAAGTG AGAAGAGTTC 13620 

CAAAGATACA CCGGCAGTCA AACCAAGAGA CAACCATCCT TTGATAACAA GACGCCATTT 13 680 

ATCTGCATCT GCAACACCTG CAATTGGATG TGGAGTTCCC AT AAT AC CG A TAACGATACC 13740 

AAGGATGAAA CCGATGAAGA ATTTAGATCC CCAGAAACCG ATTTTCTTGT TCAATTTAGC 13800 

AGCATCAAAG TCATATTTAT CAAGGCCTGG GAAGAATTTT TCAAAAATCT TATCCAAAAC 13860 

CATGATAACT GGGTTCATCA TGTAGTTCAT GTGAGTTGAT GTCATTGGTG ATGAACTTGG 13920 

GGCGTTAAGA AGGTCATCAA ATGTAGGTTT CATCAAGTCA GAGTTGATAA TTTTCAACAC 13980 

ACCGACAAGG ACGATAGCTG CTGTAGCAAT AAAGAGTGAA ACCCCTTGAC TCACACCATT 14040 

GTTATCAGCA TACCATTTAA TCAAGAGACC TGTGATAGAC AAGTGCCAGA TATCAAAGAT 14100 

ATCGACATCA AGTGTATCTG TTTTCTTCAT AGCTAGCATC ACTATGTTGA CAATCAACAT 14160 
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GATGAGCAAG 


AAGTATAGTG 


TCCAAGCAGA 


ACCCCAAGTG 


ATTGTAGCAA 


GTGGTGCCCA 


14220 


ACCAACGTCG 


GTAATACTCA 


ATTGGATACC 


AGTGTTTTCA 


ACGAATTTTG 


CTAGTGATGC 


142B0 


TGAGAAAGCA 


GTGTTTAGCA 


TACCGATGAT 


AGCACCGATA 


CCTGTAAGAG 


CGATGGCAAG 


14340 


TTTGATACCA 


CCTTCAAGCG 


CTTTGGAGAA 


TTTCACTCCA 


AAAAGTAAAG 


CCAATACTGT 


14400 


CAAAATGATT 


AACATGATGA 


CAGGTCCACC 


CATTTCTAAG 


ATGGGATTGA 


AAACCTTTCC 


14460 


GATTAGGTCA 


AAGATTGCAT 


CCATAACAGT 


TCCTCCCTTT 


TTGATGTTAT 


ATGAATGTTA 


14520 


ACAAATTAGA 


ATTAGCTTAA 


TCCGTGTTCT 


TTAATAGCTG 


CTTCAATATT 


GTCAAATACT 


14580 


GGAGCGCTCA 


TTGCTGGGAT 


ACGGAATAAG 


ATTGGCCCAG 


CTTCGATAAC 


TGGGATACCT 


14640 


GGTTCAAAAC 


CAAGGTCTGT 


TGCAGCGATT 


GGTGTAAAGA 


TATCGTAACC 


TTTCATAAGG 


14700 


TCTTCGTTTA 


CATCTTTCAC 


CATGACTGCA 


TCACAGTGAA 


CATCATAACC 


ACGGTTTGAA 


14760 


AGTTCTTCTT 


CTAGAGCACT 


TTTAATTTGG 


TGACTTGAGT 


TAACACCTGC 


ACCGCAGGCA 


14820 


GCAAGAATTT 


TAATCATTTA 


GATTTCCTCC 


GATTTTATTT 


TTTAATAGAC 


AAGATTAAGC 


14880 


GGTTGCTTCA 


GCAATGTAAG 


TATAAAGGGC 


TTCTGGTTCA 


GAAATTTTTG 


ATAGGTCTTC 


.14940 


AAGATGACCA 


TTTCCTGTGA 


AGAAGTCCAT 


TAACTGAGCA 


AGAATGTTCG 


TTTGACTTGA 


15000 


ACTTGAATTA 


TTAATGATAA 


AGAAGAGTAG 


GGATACTTCT 


ACTTCCTTAT 


CAGGAGCTAT 


15060 


CATATTGTGA 


AAAGTTATTG 


GTTTTTCTAA 


TCGAACAACC 


ACCACTTTCT 


CAGCTAGATT 


15120 


ATGAACAATA 


TCTGTGTGAG 


GAATCGCTAC 


ATTTGGCAAG 


TCCTTTCCTA 


GAAATTCCAT 


15180 


ATCTAAACCA 


GTTGGAAATG 


ACTTTTCACG 


CGTGATCAAG 


GCTTCACGAT 


AAGTTGGAGT 


15240 


GACAATTTCT 


CGTTCTTCCA 


ATAAAGTTGC 


AACCTGATCA 


AAGAGTTGTT 


CTTGACTATC 


15300 


CGCTTCTAAG 


CAAAACACAA 


GGTTTTTGTC 


AAAGAAATAA 


TCTAATACCA 


TAAGTTTTTC 


15360 


CGG 












15363 



(2) INFORMATION FOR SEQ ID NO: 140: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 28882 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 
TAAGACTATT TAATAGTGGA GTGAAATAGG ATACGAACAA ATTGATTAGG AAAATCAAAT 60 
GAATTTATAG AAATCTTTTA GCAGTTATGT TATCCTATTC TAGTTTCAAA ACGCTATAGA 120 
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AGCAGCATTG TGCTAGTCkA GATTCAGTTT ACTATACTAA AACGAGTAGC TTGAAATCAA 180 

AAAACCCACC CTCACAGGCA GGTTTTATCT GTATTATTCA GCTAGATTAT GCTTTACCTT 240 

CTGAACCGAA TACGTCGATA CGTTCTTCAA CCGATGCTTG GATAGCTTTT ACACCGTCAG . 300 

CCAAGAATTT ACGTGGGTCG AAGAGTTTTT TCTTGTCGTA TTCTGCTTCG TTTGCTTCGT 360 

AGTCACGAGC AAATTTACGA GTTGCGTTAG CGAATGCGAT TTGGCATTCT GTGTTAACGT 420 

TAACTTTGGC AACACCAAGT TTGATAGCTG CTTGGATTTG CTCATCAGGA ATACCTGATC 480 

CACCGTGCAA TACGATTGGG AATCCTGGAA GAGCTTCTGT CAATTTTTGC AAGTGGTCAA 540 

GGTCAAGACC TTCCCAGTTT ACTGGGTAAG GACCGTGGAT GTTACCGATA CCAGCTGCCA 600 

AGAAGTCGAT ACCAGTTTCA ACCATTGCTT TAGCGTCTTC GATTGGAGCC AATTCACCTT 660 

TACCGATGAT TCCATCTTCT TCACCACCGA TAGTACCAAC TTCAGCTTCT ACTGAGATAC 720 

CTTTAGCGTG TGCTTTTTCA ACAACTTCTT TAGCCAATTT AAGGTTTTCT TCAACTGGAA 780 

GGTGTGAACC GTCAAACATG ATTGAAGTAT AACCAACTTC GATACACTCA AGTGCATCTT 840 

CGTAGTGACC GTGGTCAAGG TGGATAGCTA CTGGTACAGT GATACCCATT GATTCAACAA 900 

GGTTAGCGAT CAAGTTGCGA GCAACTTTGT AACCACCCAT GTATTTAGCA GCACCCATTG 960 

AAGTTTGGAT CAAAACTGGA GCTTTTTTAG CTTCTGCTGC GCGCAAGATA GCTTGAGTCC 1020 

ACTCAAGGTT GTTTGTGTTA AATCCACCAA CTGCATAACC GTTGTCACGG GCTGCTTGGA 1080 

CAAATTTTTC TGCTGAAACG ATTGCCATTT TATCAGGCCT CCTGTATATT TTTATGGGTC 1140 

ATCCCATTTA CATTGTTCAT TTTATCACTT TTTGCCAAAA AAATCTAGTT TTTCCCGCAG 1200 

TTTCGATTGA TTTTCTTCTA ACTCCATCTA TGTAAACCCT TTCTCTCCCT AGTCTTGGAC 1260 

GACTTTTGGA AAATCTATAA AGAAGGTTAA ACTATTCTCC TCCATCTCGA AACGATAAGC 1320 

T AATTTTT C A TGTTCTAATA GACTCTTAAC CACAAAGAGC CCCATACCAG ACCCCTTGAC 1380 

CTTGCGACTG GCATTGTCAG AAAAAGACTG GGCTAGTTTT TCTTGTTCCT CTGAGCTACA 1440 

GCTATTTTCG ATAAAAAGTT CTCCTTCTCT TTCTCCAATT CGAACTAAGC CACCTGGAAC 1500 

AGAGTGCTTA ATGGCATTGC TGATGAGATT AGAAAGAATC AACTTCATAA CTGATGGGTT 1560 

TAGATAAGCC TGCTGATGGG TCAAACTATT GTCTATCTGG AGCTCTCTTT CCTTGGCTAG 1620 

CAAGGCATAA TCTTTGACCA GATTTTGCGT CATCTGGAGG AGGTCAATTG TTTCCCTATC 1680 

ATCTCGCAAT TCCTGCACAG AAGAGAGGGA AAGTATCTGC AGAACATGGT GATTGAGTTC 1740 

ATCCACAATC CCCAAGGCAA CTCCCAGATA CTGGTCTCTA TCCTTATAAC GACCGATATT 1800 

CTCTCTCATA TTTTCGATTA GGATTTTCAA ACTAGCCAGC GGTGTTTTCA ATTCATGAGA 1860 

AGCTCCTCGT AGGAATTCGA CCTTCATCTT CTCCAGCTGG AGAATGGCTT CATTCTTTTC 1920 
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ATGCAAGTCC GCAATAACAG TCAAGAGATG CTGGTAGAGG CTATTGATTT GTTCCTTGAG 1980 

ATTACCTATC TCATCCTTAG AATCCACGCG CAATCGCACT TGGGAATCCA GGTCCATCAT 2040 

CCGACGGGTC ACCCGCTTGA TTTCCAAAAT CGGTGCAACA ATAGTCCGAG CGTAGATGTA 2100 

GGCCACCAAA AGGGAAATCA GAAAGGAGGC CAGCAAGGTA TAGGGAAGAA ACTGGAGACT 2160 

GATTTGCTCC GCTTCCTTTT GTAAATCCAT GGAAGCTAGA AACTGGAGAA TCATAGTACC 2220 

ACCGTCTTGC GTTTTCACCT CGCGCTCCTC AATAAAGAGA GAGGTTGTCT GGCGGTCTGT 2280 

GTCCAGAGGA AGACTGTCCT TGACTTCTAA CTTGTCCTCG GTCATCTCAC CTTTGACGGT 2340 

CCCCTTGATA TCACTAGTCT GGGAATACAA GTCTAACACT TGCTCGATAC TCTGCCTATC 2400 

TTTCCCTTCT AGGGACTGGG CAATGGCTGT TGCCTTTTGA CCAATGGTTT CCTGACGATG 2460 

ACTCAGATAA GTCGAAGGAA AAAGAAAATA AATAGCTAAA TGAAGGCAGA TAACCAGAAC 2520 

ACTAAATATC GAGAAGGTAT AGATAAATAT CTTTGCAAAT AAACCTGTTC GTTTCATTTT 2580 

CGCTCCAATT TATAACCAAC ATTGCGCACA GTGAGGATAC AATCCAAGTC TAGCTTTTTC 2640 

CGCAATTCCT TGATATAAAC ATCAATAACA CGGTCAAAGG GAACCTCATC TGTCGCTTTC 2700 

CAGACGGCAT CGATAATCTG AGATCGAGTC AAGGCCCGGC CTTCATTTTT CACTAGATAG 2760 

TCCAGAATTT CCAACTCTTT GGCATTGATA GGCACTTCTT GACCTGCGAG GCTTGCACTG 2820 

TAGCTTTCAA AGTCCACCTT GGTATCCTTG TAAGAAAAGA TTCGTCCTGT ATCGTAGTAG 2880 

CGCTTGAAAA TCGCGTCCAC CCTCACTTTT AAAAGGGAGA GGGAGAAAGG TTTTTCCAGA 2940 

TAGCCATCTG CCAAAGAGGC AAAGGCACTC ATCTTGTATT CCTCATCTTG AAAAGCTGTC 3000 

AACATCAAGA CAGGAACCTG ACTGGTTTTA CGAATCTCAG CTAGGACTTC TAAGCCGTTG 3060 

AGCTTGGGCA TCTGGATATC CAGTAAAACC AGGGCCACCT CATAGCTAGA AAATTGCTCC 3120 

AGAGCTTCCT GACCGTCCGC TGCCTCAATA GTTTCATAGC CACAATCCGT CAAATAATCA 3180 

CTGACCCCCT CACGGATCAT CTCTTCATCT TCTACAATTA AAATTTTCAT ACTTTAACTG 3240 

CTCTCTATTT TTTATTTTTC TTAGAATAAA TACCTACCCT ATTTTCTATT ATAGTCTCTT 3300 

GCTGGCCTTT TGTCTGCAAG CAACTGACCA CTAGATAAAA CGTTGTGAAA TTCCTTTCTC 3360 

ATAAATTCCA TAACTTTAGT ATATTATATT TAAGCACTAA AGTACAAAGA AAGCAACTGA 3420 

AAGCAATGAT TTTCACCACT GCTTTCGGAT TTATTTTGAA TTGTTAAATA GCCATTCCTA 3480 

TCCACTATTC TTGAATAGAA ACACAAGATG CAATCTTTAT TCTAGACTCA TTTTTTCAAA 3540 

TTTATTCACC ATCCAGCAAG AGCTCTTTTG GTTGTTTTCT AAGGAGATTG CTTGAAGCAA 3600 

GCGCCATAAC GAGAACCACT AGAACCAAGG CAAGGACAAA AATGATGATA AAGTCTGATG 3660 
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TCTGAATGGA AATGTCTAGG 


CTCGACAAGG 


TCTTGCTAAA 


GCCATCTACT 


TCTGCACCAC 


3720 


CACCAAGGTT AGAGGCTTGA 


GCCGCCTTAC TAGCCTGTTT 


GGCAACACCT 


GAAGTCACAT 


3780 


TGGCAAGGAC 


AGTGTTTCCA 


ATTGCACGGG 


CAGTGTAATT 


AGCTAGGAAG 


TAAGCAGAAA 


3840 


CTAGAGCAGG 


GATAGCAATC 


AAGATAGATT 


CGGTGATGAA 


TTGACCCAAG 


ATACTTGCCT 


3900 


GCTTGAGGCC 


GATAGAGAGG 


AGAATTCCCA 


CTTCCTTGCG 


ACGGGCGTTG 


ATCCAAAGGC 


3960 


TGAGCAAGAG 


GGCAAGGAGG 


AGAACTGAGA AGCTCAAGCT 


ACCCCAGAAG 


AGGAGGTTGG 


4020 


CCATCTTGTA 


CATACCAGAG 


ATAGATTGCT 


CAAGAGCTGG 


GTAGTTAGAG 


GAGCTCTTGA 


4080 


CGAGTGTGTA 


GCTCTTCCAG 


TTGATACCAC 


TGATGCCATT 


CAACTCTTTC 


ATAACATCAT 


4140 


CCAAGTTCTT 


GTCTGCTGTT 


ACAAAGAAGG 


TTGCGTCCCC 


ATAAATGGCT 


GTGTCTTCTG 


4200 


TGTATCCATA 


AAGTTTTGCA 


GCAGTGTGAA 


TGTCTGTAAT 


AGCTGTGTTT 


TCGTAAAGTT 


4260 


CTTGTGAGTA 


GGTTACTGCT 


GACTTATTAT 


GACCATCAAA 


GAGTCCCTTG 


ATTGTCACTT 


4320 


CAACTGTTTC 


CTTGGCTCCT 


TTTTCATTAT 


CTGCATCGTA 


GATATTAGAG 


TCCAGTTTAA 


4380 


CCTTGTCCCC 


TACTTTCCAG 


CCGTGTTTGG 


CTGCCAAGTC 


CTTGTGCAAG 


AGGATTTTAT 


4440 


CCTTGTCGTC 


GTTGGTTAAG 


TGCTCTCCTT 


CGACTAGTTT 


ATAAGAACCA 


GAGACAAACT 


4500 


TGTCTTCTTT AGAGGAGTCA 


TTGACACCTG 


TAATCATCAA 


GCTACTTCCA 


AAACGCTTGG 


4560 


CACGATCAGC 


AGTGAGATTC 


TTCTTGGTTT 


CTGGCGTTTC 


AATCAGGTCA 


TATCCAGTCA 


4620 


AATCTCCGAT 


AGCGTTGATA 


CGTTTGACAT 


AAGACTCAAT 


GGCCTTGTTT 


TCGGTGATTT 


4680 


TTTTGATGTC 


TTCACCCTTG 


ATATTCCCAG 


CACCACGAGG 


CGTTCCTTGG 


TTGACGCGAC 


4740 


GATTGATTTG 


CATGGAGAAG 


CTATTGGTGA 


TATTTTTAAA 


GGTCTCCTGA 


GAAGCCTTGG 


4800 


CAGTAGCTCC 


CTTGATTGAC 


AAGCCGACCA 


AACTCAAGCT 


CGCCATGAGG 


AGAATAATCA 


4860 


GGAAGATGAC 


AATCGATTTG 


AAAAACTTCC 


TTGTAACATA 


GGCAAATGCG 


TTGTGTAACA 


4920 


TAGATTCCCT 


TTCTAGATTT 


TGTTTTAATC 


ATTCTATTAA 




AATT ATT T AC 




TAGTATTGCG 


CGTTTCAGTC 


AGTTTCTTAT 


CCTTTAATTC 


AAGTGTAATA 


TCTGACGCTT 


5040 


GTGCCACTTC 


TTTACTGTGA 


GTTACGACAA 


TCACACATTT 


ACCTGTTTTC 


TGGGCAAGTG 


5100 


ATTTGAGTAG 


TTCGACAATA 


TCTCCAGCAG 


TTTTAGGATC 


CAGATTTCCT 


GTTGGCTCAT 


5160 


CAGCTAGAAT 


AACTGGAGCT 


TCTGAGACCA AACTGCGAGC 


AATGGCAACA 


CGTTGCTGTT 


5220 


GACCACCTGA 


TAACTGGAGA 


ACATTCCGCT 


TGATCTGGCT 


TTCATCCAAA 


CCAAGCTCAA 


5280 


GAAGTGTATT 


CTTGCTTGCC 


TTTTTGTTGA 


CCAATCGGAT 


ATTTTCCAGC 


GGAGAAAGAT 


5340 


AATCTATCAA 


GTTATAATTT 


TGAAAGACCA 


GGGAAATATG 


GTGCATGCGA 


TGGTAAGAAT 


5400 


AGCCCTTCTT 


ACGAATATCC 


TCTCCTTGAA 


AAAGGATAGA 


ACCTTCAACA 


GGACTATCTA 


5460 
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GACCAGCAAG TAGGGACAAG AGTGTGGATT TTCCTGCTCC TGACTCCCCA ATAATACTGT 5520 

AAAATTTTCC GGGTTCAAAA TTATAATTGA TCTGATATAG GACTGCTTCA GCAGTATTCT 5580 

TATAACGGTA GGTAACATCT TGTAATTGTA ATAAAGTCAT GATTTCTCCT TCTTAACTAA 5640 

TAGATGATAA AATTTCTTTC GGTGATTTTC TAAATAAGAA TAGGAAACAA AGGGCTACAG 5700 

ATAAGCAACT AAGCAGAACT AGAAAAACAT AGGATTCTGC AAAAGATAAG ATGCTAGTTG 5760 

ATAAACTGCT TGCTTTGGCT AGTGTATCTT GTAAGCTTGC CTGATCTCCA CTTGCTAGTA 582 0 

GAGTTTGGAG TAGGTAAGTT GTGATTGCGT TTCCTGCAAC AAATGCTGGA AGCAAAGCTC 5880 

CAAGAGATAC CAAAACTACC TCTAAACAGA ATTGTAGGAA GATCGAGCTC TTGCCTTTTC 5940 

CAAGTGCAAG TAAAATCCCC ACTTCATAGA CCCGTTCTCT CAACCAGAGA GACAAAACCA 6000 

GAATTAAGGC TCCAGCTCCT GCTATCAACA TCCCATAAAG GAAGATGGTC AGGAAGGTTT 6060 

GGAAAGTTGC AACTGAGTCT TTGATTTGTT CAAAAGCCTT GTTTTCCTTT TCGACTTGGT 6120 

AGCCTTGATT TTCCAAGGCC AAGTTTTCTA CCTGCTTCAT GAGTCCGTCC ATTTCCTTAG 6180 

GATTTTCTAC ATAGAAGCGT GCTGCACTGA CTTGAGCTTC ACTATTGCCC AAAAGGGTTT 6240 

GGCTACTTTC ATAGTCTGTA AAGACTTGAT TTTCACTGAA GTCAGAAGAC AAGCCTGTGA 6300 

ATTTCTCTTG TTTTTTACCA GAAAAGATGC CGATAATCTC AAACTCTACT GTTTGTCCTT 6360 

TTCCAGATTC AGACTGACCA GCATCCAAGC CAATCTTGTC ATGAAGCGAA AGACCGTTCT 6420 

TCTTAGCCAA TTCTTCGTGG ATAAGGATTT TCTTGGAATC CCCTTTTTGA AGGTGTCGCC 6480 

CTTCTTTTAG ATTGAAAGCC GAACTGGTAA AGGTTACATC CTTGGATGAA TCCTCAAGAG 6540 

CCGTTAAGCT AACCAAGTTA TTGTCTGCAG CTGATAAATC ATCACGCTCC ACGCTCTGCT 6600 

CGCCAGTCAC TGCTTCCTTG TCTTTTAGTT TTGCGACCGT CTCAAGTTCA GGAGAGACAT 6660 

TTTCCAGCCC CTTAATCTTG CTTACAGATG CTAGGTCTGA CAACTTGAAT GTCTGACCAT 6720 

TCTCTATCTT CTTAATAGAA AAAGATGTAT TGAGTGATTT ATAAAGATTG CTTTCTACTG 6780 

TTTTGTTGGA CTTCATCAGA GTCAAACAGG CTGAAATTCC GGCCAATAAG ACCAATAAAA 6840 

TCAGAAATAA AATAAAACTT CTCAGTCGCT TTCTGCTGAC ATAAGCCCAA GATCTTTGGA 6900 

TTGGATTCAT TTGTCACCTC CATATTTGTA AGACTATTAT AAAACCCAAA TATGAAATAT 6960 

TTATGAAATA CGAAAAAAAA ATATCGAGTA GGGGATAATC TCTAGCCCCT CTCACACCAC 7020 

CATACGTGCC GTTCGGCATA CGGCGGTTCA ACTAACTTTT AACGCATGTC GTTCAAGGTA 7080 

ATAATCCAAA CACGAAACCA GTCCACGTTT TTCAAGGACT GGTTTTGATA TAGCACGTTT 7140 

AAGTACCGAC TTCTGAGCTA CTATAGTAGA TTGAAACTAG AATAGTACAC CTCTACTTCT 7200 
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AAAATATTGT TAGAAATCGA TTTGACTGTC CTGAACAATT CGTCCTATTC TTATTTCATT 7260 

TTACTATAAT TGATAGTGGT CGCCCCAGCC AGATACCTTA TCTGCTATCC ATTTAGGAAC 7320 

CCCTAACTTA AGCAATCCCC ATAATCGTCT CGATTTCTTC TTCCATTGCT TCCAGATAAT 7380 

CACTCGTAGG CGAGTACGCA AGCGCTCATC TATGCTAGTG ACTATACTTT TCATATTTAT 7440 

AATTCATTCC TTTCGTTTCA CTCAAGGCAC AACACAGAAT GAAAAAGTGT TGTGATCTTT 7500 

ATTTTGTTTT ATAATAATAG TGAGAAAACC TATCACTACT ACAAATCACG GGGAGGTGAA 7560 

TAAGTGAGTG GTACAGCCAC TACCTCGCAT ATTTTGTCAC ATCATTTAAC GGTACATAAT 7620 

AAGTTGTACC ATCTGAATAA GTTGCTACAA TATCATTTGC ATGCTCTCCT TCACCTTTAG 7680 

CAAAGGTTGG AGCTCCTGCT GGATGATTTT TATTTGCCTC TTTCAATTTT TCAATAATGG 7740 

CATTTTTTCT GTATCTTTTA TATTATCAGG ATTTTTCACT AAGATTTTGT CTGGATATGT 7800 

CGGTTTAGCA GAAACAATTT TTACTGTTAC TTCTTTTTTA TTCGAAGCAC TTGTCCAGTT 7860 

TCCAGCATTA TCTTTAGCAT TTAATTTTAC AGTAATTCCT GAACTAGGAA CTTCAGTAGC 7920 

AGGTTGATTA TCAACATTAT TCAACTTTAA TTTCAAAAGA GCTGTTGCAT CAGACGTTTT 7980 

ATCAATCGTT ATATATAATG ATGAATTGTT ATTATAAACA GTTCCTTCAT ATTTAGCTGT 8040 

TTGTGAGCTA CTTGAAACAG AACTGAAATT ATACCCACTA CCTCCCTGAT TATCTTCAAT 8100 

GCTTACGTCT AAATGAACTT CCCCACTATT ATTTGGCTTA GCAACAACTG TTATAGTAAA 8160 

ATAACATAAA ATTTGCATAA ATAGATTAGG GAAATCAAAG CAGCTTCTAG GAATGTTTTA 8220 

GCAGTCACAG TGTACTTTCC CAGCATCAAG CCACTATAAC TCTGCACATA AAAATGGAGA 8280 

AGATGGCAAT CCTCTTCTCC AAATATTAAC TTCTTTACAA ACCAACTATA GTTGACAAAG 8340 

AACCTAAAAT CAATTGATAA CACAAGGTCA GGTCGGTCAA CTCTTTCAAC TGAAGCCCTG 8400 

TCAACTCTTC CCATTTATCA ATCTTGTATT GGAGAGAATT GCGGTGCAGA TAGAGTTGCT 8460 

GGGCTGTTTT AGTGAGAACA GCACTATTTT CCCAAAGAGA GAGAATGATT TCCTGAATCT 8520 

GATCTTGATC CAAAATCATC TGGTGTAGAC ATTCCTTGAT TGGCTTCAAG TCCACGAGTC 8580 

TTTCTCCCAT ACTCCAAAGA TAGAGCTGAG AAAAAGTATG AACACCTTGG TGACCCTGAC 8640 

GCCACCATGT CTTGAACAAA TCCCGCTCAG CTTTGATTAA GTCTGATAGG GCTTGATGTC 8700 

CCGTCTGAGA CCAAACCTGA CCCAACATGA TAGAAAGACG AAGTCCAAAG TCATACTCAA 8760 

CCGCTTCAAT CGTATCACTT AAAATATCTC TTACAGAAGT GTATTTGTCT TGTTGAAGCA 8820 

CGAAAACATA ATCCTGAGCT CCGACCTGTA GCACTGTCTG ACAATTCGGA AAAAGAGTCC 8880 

GCATCATATC TAGCCAAGAA GCCAGATTTT CCTGCTGAAA ATAAGAAAGA TGGCAATAAA 8940 

CCAACTGAAT CTTTTTAAAA ACTTGCGGTG CCTGTCCCTT GCCCTCAACC AGATAGGAAT 9000 
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ACCAAGGGTT TAGCGAACGA GCCTGCTCCT GCTGGGTCAA AAGGGCAACC AACTGCTTTT 9060 

CACGCTCGCT GAGCCCAGCT TCCTCCAGCA AAATCCACTG CTGAGAAGCT AAAGGGAGCG 9120 

TGAGATAGCC CTCTTTCTCT ACTGGTTGGT CTGAAATCCG AGCCTCAGGA AACCAGTCTT 9160 

GTAGTTCTTT TGCCCTCATG TTCTAGCCCT CCACTTTTTG GATGCACCAT GAAACCAAAC 9240 

TCTCAAGACG TTCCAGATTC TCAGTCATAT GGAGATAGCC CATAACCGCT TCAAATCCCG 9300 

TGGACATACG ATAAGTCACG ACATCTGCAT TTTTAGCCTT TGTGTGGCTA TTGGTATTGC 9360 

GGCCACGTTT GTAGATTTCT TCTTCTTTTT CCGTTAGGAC CTGCTCCTCC AACATGAGAG 9420 

CAATCAGGCG AGCCTGAGCC TTGGCTGACA CGTACTTAGT TGCTTCTTGA TGGAGTTTAT 9480 

TGGGTTTGGT CATACCTTTG AGGATGAGGT GACGGCGAAT ATACATAGAA TACACCGCAT 9540 

CCCCCTCAAA GGCTAGCGCA ATCCCGTTAA TGAGATTGAC ATCAATCACG TGTCCACCTC 9600 

ACTCCATCCT TGGTATCAAG GAGCTTAATT CCTTGAGTAA CCAATTGGTC ACGGATTTGG 9660 

TCTGCTGTCG CAAAGTCACG ATTGGCACGC GCCTCTTGGC GTTTTTGAAT CAAGTCTTCA 9720 

ATCTCTGCAT CCAAAACTTC CTCAACAAAG ACAATTCCAA AAATTTCTAA CATATCTGCA 9780 

AGAGCTTGCT TGACACTTGC ATCATAGTTC CCTGAGTTGA TCCATTTGGC CATTTCAAAG 9840 

ACAACTGTGA TACCGTTGGC AGCATTAAAA TCTTCATCCA TAGCTGCTAC AAACTTATCT 9900 

TTAAAGTTTT GTAACTCTTG GGCATCCACA TTTCCTGTAA ATGGTTGTTC GTAAGTATTC 9960 

TTGAGATACT TGAGATTGGT CTCGGCATCG CGAACTGCCT TTTCCGTGAA GTTGATAGGC 10020 

TTACGGTAGT GCTGGGTCGC AAAGAAGAAA CGAAGTACTT GCCCATCAAG AGTTTTAAGG 10080 

GCATCGTGTA CCGTAATGAA GTTACCCAAG GACTTAGACA TTTTGACATT GTCGATATTG 10140 

ACAAAGCCAT TGTGCATCCA GTAGTTAGCA AAAGCCTTGC CTGTTTTAGC TTCAGACTGG 10200 

GCAATTTCAT TGGTGTGGTG TGGAAACTCT AGGTCAGCTC CACCACCGTG GATATCAATG 10260 

GTATCACCTA AAATCTCTGT CGACATGACT GAACACTCAA TATGCCAACC CGGACGTCCA 1032O 

GGTCCCCAAG GACTATCCCA AGAAATCTCA CCTGGTTTGG AAGATTTCCA TAGAGCAAAG 10380 

TCTACAGGAT TTTCCTTACG AGCCGTTTCT TCATCGGTAC GACCTGAAGC ACCTAGCTCC 10440 

AAATCTTCCA AGGTTTTATT AGCCAATTTA GCATAGTTGT GGGATTTTTC TACACGGAAA 10500 

TAGACATCCC CTTGACTCTC ATAGGCAAAG CCTTTCTCGA TCAAGTCTTC CACAAAACGG 10560 

ATGATGTCTG CCATAAACTC CACTACACGC GGATGGCGAG TCGCAGGTTT CACGCCCAAT 10620 

GCCGTCACAT CCTCACGAAA GGCAGCGATG TACTTATCCG CAACCTCCTG AGGCGTGATA 10680 

CCTTCTTCCC TGGCACGGTT GATAATCTTA TCATCCACAT CTGTAAAATT GGAAATATAG 10740 
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GCAACCTTAT ACCCACGGTA CTCAAAATAG CGACGAATCG TATCAAAAGC TACCGTCGAA 10800 

CGGGCGTTTC CTACGTGGAT ATAGTTGTAC ACCGTTGGCC CACAAACATA CATCTTGATC 10860 

TTGCCGTCCT CAATCGGGAC AAATTCTCGC AAATCACGAG ACATGGTGTC ATAGATTTTA 10920 

ATCATAAATC ATAATCAGGA AAGCTGAAAT CCAAGAACAA TTAGTTTCAT CACTAAAAGT 10980 

TCAAGTAAAT TTCAGTCCGA ATATCTCTAC ACTTCGGAAT CCCTTGCTCC TTTCTCATTC 11040 

AGATAAACCA CCTGAGTCTG TTTGACAAAG CCAATTTTTT CATACAAACG TTTGGCACCT 11100 

ACATTGCTAT CTTCCACTGC AATCTGAAAT TCCTTGTCAT TTTGCTCAAT TAGTTGGTTG 11160 

ACGAGGGATT TTGCTAAGTA GCTTCCATAG CCTTTTCCAC GTTCAGGTTC CAATATTGCT 11220 

AAACCGTAGA GGTAATTCGT ATTAGTCGAT AAATCAACCG TACAAGTTCC AATAACCTGA 11280 

CCAGCTTTTA ATAAAATATA TAGTCGGCTT TCTGGATCTT TCAGAGCTTC AGCGACATAT 11340 

CTATCCACAA CTTCTCTCGA TTCATGTTCC TCTGAAAATG CCTGAAATTT TAATTGACTA 11400 

ATTTGATCCT GATACGAACT ATCTGCTAAC AAAACTTCAA GATGGGAAAC ATTTGCTAAC 11460 

GGATAAGGTC TTCTATCCTT ACCTAACCAA GTTTCTGTCT CTTCATCCTC GATTAGTCCC 11520 

CAGTTACTGG CAAAGTCAGG ATGATTCTCT AAAAAAATAC GTTCTGTCTG AAAAGTGACT 11580 

GACCGAATGG GGAAAGAAGC TGTTTCTCTC TCAAAACTAG TAAACAATGC ACGCGCAATC 11640 

CCCTGACGGC GATGACCTGG ATGAACCAGT ATCGTCACTT CTACATCTTG GTCATCTGCA 11700 

TAGACAGTTA ATAAACCAAC AAGTTCGCCT TTTTCATAAT AAAGGAAAAA GGCGGGCATG 11760 

TTTGGGTCAA AATTAAGCAT GTTAGAGAGA TAGGGATCGC GATAGGTACC GTCATAGTTT 11820 

TGGCAACAGT TAATTACTTT TTTCGCCTCA GATAGCTCCT CTTGGCTTAA CTTGTTTCTT 11880 

GCTTGAATCA TATAGGTATC CTCTACAAAC CAGACGATCT GTGACTGGCA TCTTTAGCCT 11940 

GCTCGAGTTT ATTGACATAA TACTCTCGTT TTTCTTCGAC TTCGTGAATG ACAGGCTCAT 12000 

CTTTCTTACC ATGAAGACGG ACAATCTTGG CCGGAATACC GACAACCGTC ACGTCACTAG 12060 

GTACATCTGC TACGACAACT GCTGCAGCAC CGACCTTGGC ATTTTCACCA ATTTCCACAG 12120 

GCCCGATAAC TTGGGCATGG GCTGATATGA GGGCTCCCTT TCGTACAGTC GGATGGCGTT 12180 

TGCCACAGTC TTTCCCTGTT CCCCCGAGAG TCACTCCGTG ATAGAGAAGA ACGCCTTTTT 12240 

CAACAATCGC TGTCTCTCCA ATCACCAGAC CAGAACCATG GTCAATAAAA ACACCTGAAT 12300 

CAATCTGGGC TCCTGGATGA ATCTCAATCT GAGTCCAAAA GCGCCAAAAC TGACTGTACA 12360 

TACGAGCTAA TAGTTTGAAG CCGTGCTTCC AGAGAAAATG CGAGAGACGG TGGGCCGCCA 12420 

AGGCCTTGAC ACCTGGATAA GTCAGCAAAA CCTCCAAAGT GGTGCGGGCC GCTGGATCAT 12480 

TTTCTTTTAC AATATCAATG GTTTCGCGCC ACCACCCCAT ACATTTCTCC TTTTCTTATT 12540 
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CTGAATCTTT TGATGTTTCT GTAAATTCTT TCTTAGGTTT GTAATCCTTT TGATGACGTG 12600 

GGCGGTGAGG GCGCTCAGAC TTTTCACCTT TTTCATCATG CTCAGGTTTT GGCGGACGAG 12660 

GTAGAAGAGC CTTCATAGAG GCATCGATAC GGCCTTTTTC ATCAATTTTG ATAACCTTAA 12720 

CATCAACTTC ATCCCCGATT TCTACCAAAT CCTCTACACG ATTGGTACGA GTCCAAGCCA 12780 

TCTCAGAGAT ATGAACAAGG GCATCTGTCT TATCAAAGAG GTTAACAAAG GCACCAAATT 12840 

TCTCGATACG AACGACTTTA GCACGGTAAA CTTCATCCAC TTTGGCTTCA CGAACCAAAC 12900 

CAGCAATAAT TTCTTTGGCA CGGTTAATAG CATCTTGGTC ACTAGAGTAG ATAGACACAT 12960 

TTCCTTCTTC GTCTATATCA ATCTTAACAC CTGTTTCAGC GATAATCTTG TCGATGGTTT 13020 

CTCCACCCTT ACCGATGACA ATCTTAATCT TGTCCACATC AATCTTGATC GTATCAATTT 13080 

TCGGAGCAGT TGGAGCCAAT TCTGGACGAA CTTCTGGAAT GGTTGCTTCA ATGACATCAA 13140 

GGATTTCAAA ACGCGCTTTC TTGGCTTGAG CAAGAGCCTC CGTCAAGATT TCTGCAGTAA 13200 

TCCCTTGAAT CTTGATATCC ATTTGAAGGG CTGTAATCCC ATCACGAGTA CCTGCAACCT 13260 

TGAAGTCCAT ATCTCCAAAG TGATCTTCCA AACCTTGGAT ATCTGTCAAT ACTGTGTAGT 13320 

TATTTCCATC TGAGATAAGC CCCATAGCAA TACCAGCTAC TGGCGCCTTG ATTGGCACAC 13380 

CACCAGCCAT AAGGGCAAGA GTTCCCGCAC AGATAGAAGC TTGAGATGAA GAACCGTTTG 13440 

ATTCCAAAAC TTCTGCTACT AGACGGATAG CGTAGGGGAA TTCTTCCAAG CTTGGCAAGA 13500 

CTTGAGCAAG AGCACGCTCA CCAAGGGCAC CGTGACCGAT TTCACGACGA CCTGGCGCAC 13560 

CGTAACGACC TGTTTCCCCT ACAGAATATT GAGGGAAGTT ATAGTGGTGC ATAAAGCGTT 13620 

TCTTGTACTC TGGATCCAAA CCATCAATGA TTTGAGTTTC TCCCATCGGA GCCAAGGTCA 13680 

AGACTGAAAG AGCTTGAGTT TGCCCACGAG TAAAGAGACC TGAACCATGT ACACGAGGAA 13740 

GGAAGTCAAC AACCGCATCC AAAGGACGGA TTTCATCGAC CTTACGACCA TCAGGACGCA 13800 

CCTTGTCTTC TGTAATTAAA CGTCGCACTT CTGCGTGTTC CATTTGTTCC AAGATTTCAG 13860 

CCACATCACG CATAATACGG TCAAATTCTT CGTGGTCCGC ATATTTTTCT TCGTAAACGG 13920 

CAGTCACTTG GTCTTTCACT ACTTGAGTCG CAGCTTCACG GGCCAATTTC TCTTCTACTT 13980 

GAACTGCCTT TTGGAGGTCA CTGTTGTAGG CTGCAATGAT TTCAGCTTGC AATTCAGCAT 14040 

CCACGTGAAG CAATTCCACT TCTGCTTTTT CTTTACCGAC AGCAGCAACG ATTTCTTCTT 14100 

GGAAGGCAAT CAATTCTTTG ACAGCTTCGT GCCCTTTAAG GAGCGCTTCC AACATGATTT 14160 

CTTCTGACAA TTCTTTGGCA CCAGACTCTA CCATGTTGAT AGCGTGCTTG GTTCCAGCTA 14220 

CTGTCAATTC AAGAAGAGAT TGCTCTGCTT GTTCTTGACT TGGGTTGATG ATGATTTGGC 14280 
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CATCTACATA TCCCACTTGT ACCCCAGCAA TTGGTCCGTC AAATGGAATA TCTGAAATAG 14340 

ACAGTGCCAA AGATGAACCA AACATAGCAG CCATTGGTGC AGATGCATTT TCATCATAAG 14400 

AAAGCACTGT ATTGATGACT TGGACTTCAT TACGGAAACC TTCCGCAAAC ATAGGACGAA 14460 

TCGGACGGTC AATCAAACGC GCTGTCAAGG TCGCATCTGT TGAAGGACGT CCTTCACGTT 14520 

TCATAAAGCC ACCAGGAAAC TTCCCAGCCG CATACATTTT TTCTTCGTAG TTGACTTGGA 14580 

GTGGGAAGAA ATCCCCAGTT GCCATTTTCT TAGACATAAC GGCAGCAGTC AAGACAGTTG 14640 

ACTCACCGTA ACGTACGACA ACAGATCCAT TTGCTTGCTT AGCAACCTGA CCAGTCTCTA 14700 

CAATTAACTC ACGACCCGCA AAAGTCGTTT GAAACACTTG TTTTGCCATT TTAATCCCCT 14760 

TTGGATTGAT GAAATTATAC GCCTTGCCTA CAAAGATCAA GATACCAAGG ACGTCAAAAG 14820 

CAAAGTAAAA ATAGGAAACT GACGAAGTCT TCGATGAAGA CAAGACAGTT TATCTTTTTT 14880 

ACACAGCTTT TCGGCCGTGT TCAATTACAC AAGATATTTT GGACGGTTCG GCTTGCCGAA 14940 

CATTTCTGTA GAAAAATAGG AAGGTGACGT CGCACTCGAC GAGTGCTAGG AAGCTTATCT 15000 

TTTTTCCTAA GAAATGAGAC CAAAATTCAA GTCATCAAGA TACCAAGCCG TCAAGCAACT 15060 

CAAAGGAAGA TAGGAAATCG AACGACGGAG CGACTACTCC TAGGGAGATT TATCTTTTTC 15120 

CACAGAGTTG TAGGCAAGTT CAGTTTTCAA GATACATCAT TAGAAAGGTT TAATACTAAA 15180 

GTATCTAAAG CTTTCACGCT AATCGCTATC GGGCGATTAG CTAAATGCTT TACTAACTCT 15240 

CTCGTCAAAT AACATCGATT TGACTCACTC GTGTCGTTAA ATCTTACAGT TTAAATGCAT 15300 

TGTATTATTT AATACCTTCA TCTTTGTATC AAGTACGTAC AGAATTTATT TTATCATATT 15360 

TTTCTTAAAA AGTGAGGTCT TTACCATTAA AAAGGAACCA TTCCCCTCAC CTGAGAAGAA 15420 

TGGTTTGCTT TTATTATCCT AGAGACTGGT GATTAAACAA GGCATGGGTT GCTTGATGGA 15480 

TGTATTTTGC TGTATCAGCA TTATTCATCG TATAGAGATG CACACCGGCA ACATCCTGAG 15540 

TTACCAAGTC CACGATTTGG TCCACTGCAT AGGCAAGTCC TGCTGCTCTG AGCGACTCAG 15600 

GGTCATGCTC ATACTTGTCT AAGATGGCTT TAAATTTGCG TGGAAGATGG ATATTCTCAC 15660 

AAGTCTTCAA GAGTCGGAGA GCCTGATTTC GATTCAGAAT TGGCATAATT CCTGCATGAA 15720 

TGGGAACATC AATCCCAGCC AAGATACACT TGTCCTGAAA ATCATAGAAG CGCTCATTGT 15780 

CAAAGAAGAG CTGAGTTACG AGGCTCGAAC AGCCTGCATC CACTTTCTTC TTAAGATTTT 15840 

GAATATCTGA AATCTGATTT GGCGAATCTG GATGCCCTTC TGGATAGCAA GCTCCAATAA 15900 

TATCAAAGTG AGGGGTTTGT TCCTTGATAA ACTCAATCAA GTCGGTTGCA TAGCGGAAAT 15960 

CCTTTTGTGG TTCCACGTCT GGAATAATAT CCCCACGAAG AGCCAAGATT TTCTGCACCC 16020 

CAACTTTGTC CAAGTCAGCA ATAGTTTCAG CAACCTTGTC CTTAGTTAGA TAAATAGCTG 16080 
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GCAAGTGGGC AATGGTCGGA ATCGCCAAAT CATTTTGGAT AAAGTCAGCC AAACGAACCG 16140 

TCGTTTCCTT GATATTAAAT TTATTATTGC TGGCAGTTAC ACTGATAAAA TGGGGAGCCA 16200 

ACTCCTGCAT ATCCTGCAAG GCTGAAATAA TGTTATCATT ACCCACGGCT GGGTTTGGAG 16260 

GGAACACTTC AAATGAGAGT GACGGTGTTT GGCGTGACAT ATGTAATAAC CTTTTCTAGT 16320 

TGATTTCTTT TTGAACAACC ACTGTATGGA GAGAAATCCA ATCTTACAAT TTCTCACGCG 16380 

CAGCTTTAGC TGCTTCAACA AGGCGGATCA AGCTTTCTTT TGTTTCTGGG ATACCACGTG 16440 

TTTTCAAACC ACAGTCAGGG TTGATCCAAA CTTTCTTGCT TGGCACTTTA GCAAGGATGG 16500 

CTTCGATTGT GTTGTCGATT TCGCCTTCAT TTGGTACACG AGGTGAGTGG ATATCGTAAA 16560 

CCCCAGGTCC CACTTCTGTT TGGAAGTTTT TCGCTTTGAG TTCGTCCAAG ATTTCAAGGT 16620 

TTGAACGGTT AGCTTCAAAG GAAATAACGT CTGCATCCAT GTTATCGATA GCTGGGATGA 16680 

TATCTGTAAA TTCTGAGTAA CACATGTGAG TGTGGATTTG TGTGTCTGGC GCTACTGTTG 16740 

AGTGTACCAA GGGGAAGGCA GGAATAGCCC AGTCAAGGTA GTCTTCGTAC CAGTCGCTAC 16800 

GGCGGAGTGG CAATTTTTCA CGAAGAGCAG CCTCGTCGAT TTGGATGATT TTCACACCAG 16860 

CAGCTTCAAG GTCAAGTACT TCATCCTTGA TAGCAAGGGC GATTTGGAGA GTTGAATCCT 16920 

TGATAGAGAT GTCTTCACGT GGGAATGACC AGTTAAGGAT GGTAACAGGT CCAGTCAACA 16980 

TACCTTTAAC AGGTTTGTTT GTACGACTTT GTGCATAGCT AGACCATTTA ACAGTGATAG 17040 

GGTTAAGACG AGTGACATC A CCCCAGATGA TTGGTGGTTT TACCCCACGC ATACCGTATG 17100 

ATTGTACCCA TCCATTTTTA GAGAAGAGGT ATCCTGACAA GTTTTGACCG AAGTACTCAA 17160 

CCATGTCATT ACGCTCAAAT TCACCGTGAA CAAGGACATC AAAGTCAATA TCTTCTTGCC 17220 

ACTTGATCCA TTCGTCAATC GTTTCAGCAA GGAAAGCGTC GTACTCTTTT TGAGACAATT 17280 

CACCTTTACG GTAAGCCAAA CGTTTGGCAC GAACTTCTTT TGTTTGAGGG AATGAACCAA 17340 

TCGTTGTTGT TGGAAGAGCT GGAAGTTTGA AAGCTTCTTC TTGGATAGCT TCACGTTCTG 17400 

CAAAGGCTGG CAAACGAGTG TAGTCTGCGT CTGTCAAGCC AGCGATACGC GCACGAAGTT 17460 

CAGCATTTTC ACCAACACGC TCAGTCGCAA AGAGTTCTTT GTTGGCTGCA AGAGCTTCTG 17520 

AACCTTGACC ATTTCGGATA GCATCCAAGT CACGGATTTC ATCCAATTTT TCAACTGCAA 17580 

AGGCAAAGTG GTTCAAGAGT GCTGGTTCAA ATTCTTCATT AGCAGTTGTA AATGGCACAT 17640 

GAAGAAGTGA GCAAGAGCTT GTCAAAACGA TGTTTTCAGC TGGAATTTGC TCAAGAACAG 17700 

CCAAGCTCTT TTCGTAGTTG TTGCGCCAGA TGTTTTTACC ATTGACAATA CCTACATAGA 17760 

GAGTCTTGTC AGCTGGGAAG CCACCTTTAA CGAGTTCAAG AGTTTTCTTA CCTTCAACAA 17820 
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AGTCAAGACC GATAGCATCT ACTGGTAAGT TTACAAGGTC AGCGTATACG TCACGAACAT 17880 

CACCGAAATA AGTTTGAAGC AAGACTTCAA GACCTTTTTT GTCAGCCAAG AGTTTGTTGT 17940 

AAAGGTTCAA GAAGAGAGCT TTTTCTTCAG CTGTCAAGTC TTTTACAAGA GCCGCTTCAT 18000 

CCAATTGGAT GCGAGTCGCA CCAAGTTCAG CCAATTTAGC AAAAACTTCT TGGTAAGCAG 18060 

CCACTAAGCT ATCTACGAAG TCGTCTGCTT TCACGCCTTC TTCAAAGTCT GACAATTGAA 18120 

GGAAAGTGAA GGGACCTACA AGAACAGGAC GAGTGTTCAA TCCAAGTTCT TTGGCTTCTT 18180 

GGAACTCATC GAAAATCTTG TGACCAGCCA ATTTTACTTG AGTGTCTTTT TCAAATTTAG 18240 

GAACGATGTA GTGGTAGTTA GTGTTGAACC ATTTCTTCAT TGGAAGGGCG CGAACGTCCC 18300 

CTTTTTCTCC CTGGTAACCA CGTCCCAAAG CGAAGTAGCG CTCAAGGTCA GACAAGTCCA 18360 

AGTTTTGAAC GGATGCAGGC ACCACGTTGA AAAGGAAAGC CGCATCTAGG AAGTTATCAT 18420 

AGTGAGAAAA GTCATTTGAT GGAATTTCAG TGATGCCTTT TTCTTTGACA ATGTTCCAGT 18480 

GTTTAGCACG CAAGTCTTTT GCTGCTGCTA AAAGTTCTTC TTCTGAGATT TCTTTTCTAA 18540 

AGTATTTTTC AGTTGTAAAT TTTAATTCAC GGAATTCGCC CAAACGAGGG AAACCGATGA 18600 

TTGTAGTTGA CATGATGTGT CCTCCAAAAT TTGTTGTTGA AACTATCTTA ACAGAAAAGA 18660 

AAGCGTCTGT ATAATTGTAA AAAATTAGGG TTTGATATAG TTTGAAACTA TATATCTGTT 18720 

TCGGACAAAA GAAAAAGACT TGAAGCAAAC GTCTCAAATC CTTTGTAATT CTTACTTTAC 18780 

AGCTATATTC CAATTAGAAT ACTAAAACAT GTTATTAGTA ATTCTTATAA GTGACTATGA 18840 

CCTGTTATTA GAAAAGACTA TAACTGATTC TAGTCAACTT TTTCCCTGTT CAAGTGGGAC 18900 

GATTGCTAGT GTCTTTCCTA AACTGGCTAG GACTTTTAAG ACTGTATCCA ACTGAGGACT 18960 

AGTCTTTCCT GTCTCCATCC TAGCTATGAC AGGCTGGCTT ATTCCACTGA CTTCTTCCAG 19020 

CTTTTTCTGA CTGATTCCTT GTTCATACCT AGCCTCAATC AACTCGCTCA TGATAGCCAC 19080 

TCGCATATCA CTTTCAAGGA TTTCCTCCTT GCTAAAGAGC TCAGATGGAC ATCCTTCCAA 19140 

TTACTCCCAA TAGCACTATT CTTCATCACT TAACCCTCTT TTTTTTACGT CTATGTATTT 19200 

TTAAAAAAAT GAGCGAATTA TGATTCGATA GATTGACCAG TGGGTTTAAA GTTGGTGCTA 19260 

GCCTATTTCT TAAGCGATTT TCCTTTTCTA GGATAAAGCA GTTCCTGCTT GCTTAACCCC 19320 

AATTTTCCAC GATGAATCCA ATAGTAAATG GTTGAAATTC CCACGTTAAC CCCTTTAGCC 19380 

ATCACCATCA TTTCAGGCGA AAATTTTTGG TTATGTTTTT GGTTATGTAT AGTGGAGAAT 19440 

CTTTTCCTTT AGTTTCTTAA GACTGTTGAG CGTAGTCGGC AGAATAAATC TCTTTGAAGC 19500 

GCCCTTTTCC AAGACATTGT CGGACTGTCC CACGCTTGAT TTCAGTGTGG ATAGTTTGAG 19560 

GAGCTTTTCC AAGTAGAGAG GCAATTTCTC TATTTGATTT TCCTTCTTTT TTCCATCGTT 19620 
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CGATTAAGCG ACGGCTATCG ATTGTCAAAT GTTTGCCTTT TGTAGTATAA TTGTCTTGCA 19680 

TTTCTGTGCC TTTTAATCAT TTCAATCTTA AATTGGACTT TTTTTACTTG GGTTGTACTT 19740 

AATCTATGAG GAAGACAAGA AAAAGAATAT CAATCAAGTA AAGTCACAAA GTCACATTAG 19800 

CTCCGAGCAA CCATTGCAAA TTGAGGTACT CACACAATGA TTAAAACATT TCTCTCTGCC 19860 

CTTTCGGTCA TTCTCTTTTC TATCCCTATC ATAACTTATT CTTTTTTCCC ATCTTCTAAT 19920 

CTTAACATTT GGCTATCTAC CCAACCTATC TTGGCACAGA TTTATGCCTT CCCCTTAGCT 19980 

ACTGCAACTA TGGCTGCTAT TTTAAGTTTC TTATTTTTTT TCCTATCTTT TTACAAGAAA 20040 

AATAAACAAA TACGGTTTTA CTCTGGCATT TTGCTCTTAC TATCGCTCAT ATTACTATTA 20100 

TTCGGAACAG ATAAAACCCT TTCTTCTGCA TCAAATAAGA CTAAAACCTT AAAATTAGTA 20160 

ACTTGGAACG TCGCTAATCA AATAGAAGCA CAACATATTG AGCGAATTTT TAGCCATTTT 20220 

GACGCCGATA TGGCTATATT CCCTGAACTA GCTACCAATA TCAGAGGTGA GCAAGAAAAC 20280 

CAGAGAATCA AACTATTGTT TCATCAAGTT GGACTTTCTA TGGCCAACTA TGATATTTTC 20340 

ACTTCTCCAC CTACCAATAG TGGAATAGCT CCTGTGACTG TGATTGTCAA GAAAAGTTAT 20400 

GGTTTCTATA CAGAAGCTAA AACTTTTCAT ACAACACGGT TCGGGACAAT TGTATTACAT 20460 

TCGAGAAAAC AAAATATACC AGATATCATT GCCTTGCATA CTGCGCCTCC TCTGCCAGGT 20520 

TTAATGGAAA TCTGGAAGCA AGACTTAAAC ATCATTCATA ATCAATTGGC TTCAAAATAT 20580 

CCAAAGGCTA TTATTGCAGG TGATTTTAAT GCAACTATGC GTCATGGAGC ACTTGCAAAA 20640 

ATAAGCTCTC ATAGGGACGC ATTAAATGCA CTGCCACCTT TTGAAAGAGG AACTTGGAAT 20700 

AGCCAAAGTC CAAAACTTTT TAATGCAACA ATAGATCATA TTTTATTGCC TAAAAACCAC 20760 

TACTATGTTA AAGATTTAGA CATTGTAAGT TTTCAAAACT CTGATCATAG ATGTATTTTT 20820 

ACAGAAATCA CATTTTAATT ATTTTATATA AAATCACCCC TCTAATGTTC ATAAACTAGA 20880 

GGGGGAATTT GTATCCTACT ATCGTTTAAC GCACTTCTGC ATTGACTTTT TCTTCGAGAG 20940 

ACGCTTGGAT TTTTTCCATA TAGCGTGCGA CTTCTTCGTC CGTTAAGCTG TCTTCTGGAT 21000 

TTTGGAAGGT CAAGCTATAA GCCATTGACT TCATACCAAG TCCCAGTTTT TCACCTGAGA 21060 

AGACGTCAAA GAGTTTGATA TCTGTCAAAC GTTTCACGCC GGCAGCTTGG ATAGCATCTA 21120 

CAACTTCTTG GTGAGTCACT TCTGCCTTGA GGAGAAGGGC AACGTCACGG CTGACTGCTG 21180 

GGAATTTGGT GATTTCCACA AATGGAACAG CAGGTTGGAG CGCCCCTTCG ATGGCTGAAA 21240 

GGTTAAGCTC AGCTACATAC GTTTCTGGAA TATCGTAAGC CTTGGCAGTG ACTGGATGCA 21300 

CTTGGCCAAG GAAACCAAGA ACTTGGTCAC CGAGTGAAAT CACGGCTGTA CGACCTGGAT 21360 
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GAAGGCTAAC GATTTCAGAT GTTGCTGTAT AGGTTACTTG GAGTCCCAAA CGAGTAAATA 21420 

GGGCTTCAAG GATTCCCTTA GCATAGAAGA AATCAACTGG AACTGCTGCT GTTTGGAAAT 21480 

CTTTTTCAGC AACCAAGCCT GTCAAGGCAA AGGCAAAGCT GTTGATCTCA TTTGGAAGTT 21540 

CTTCTTTTGG ATTACCTGTT TGTTCAAAGA CTTTTCCAAT CTCATAAAGG GCCAAGTTTT 21600 

TATTCTTACG AGCCACGTTG TAGGCAACGG TATCAAGGAT CCCTGAAATC ATATTTTGAC 21660 

GGAGGACTGA ACGATCCACA GTCATTGGCC ACATGAGTTC AGTAAGGTTA CTTGGTTGAG 21720 

CTGTGAACTC AACTGCTTTT TCAGGAGTTG TC AG AG CAT A GGTGATGATT TCTGTCAAAC 21780 

CTGCTCCTTC AGCAATGGTA CGAACTTGAC GGCGGAGTTT TTGTATCACA GTCAATTCAC 21840 

CAGCTGTACC ATCGTCTTTT GGAAGGCTGG TTGGCAAGCG GTCATATCCA TAGATACGAG 21900 

CGATTTCTTC AAAGAGATCA GCTTCGATTG TGATATCCCA ACGACGACGT GGTACGCTGA 21960 

CTGTAAAGCT GTCTGCATTT CCAGAAAGAC CAAAGCCAAG ACGACGGAAG ACGTCTTCTA 22020 

CATCAGCATA AGACAGCTCA GTTCCGAGGA CACGGTTAAC ATCAGCAAGG GTTGAAGAAA 22080 

CTTCCACATC AGAGGTATCA AGCTCACCCG CTGAAACGAT ACCCTTACGC ACCGTCGCGC 22140 

CTGCAAGCTC TGCAATCATG CTAGCTGCCG CATCAAGGGC TTCATTAACT GTTGCCACAT 22200 

TAATTCCTTT TTCAAAGCGA GAAGATGACT CAGAACGAAG GTTCAGGCGA CCACTTGTCT 222 60 

TACGGATAGA TTTGCCATTA AAAACAGCAG CTTCAAGGAT AACACGACTA GATTTTTCAG 22320 

AAATTTCTGT AGCCTGACCA CCCATAACAC CGGCAAGGGC TACTGGTTTG TCAGCAACTG 223 80 

TAATCACGAG GTCTGTCTCA GCCAAGTCTC GTTCTTCACC GTCCAGGGTC ACTAATTTTT 22440 

CACCATCACG CGCTTCACGC ACACGGATGT CAGTCCCTTC AAATGTGTCC AAGTCAAAAG 22500 

CATGCATAGG TTGACCAAAG TAGAGCAGGA TGTAGTTTGT CACGTCTACA ACGTTATTGA 22560 

TGGGACGGAT GCCTTCGTTC ATGAGAAGGT TTTGCAACCA TTGTGGACTT GGTGCGATAG 22620 

TCACATTGTC CAAGATACGA GCTGCATAGT AAGGCGCCTT GTCTGTCTCA ATCCTGACAG 22680 

AAAGGGCATC TGCCGCAGCT TCATTAGTTT CTGTTAGAGT AAATTTTTTA AAGTTGACTG 22740 

CCTTGTCATA GATGGCTGCC ACTTCGTGAG CCACTCCACA CATAGAAAGG GCATCTGCAC 22800 

GGTTTGGTGT GATGGAAAGT TCGATGATTT CATCATCCAA GTCTAGGTAA GAAAAGACTT 22860 

CCTCACCTGG CACGGCATCT TCAGGCAAGA TTTGGATGCC ATCTGCGAAT TCCTTAGGCA 22920 

CAACTGAGTC AGAAATTCCC AATTCACCAA GTGAACAGAT CATTCCAAGT GACTCCAAAC 22980 

CACGGATTTT TCCTTTTTTG ATTTTGTAGT TATCAGCGAT ACGAGCTCCT GGAAGAGCCA 23040 

CCATGACCTT GATCCCAGCA CGCACATTTG GGGCACCACA AACGATCTGA CGCTCTTCTT 23100 

CTTCGCCAAC GTTAATCTGA CAAACATGGA GGTGAGTCTC TGGCACATCT TCGCAAGACA 23160 
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AGACCTCACC GACGACAATT TTTGAGAGAC CAGCAGCTGG TGATTCGACA CCCTCTACCT 23220 

CGATCCCTGT AGTTGACATT TTTTCAGCCA ACTCTTGTGA TGGCACATCA ATGTCCACCA 23280 

ATTCTTTTAA CCATTTATAA GATACAAGCA TAATTTAGTT CTCCAGAATG ACAGTTGTCA 23340 

CTCTAGTTCT TTTCCTTTCC TATCATTTCA ATAGAAGAAT CCTCTTCTTA CCTTAATTTC 23400 

TTTCTCAGTA ACCAATCCGT ATCTACTTTT TGACCAACCA TAAAATGATG TTGGCTAAAT 23460 

TTTTCAAAAC CATATCGGTT ATAAAACGCT TGAGCTTTTG TATTATGCTC CCAAACACCT 23520 

AGCCAAGCCC AAGAAAAACT ATTTTTTGTA GCAAGTTCAA GTGCGAATTC AAACAGTTGC 23580 

TTACCTAGTC CAAATCCTTG GAATTTTTGT AGCACATAGA GACGTTGAAT TTCAAAAGCG 23640 

TCCTCTAATT CTCTCTCAGT TTGAGCACTT CCCCAGTTGA CTTTGAGAAA ACCAGCTATC 2370O 

TCCTCCTCAT GCATAATGAA ATAGGTTTCA GAGTCAGGAT TTCCCAACTC AGTTGACAAA 23760 

GTTTTCAGAC TATAAGCCTC TTCAAAGTAT TCCTGTAACT GCTCTTCCGT ATTATCATAC 23820 

GCAAAGGTTT CACGAAAGGT TTGTTTGGCA ATTTTAGCCA ACACCTCAAC ATCTGCCATT 23880 

TCTACTTTTC TAATCATTAT TTAAACTGTT CTGAGAAGCG GACATCTCCT TGGTAGAATC 23940 

CACGGATATC GTTGATTCCA TAACGGAGCA TAGCTACACG CTCTTGTCCA AGACCAAAGG 24000 

CAAAGCCAGA GTATACAGTC GCATCGATAC CACTCATTTC AAGGACACGT GGGTGAACCA 24060 

TACCGGCCCC CATAATTTCG ATCCAACCTG TTTTCTTACA TACATTACAG CCTTCTCCAC 24120 

CACACTTGAA GCAAGAAACA TCCACCTCAA CAGATGGCTC TGTGAATGGG AAGTAAGATG 24180 

GACGCAAACG AATTTGACGC TCTTCACCAA ACATTTTTTG GACAATCAAC TGAAGCGTTC 24240 

CTTGAAGATC AGCCATAGAG ATATTTTTCC CAACTACCAA GCCTTCGATT TGGTGGAATT 24300 

GGTGACTGTG GGTCGCATCG TCCGTATCGC GACGGAAGAC ACGCCCTGGC GAGATCATCT 24360 

TCAAAGGACC TTTAGAAAAA TCATGGGCAT CCATAGCACG CGCCTGAACT GGAGACGTGT 24420 

GGGTACGGAG CAAGATTTCT TCAGTGATAT AGAAAGTATC CTGCATATCA CGAGCTGGGT 24480 

GGTCTTTTGG AAGGTTCATA CGTTCAAAGT TATAGTAGTC TTGCTCCACT TCAAAACCAT 24540 

CCACGACTTG ATAACCCATA CCGATGAAGA TATCTTCGAT TTCTTCACTG GTTTGTGTCA 24600 

AAACGTGACG GTGACCAGTC GCAACTGGAC GACCTGGAAG CGTCACATCT ATACTCTCGC 24660 

TAGCCAGTTG AGCCGCGACT TTCTTTTCTT CCAAGAGCTT AGCTGTTTCT TCAAAAGCAG 24720 

CAGTCAAGAC ATCACGAGCT TCATTGACGT GTTTCCCGAT GATTGGACGC ATCTCAGCAG 24780 

AAACATCTTT CATCCCTTTG AGGATTTCAG TGAGCGAACC CTTTTTACCA AGGACAGAGA 24840 

CACGCAAATC TTGCATCTCT TTTTCATTTC CAGCAGTAAT CTGCTTCAAG CTAGCCAGCG 24900 
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TTTCTTCGCG AAGCGCTTTT AATTGTTCTT CAATAGTTGA CATATTTCCT CCATCAGTCT 24960 

CTCGTAGATA AAAAGAAAAC CACATGCCAA AAACTCCACT CGGAGCGTTG ACACGCGGTA 25020 

CCATCCGTTT TCATCTGACA AGTCAGACCT TCATTTCTAA ATCCATGCGC AAGTGAATTC 25080 

ACCCAGCTTT CATATAGAGA GCTTGCAGTC ACGGCTCTCC TCCCTGATAT ACTTCCCTTG 25140 

GGCTACTAGT CTTTCAGATT CCTATTCAAT TACTACTTAG TTTATCAGAT TTTTACCATT 25200 

CTTGCAAGAC CTATCTTACT TCTGCTTGTT AGCTTATTCT TATCTAAATT TATATAAACc 25260 

TTATCTAAAT TAACTATTTA TAATTTTTGT AACAAAATTA AATTAATTGA CACTCCCCTA 25320 

TAAAATAAAG AAGTTTAGAA TTTAATGTCT TCCAAACTTC TTTATTCCAT ATTTAATGAA 25380 

ATGCCACCTT AACCGTGATA ATAGCTAGTC ATCAATAAAA AACTATTTGA ATAAGGATTC 25440 

TCCATTTGAT TCAATCACTT CTTTATACCA AGTAAAAGAC ATTTTCTTAT ATCGATTTAA 25500 

TGTACCACTT CCATCATCGT TTCGATCAAC ATAAATGAGA CCGTACCTTT TAGAAAGTTG 25560 

TGCAGTGGAC ATAGAAACAC AGTCAATACA TCCCCAAGAC GTATAGCCCA TAATTTCAAC 25620 

ACCATCCTGT AGAGCTTCAG CAACTTGCAA TAAATGTTCT TTCATATACT GAATTCTATA 25680 

ATCATCTTGG ACGGTTAAGT TATTAAGTTC ATCTTTTATT AGTTGATCTT TAGCACCTAA 25740 

TCCATTTTCT ACTATAAATA ATGGGATTTG ATAACGGTCA TAATATCTAT TTAAAATTAT 25800 

ACGTAGTCCA ATTGGATCAA TTTGCCATCC CCACTCTGAA GACTCTAAAT AAGGATTTAC 25860 

TAAACCACCA ATAATATTCC CTTCTCCTGA ATTATACTGT GTTGGAAGAG CAGATTGAGT 25920 

CACACTCATG TAATAGCTAA AGGATAAAAA ATCTACGGTA TAATTTTTTA ATAACTCTGC 25980 

ATCTTCAGCT GCAAACTCTA TGTTAATGTC ATTTTCCTTA AAATATCTTT TTGCATAATT 26040 

CGGATAATAA CCTCTAACAT GCACATCTGA AAATAGATAA TTTAGATTCT CATACTCATG 26100 

AGTCGCCCAT ACATCTTTTG GATTTGGAGT CATTGGATAA GCTGGCATAG CTAATACCAT 26160 

ACATCCCACC TTAAACTCTG AATTAATCTC ACGAGCAATT TTTGTAACCA AACTTGAGGC 26220 

GACTAATTCA TGATGTATAG CTTGATATAA TTCTTGTTTC GAAAGATTCT CCTTAGGTAT 26280 

ATCTATTCCT CCACTAGTAA ATGGTAATTC CAAAACAGAG TTTACTTCGT TAAATGTAAG 26340 

CCAATATTTA ACTTTATCTT TATACCTTTC TAAAACTGTT CGAGCAAATT TTTCATAAAA 26400 

ATGAATCATT CTCCTATCAA CCCATCCATG ATATTTTCTT GCTAAATATA ATGGAGTCTC 26460 

ATAGTGTGAA AGAGTTACAA GTGGTTCTAT CCCGTGAGCA TGTAGTTCAT CAAACAATTC 26520 

ATCATAATAT TTCAACCCAG CTTCGTTAGG TTCTTCCTCA TCTCCTTTTG GAAAAATTCT 26580 

ACTCCATGCA ATAGAAGTAC GAAAAACATT AAAGCCCATT TCAGAAAACA AGGATATATC 26640 

TTCCTTATAT TTATGATAAA AATCAATACC TATCAATTTT AAGTTATCTT CTGTAGGATT 267O0 
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TTCTGTTGCT TCTCCTAATC CACCTTTGGG TAACACATCC 
ATCTTCATTA TATGCTCCCT CTACTTGATT AGCTGCAACA 
ATCTGGAAAA ATGGTCATAA CTTTCCTCCA TTATAATATT 
CTCGATTGTC TGATTATTAG GTAATACTAA TACATCTAGA 
AATTACTGGT GTAACTGTTT CGTAGCCTTT AGTCTTGATT 
AATCAACTGA TTTTTGAAAA CTCTGTCTCC TTCTTCTACA 
TTTTAGCTCA ACAGTATCTA ATCCAATATG AATTAGTAAC 
CAATCCAATT GCGTGCTTAG TCGGAAAAAT ATTTGTAATT 
AACCTTACCT TCACTTGGGA TAATCGCTAC TCCGTCTCCA 
TTTATCCTGG ACATCGCTTA ACGGAATGAT TTCTCCTGAT 
TTTATTTGAA ACTCCAGCTT CAACTTCTAA ATTGCTAGAA 
AAATATATAA GCTAATACAA AGGTAATAAC AACCGAAATG 
TACAATATTT GATGGCACAT CAGAATAAAT AAATTGAGGC 
AGCAAATAGA TATGCTTTAA CACTAGTAAG ACCTGCAAAT 
AATCATAGCT GCATAAAGCG GTTTTTTATA TTTTAAAGTC 
GGTAATCCCT GCAAGTAAGG CTGAGAAACC TGCTGCAAAA 
ATTTTTACTC TTTAATGCAA CAGCCATCGA AGCAGCCCCT 
CATTGCTGGA AGAATTAATA CGTCTGGAGT AGCAATAGAT 
AAAAGCCCAA TGCATTCCAG TCATAACAAT AAATGGCATA 
TGTAAGCCAT CCAGCTACAC CATACATTTG CCCAACTAGA 
AATTACTCCA ATAGGTCCGA CTACAACTAA GGCAATACAG 
CGTAGGTTGC AAAAAACTCT TAGTAATAGC TAGTGTTAAT 
ATATTTCATC AACCAAACCA TAATAAGAAT TGGAACGACT 
TGTCACAGGT GCACCAAATA AACTAAGAGG ATTCCCTGAT 
TGGATGGAGA AGTACACCTG CTACAGACAT AGCTAATGTA 
TGATGCAGAA TAAGCTAATA ACAGCGGTAA GAAATAATAT 
CAAAAAAGCA ATAGTCTGAG AATCTGATTG CAATATACCA 
CAAGACTTTC AACATACCTC CCCCTAACAT TGCTGGAATG 
GAT AT ACT CA ATGATTCTTT CTAAAATATT CCCTTTGTGC 



TGAACTGATA AGCCCTTACC 26760 

GCTCCACCCC AAAGAAAATC 26820 

ACCAGTAATT CCTTAGAATG 26880 

AAATCATTGG TATTCGTTAC 26940 

AAATTCAAGT CCATTTCAAA 27000 

TGACTAATAA AACCTTGACC 27060 

TCAACACCCT CATCACTCTT 27120 

TTCCCATCAA ATGGTGCATA 27180 

ATTAGTTTAT CTGAAAATGT 27240 

ATAGGAGAAA ATATCATTTT 27300 

CTCTCTTCTT CATCGATTCC 27360 

ACCGCCACAA TTAAAGCATT 27420 

AACGCTATCA AAGATGGGAC 27480 

AATCCCGCTA ATCCACCACC 27540 

ACACCATATA ATGCAGGTTC 27600 

GCAATTTGTT TTGTATTATT 27660 

TGAGCTAAGT TTGACCCTAA 27720 

GCCGCCAAAA AAATAGGTGC 27780 

ATAGCACCAA GAATAGCTAA 27840 

TTTGATAATC CTTCACCAAC 27900 

CTTGATACTA ATAATACTAG 27960 

TTAGCAATTA TTTTTTCAAT 28020 

GATGAACCAT AACTAGCTGG 28080 

TGCACCATTT GAACAAAATT 28140 

GATGTTACTT TTAATTTTTG 28200 

GGAGCATCCC CAAAAAATGT 28260 

AGCATTGGTA AAATGATTAC 28320 

ATTGGAGTCA TGGAACCAGC 28380 

CCTTGAACAA CTGAATCGGA 28440 
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TTCAAAATTG CCAAGTTTAA CGAATTCTTT ATAATAATTA GCTACATCAT TACCAAGTAT 28500 

AATTTGATAT TGTCCATTCT TTTTCATAAT ACCTATTACA CCTGGTATCT TCTTCACATC 28560 

ATCATCATTG ACTAAATTTT CATCTTTTAA TTCTAATCTT AAACGTGTTA CACAATGGGT 28620 

AACTCTATTG ACATTTTTTT CACCTCCAAT TACATCGAGG ATTTTTTGTA CCGTATCTTT 28680 

ATAACTCATG GTATTCTCCT ATTCTATTAA TCTAAATTTT TTGTTAAGCG ACGAATATGA 28740 

GCCATCAAAT AAACTAATTC ACTAGAAGTC AGCAAATAAT TGTACTCCGT TTGTATAAAC 28800 

ATTGCTACCT GTTCACCACA TTCATATTCT CTAGGATATT TATTTTTCAT TAATGCTAAC 28860 

AAGTCTTCAT CATCATCGTC GG 28882 



(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12835 base pairs 
{B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 



GCCTATGTCT 


TTTTCAAAAA 


AATGCTTGAC 


TTGAGACGGG 


AACTAGGGAA 


GTCTAAAGGC 


60 


GGAAGGCATT 


GATTTATACT 


CTTCGAAAAT 


CTCTTCAAAC 


CACGTCAACG 


TCGCCTTGGA 


120 


TTATATATGT 


AACTGACTTC 


GTCGATGCTT 


ATCTACAACC 


TCAAAGCAGT 


GCTTTGAGCA 


180 


ACTTGCGGCT 


AGTTTCCTAG 


TTTGCTCTTT 


GATTTTCATT 


GAGTATTATA 


TTACTTTCTA 


240 


TTTGTAGGAG 


GTGGCTTATG 


AAGATTCCTC 


TCTTAACTTT 


TGCAAGGCAT 


AAATTTGTTT 


300 


ATGTCTTGCT 


TACTTTGCTT 


TTTCTTGCTT 


TGGTTTATCG 


TGATGTTTTG 


ATGACTTATT 


360 


TCTTTTTTGA 


TATTCATGCG 


CCCGATCTAG 


CTAAATTCGA 


TGGACAAGCA ATT AAAAATG 


420 


ACTTATTAAA 


ATCAGCATTA 


GATTTTCGTA 


TTCTCCAGTT 


CAATCTAGGT 


TTTTATCAAT 


480 


CATTTATTAT 


TCCAATCATC 


ATTGTTTTGC 


TAGGTTTTCA 


ATATATTGAG 


CTGAAAAATA 


540 


AAGTTTTACG 


ATTGAGTATT 


GGAAGAGAAG 


TGAGTTATCA 


AGGGTTAAAA 


AGAAAGTTGA 


600 


CTTTGCAAGT 


TGCAAGTATC 


CCTTGTTTGA 


TATATTTAGT 


GACTGTGCTG 


ATAATTGCAA 


660 


TTATAACCTA 


TTTCTTTGGG 


ACTTTTTCTC 


CTCTTGGATG 


GAATTCTCTA 


TTTTCTGATG 


720 


GAAGTGGTTT 


ACAAAGACTC 


CTAGATGGAG 


AGATAAAAAG 


CTATTTGTTC 


TTTACTTGTG 


780 


TCCTACTAAT 


CGGTATTTTC 


ATCAATGCAA 


TCTATTTTTT 


ACAAATAGTT 


GATTATGTGG 


840 


GGAATGTGAC 


TCGTTCGGCA 


ATCACCTATT 


TGATGTTTCT 


TTGGCTTGGT 


TCTATGCTGC 


900 


TTTATAGTGC 


CTTGCCTTAC 


TATATGGTTC 


CTATGACGAG 


TTTGATGCAA 


GCTAGCTATG 


960 
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GGGATGTAAG TTTGATGAAA CTCTTTACTC CTTATATCCT TTATATTGTC CCTTACATGG 1020 

TGCTTGAAAA ATATGAAGAT AATGTTTAAG AATTTTAACA ATATTTTGCT AAATAGAAAG 1080 

ATTGTTTTAC TACTTCGTAT AGTTCTGATG ATGATTTTGA TAAACCATCT ATTGTCAACA 1140 

GCGGTTCAAA AGCAGGATGC TGTTATCTTT TTCAAGAGAG AATTGATTTC AATTTTTTCC 1200 

TATAATGACT ATTCTGAAGC GAATTTAGAA ATCCCCAAAC TATTGTTAAA CCTTTCGCTT 1260 

TTCATGGTAG GATGGCTCTC TGTCATTTTA CTTGAAAGTG ATTTGGCAGA CCATTACCAT 132 0 

CACTTGATTC GCTATCAATC AAGCTCCTTT TTCGATTATA CAAGGAAACG ATTGGTTGTC 1380 

ATTTCTAAAT TTTTTACTCA AGATTTGTTT GTCTGGTTTC TTGGTTTACT TCCTCTAGGA 1440 

ATTCATTTCA AAACAGTCGC ACTTTTCTTT TTACTTGCTC AGTTAATGAT GTTGTACTTA 1500 

CTACTGTCTT ATCTGATAGC ACTGATTAGT GCGGGCGCTG GTTTTTCCTT TTTTCTCTAT 1560 

TTTTTAGCAT TTGTGGGACA AGAATGGATG ATGGATCATA TTGTAACAGT GTATTTAGTA 1620 

CTCTTAAGTT TATTAGTTAT GTTGATTGTT AGTCGCTTGG AAGAGAAATT TAAGAAAGGA 1680 

TAAACGATGA GACTTGAAAT TATAAATGGA CAGAAAATTT ATGGGAAAAG ACCTATTTTA 1740 

AATCAGTTGA ATTTGGTGTT TCAATCAGGA AAAATTTATG GACTTAAAGG TGATAATGGA 1800 

TCTGGCAAGA CGGTTCTTTT AAAGATACTT GCTGGTTATA TTAAGCTTGA CAAAGGAAAA 1860 

GTTCTTCAAG ATGGTAAAGT TTACGGGGTA AAAAATCATT ATATTCAGGA TGCAGGAATT 1920 

TTAATTGAAA AAGTCGAGTT TTTATCTCAT TTATCCCTGA GAGAAAATTT GGAACTGTTA 1980 

AGGTATTTTT CATCTAAAGT TACGGAAAAA AGAATTGCCT ATTGGATTCA ATACTATGAT 2040 

TTACAGGAAT TTGAAGACAT TGAATACCGT CATTTATCCT TAGGAACAAA GCAAAAAATG 2100 

GCCTTGATTC AAGCCTTTAT TTCCTCTCCT TCTATACTCT TTCTCGATGA ACCTATGAAT 2160 

GCTTTGGATG AGAAGAGTGT GAGGTTAACC AAACAGGTCA TTTTATCTTA CCTGAAAAAA 2220 

GAAAATGGTC TGGTTATCCT GACGTCGCAC ATATCGGAAG ATATTTCAGA CCTTTGTACA 2280 

GATGTATTAG TTGTCGAAAA TGGACATATA CAAATGTAAA GGATATACAA TCCTAGGAGA 2340 

TGGCTTATGG CACATCTAAA ATCATTTATT ACACGATATT CCAAGGTTTA TATTGGTTTA 2400 

GTTCTGCTGA TCTGGCTGTC TTTCTTCTTT ATCCCTTGGG ATAAACCACT TCTGGGGATA 2460 

AGGATTGACA TCTTCATCAT ACAGAAAATC TTGCTAGCTT TTGGAATTCT GTCCATTCTC 2520 

ATGGCCTTGC TGTCCAAGAA AGTCAGTCTC TTTGTTTTTG GACTGATTTG CTGTCTTTCT 2580 

CTTTGGATTA ACTTATTTAT CACATTTGCC ATTTTGCCGA TTTTTGGCAA TTAAACAGTC 2640 

ATAAAAGTCG GAGAGGTTAG CTTGAAAACT AACCTCTTTT TCCTTTTCAA AATGGGGATT 2700 
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CTTCCTTGAA AATAATCAGT AATTGTGCTA AAATTAAAGG AACATTCTAA AATATTCGGA 2760 

ATTTAAAGTA AGGAAAAACA TGGCTAATAT TTTAAAAACA ATTATCGAAA ATGATAAAGG 2820 

AGAAATCCGT CGTCTGGAAA AGATGGCTGA CAAGGTTTTC AAATACGAAG ACCAAATGGC 2880 

TGCTTTGACT GACGACCAAC TAAAAGCAAA AACAGTTGAA TTTAAGGAAC GTTATCAAAA 2940 

TGGAGAATCA CTGGATTCAT TGCTTTACGA AGCATTTGCG GTTGTCCGTG AAGGTGCCAA 3000 

ACGTGTCCTA GGTCTCTTCC CTTATAAGGT TCAGGTCATG GGGGGGATTG TTCTTCACCA 3060 

TGGTGACGTG CCAGAGATGC GTACAGGGGA AGGGAAAACC TTGACTGCGA CCATGCCGGT 3120 

ATACCTCAAT GCCCTTTCAG GTAAAGGGGT TCACGTAGTT ACGGTTAATG AATACCTGTC 3180 

AGAACGTGAC GCGACTGAGA TGGGTGAATT GTACTCTTGG CTTGGTTTGT CAGTAGGGAT 3240 

TAACTTGGCT ACCAAATCTC CAATGGAGAA AAAAGAAGCC TATGAGTGTG ATATTACTTA 3300 

CTCAACTAAC TCAGAAATCG GATTTGACTA CCTTCGTGAC AACATGGTCG TTCGCGCCGA 3360 

AAACATGGTA CAACGTCCGC TTAACTATGC CTTGGTCGAT GAGGTTGACT CTATCTTGAT 3420 

TGACGAGGCT CGTACACCTT TGATTGTATC AGGTGCCAAT GCGGTTGAAA CCAGTCAGTT 3480 

GTATCACATG GCAGACCACT ATGTAAAATC TTTGAACAAA GATGACTACA TCATCGATGT 3540 

GCAGTCTAAG ACTATTGGTT TGTCTGATTC AGGGATTGAC AGGGCTGAAA GCTACTTCAA 3600 

ACTTGAAAAC CTCTATGACA TCGAAAACGT GGCTTTGACT CACTTTATCG ATAACGCCCT 3660 

TCGTGCCAAC TACATCATGC TTCTCGATAT TGACTATGTG GTGAGCGAAG AGCAAGAAAT 3720 

CTTGATTGTC GACCAATTTA CAGGTCGTAC CATGGAAGGT CGTCGTTATT CTGATGGATT 3780 

GCACCAAGCT ATTGAAGCCA AAGAAGGTGT GCCAATCCAG GATGAAACCA AGACATCTGC 3840 

CTCAATCACG TACCAAAACC TCTTCCGTAT GTACAAGAAA TTGTCTGGTA TGACGGGTAC 3900 

AGGTAAGACT GAGGAAGAAG AATTCCGTGA AATCTACAAC ATTCGTGTTA TTCCAATCCC 3960 

AACAAACCGT CCTGTTCAAC GTATTGACCA CTCAGACCTT CTTTATGCAA GTATCGAATC 4020 

TAAGTTTAAA GCGGTTGTCG AAGACGTTAA GGCTCGTTAC CAAAAGGGTC AACCTGTCTT 4080 

GGTTGGTACA GTAGCGGTTG AAACTAGTGA CTACATTTCT AAGAAATTGG TTGCAGCTGG 4140 

TGTTCCTCAC GAAGTCTTGA ATGCCAAAAA CCACTATAGA GAAGCCCAAA TCATCATGAA 4200 

TGCTGGTCAA CGTGGTGCCG TTACCATCGC AACCAACATG GCGGGTCGTG GTACCGACAT 4260 

CAAGCTTGGT GAAGGTGTTC GTGAACTTGG AGGACTTTGT GTTATTGGTA CAGAACGTCA 4320 

TGAAAGTCGT CGTATCGATA ACCAGCTTCG TGGACGTTCA GGTCGTCAAG GAGATCCAGG 4380 

TGAGTCACAA TTCTACCTAT CTCTTGAAGA TGATTTGATG AAACGTTTTG GTTCTGAACG 4440 

CTTGAAGGGA ATCTTTGAAC GCTTGAACAT GTCTGAAGAG GCCATTGAGT CTCGCATGTT 4500 
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GACGCGTCAG GTTGAAGCAG CTCAGAAACG TGTCGAAGGA AATAACTACG ATACCCGTAA 4560 

ACAAGTCCTT CAATACGATG ATGTCATGCG TGAACAACGT GAGATTATCT ATGCTCAACG 4620 

TTACGATGTC ATCACTGCAG ATCGTGACTT GGCACCTGAA ATTCAGTCTA TGATCAAACG 4680 

CACGATTGAA CGTGTCGTTG ATGGTCATGC GCGTGCCAAA CAAGATGAAA AACTAGAGGC 4740 

AATTTTGAAC TTTGCTAAGT ACAACTTGCT TCCTGAAGAT TCTATTACGA TGGAAGACTT 4800 

GTCAGGCTTG TCTGATAAGG CCATCAAGGA AGAGCTTTTC CAACGTTCCT TGAAGGTTTA 4860 

CGATAGTCAG GTTTCAAAAC TACGCGATGA AGAAGCAGTT AAAGAATTCC AAAAAGTTTT 4920 

GATTCTACGA GTGGTGGATA ACAAGTGGAC AGATCATATC GATGCCCTTG ATCAATTGCG 4980 

TAACGCGGTT GGACTTCGTG GCTATGCTCA GAACAACCCT GTTGTTGAGT ATCAGGCAGA 5040 

AGGTTTCCGT ATGTTTAATG ATATGATTGG TTCGATTGAG TTTGATGTGA CACGCTTGAT 5100 

GATGAAAGCA CAAATTCATG AACAAGAAAG ACCACAGGCA GAACGTCATA TCAGTACAAC 5160 

AGCGACTCGC AATATCGCTG CTCACCAAGC AAGTATGCCA GAAGATTTGG ATTTGAGCCA 5220 

GATTGGACGC AATGAACTTT GCCCATGTGG TTCTGGTAAG AAATTTAAAA ACTGTCACGG 5280 

TAAAAGACAA TAAAATGAGA TAGTTTAGAG GCGGATATCT TGTGAAAAGT AAATTTTTAC 5340 

TGGGTATCCG TTTGCTTTAT AAGGAGATGA GTTATGGTAT TTACAGCAAA AAGCTCTAAA 5400 

ATAAATATAG AAGAAGTTCG TGCCTTGTCA AAATTAGAAG GTCAGGCTTT GGAGAGGAAA 5460 

TCACAGCGAG ATCAAGAGCT AGAAGCCATT ATACGTGGAG AAGACCAGCG AATTCTCTTG 5520 

GTAATCGGGC CATGCTCATC TGACAACGAA GAAGCTGTCC TTGAATACGC TAAGCGTTTG 5580 

GCAGTCCTAC AAGAAGAAGT GGCAGATCGT ATCTTTATGG TTATGCGTGT TTATACTGCC 5640 

AAACCCCGTA CCAACGGAGA TGGCTATAAG GGCTTGATTC ACCAGCCTAA CGCGACAGAA 5700 

GCGCCTAGTC TTATCAATGG AATCAAAGCC GTTCGCCATC TTCACTATCG TGTCATCACA 5760 

GAAACAGGGA TGACAACTGC TGATGAAATG CTTTATCCTG AAAACCTTCC GCTTGTAGAT 5820 

GATTTGATTT CTTACATGGC AGTTGGTGCC CGTTCAGTTG AAGACCAGCA ACACCGCTTT 5880 

GTGGCAAGTG GGGCAGGATT TTCTACTGGT TTTAAAAATC CAACCTCTGG AAATCTCAAT 5940 

GTCATGTTTA ATGGGATTTA TGCTGCTCAA AACAAACAAA GTTTCCTTTT CTTAGGAAAA 6000 

GAAGTAGAAA CAACTGGGAA CCCGCTTTCA CACGCTATTC TTCGTGGTGC TCTTAATGAG 6060 

TATGGAAAAA ATATTCCCAA CTACTATTAT GACAATTTAA TTGATACCAT TGCCCAGTAT 6120 

GAGAAAATGG GCTTGGAAAA TCCTTTTATC ATCATTGATA CCAATCATGA CAATTCTGGT 6180 

AAGCAGTATA TTGAACAGAT CCGAATTGTC CGCCAGACCT TGATTAACCG TGCTTGGAAT 6240 
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GAAAAAATTA AGCAGTTCGT TCGTGGTTTT ATGATTGAGT CTTATCTGGA AGATGGTCGA 6300 

CAAAATGAGC CAGAAGTATT TGGTAAGTCT ATCACAGACC CTTGCCTGGG TTGGGATAAC 6360 

ACAGAAGCTC TTGTCAGAGA AATTTACAAA ACGTTAGGAG AATAAGATGG CATTTATTGA 6420 

AAAAGGTCAA GAAATCGATA TGGAAGTCAT CAAGGCTGAA ACCCAATTGT CTGCGGAAGC 6480 

CTTGAGACTC AAGGAAAGCC GTGACAGGGA ATTGGCAGAT ATTATTTCAG GGGAAGATGA 6540 

CCGTATTCTC TTGGTGATTG GTCCTTGCTC TTCTGATAAT GAAGAGGCGG TCTTGGAATA 6600 

TGCTCGCCGT TTATCTGCCT TGCAAAAGAA GGTAGCGGAT AAGATTTTCA TGGTCATGCG 6660 

CGTGTATACT GCTAAGCCTC GTACCAATGG AGACGGCTAT AAAGGATTAG TTCACCAGCC 6720 

AGATACTTCT AAGGCTCCAA GCCTGATTAA TGGCTTGCAG GCTGTGCGCC AGTTGCACTA 6780 

CCGCGTGATT ACAGAGACTG GTTTGACAAC GGCAGATGAG ATGCTTTATC CGTCAAATCT 6840 

GATCTTGGTG GATGACTTGG TCAGCTACCA TGCCGTTGGA GCTCGTTCTG TGGAAGACCA 6900 

AGAGCACCGC TTTGTGGCTT CTGGGATTGA TGCACCAGTA GGGATGAAAA ATCCAACCTC 6960 

AGGAAATTTG GGTGTTATGT TTAACGCCAT CTATGCTGCT CAAAACAAGC AAACCTTCCT 7020 

TTATCATGGG CAGGAAGTTG AGACATCAGG TAATCCTTTG GCCCATGTTA TCCTCCGTGG 7080 

AGCAGTCAAC GAGTATGGCA ATTATATGCC GAATTACTAC TATGAAAATC TACTCCAAGC 7140 

CATTGAACGC TATGAAACCA TGGGACTTGA AAATCCTTTT ATCCTCATTG ACACCAACCA 7200 

TGATAACTCA GGCAAGCAAT ATATGGAGCA GATTCGAATT GTTCGCCAGA CCTTGCAGAA 7260 

TCGTGATTGG AATGAGAAAA TTAAAAAGAC GGTTCGAGGA TTTATGATTG AATCTTACCT 7320 

AGCAGATGGT CGTCAAAACC AACCAGAGAT CTTTGGTTGC TCTATTACTG ACCCTTGCCT 7380 

AGGTTGGGAA AATACAGAGG CCTTGGTAGA AGAGATTTAT GTTACCTTGA CAAAATAAGT 7440 

GAAAAGGATG GAGTTGGGGA ATCTCAACTC CTTTTGATGA GAATGATAGT TGGACACGGA 7500 

ATTGACATCG AAGAATTGGC TTCGATAGAA AGCGCAGTTA CACGACATGA AGGATTTGCT 7560 

AAGCGTGTAC TGACCGCTCA GGAAATGGAG CGCTTCACCA GTCTCAAAGG ACGCAGGCAA 7620 

ATAGAATATT TAGCTGGTCG CTGGTCGGCT AAGGAGGCCT TTTCCAAGGC TATGGGAACG 7680 

GGCATTAGCA AGCTCGGTTT TCAGGATTTG GAAGTCTTGA ACAATGAACG TGGGGCGCCT 7740 

TATTTTAGTC AGGCACCATT TTCAGGAAAG ATTTGGCTGT CTATCAGCCA CACCGATCAG 7800 

TTTGTGACAG CCAGTGTCAT TTTGGAGGAA AATCATGAAA GCTAGTCCAC ATAGACCAAC 7860 

CAAGGCTCTG ATTCATCTGG GAGCTATTCG ACAAAATATT CAGCAAATGG GGGCTCATAT 7920 

CCCTCAAGGA ACGCTCAAGT TGGCTGTGGT TAAGGCCAAT GCTTATGGTC ATGGAGCTGT 7980 

TGCCGTTGCC AAGGCAATTC AAGATG ATGT TGATGGCTTT TGCGTTTCCA ATATCGATGA 8040 
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AGCCATTGAA CTCAGACAAG CTGGACTCAG CAAGCCAATC CTCATTTTAG GAGTTTCTGA 8100 

AATCGAAGCT GTTGCTCTAG CTAAAGAATA TGACTTCACC TTGACAGTGG CTGGACTGGA 8160 

GTGGATTCAA GCACTCTTAG ATAAGGAAGT GGACCTAACT GGATTGACAG TCCACCTCAA 8220 

GATTGATTCA GGGATGGGAC GGATTGGTTT TAGAGAGGCA AGTGAGGTTG AGCAGGCTCA 8280 

AGATTTGCTC CAACAACACG GTGTTTGTGT TGAAGGAATC TTTACCCACT TTGCTACTGC 8340 

TGATGAGGAA TCAGATGACT ATTTTAATGC CCAGTTAGAA CGGTTTAAAA CTATTTTAGC 8400 

TAGTATGAAG GAAGTTCCAG AGCTGGTTCA TGCTAGCAAT TCTGCAACGA CTCTTTGGCA 8460 

TGTAGAGACT ATTTTCAATG CGGTTCGTAT GGGAGATGCC ATGTATGGCC TCAATCCAAG 8520 

TGGAGCGGTC TTGGATTTGC CTTATGATTT GATACCGGCC TTGACCTTGG AGTCTGCTCT 8580 

GGTTCATGTC AAGACAGTTC CAGCTGGAGC TTGCATGGGC TATGGAGCAA CTTATCAAGC 8640 

GGATAGCGAG CAAGTCATCG CGACCGTGCC AATCGGGTAT GCAGATGGAT GGACAAGAGA B700 

CATGCAAAAT TTCTCTGTCT TGGTAGATGG CCAAGCTTGC CCAATTGTCG GCAGGGTTTC 8760 

GATGGACCAA ATCACTATTC GATTGCCTAA GCTTTATCCG CTAGGAACCA AGGTAACCTT 8820 

GATTGGCTCC AATGGGGATA AGGAAATCAC TGCAACTCAG GTAGCGACCT ACCGCGTAAC 8880 

CATTAACTAT GAGGTGGTTT GCCTCCTCAG CGACCGTATT CCGAGAGAAT ATTATTAGAA 8940 

AAGAAAGGAG TGGAGCATGA ATCTACATCA ACCCTTGCAT GTCTTGCCTG GTGTGGGACC 9000 

AAAGTCAGCA GAAAAATACG CCAAACTAGG AATTGAAAAC TTGCAAGATC TCTTGCTCTA 9060 

CTTTCCTTTC CGTTATGAAG ACTTCAAAAC CAAGCAGGTG CTGGAGCTGG AAGACGGTGA 9120 

GAAGGCAGTT CTTTCTGGTC AGGTAGTGAC TCCTGCTAGT GTCCAGTATT ATGGTTTCAA 9180 

GCGCAATCGC CTGCGTTTTA GTCTCAAGCA GGGAGAGGTC GTTTTTGCGG TGAATTTCTT 9240 

TAACCAGCCC TATCTGGCTG ATAAAATAGA GTTGGGAGCA ACCCTTGCTG TCTTTGGAAA 9300 

ATGGGACCGC GCTAAGGCTA GTCTGACTGG GATGAAGGTT CTGGCTCAGG TAGAAGATGA 93 60 

CCTCCAGCCT GTCTATCGTC TGGCTCAGGG AATCAGTCAG GCCAGTCTGG TCAAGGTCAT 9420 

CAAGACGGCT TTTGATCAGG GACTGGACCT CTTGATAGAA GAAAATCTGC CCCAGTCTTT 9480 

ACTAGACAAA TACAAACTCA TGTCCCGTTG TCAGGCAGTC CGTGCTATGC ATTTTCCAAA 9540 

GTATTTGGCA GAATACAAGC AGGCTCTTCG CCGTATAAAG TTTGAGGAAC TCTTTTATTT 9600 

CCAAATGCAG CTGCAGATGC TCAAGTCTGA AAATAGAGTT CAGGGAAGTG GTCTGGTTCT 9660 

GAATTGGTCT CAGGAAAAAG TGACAGCAGT TAAAGTAAGT CTTCCTTTTG CCCTGACCCA 9720 

AGCTCAGGAA AAGAGTTTGC AGGAAATTTT AACTGATATG AAGTCCGACC ACCACATGAA 9780 
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TCGTCTCCTA CAAGGGGATG TGGGGAGTGG AAAAACGGTA GTCGCTGGCT TGGCCATGTT 9840 

TGCGGCAGTG ACAGCAGGTT ATCAGGCTGC CCTAATGGTA CCAACAGAAA TCCTCGCAGA 9900 

GCAACACTTT GAGAGTTTAC AGAACCTTTT TCCCAATTTG AAACTGGCTC TCTTGACAGG 9960 

TTCCTTGAAA GCTGCAGAAA AGAGAGAAGT CTTGGAGACC ATTGCCAAGG GTGAGGCTGA 10020 

TTTGATTATA GGAACTCACG CTCTGATACA AGATGGGGTG GAGTATGCTC GTCTTGGTTT 10080 

GATTATTATC GATGAGCAGC ACCGTTTTGG TGTAGGGCAA AGGCGTATTT TACGGGAAAA 10140 

AGGTGACAAT CCAGATGTCC TCATGATGAC GGCGACTCCC ATTCCACGGA CGCTTGCCAT 10200 

CACAGCCTTT GGAGATATGG ATGTTTCCAT TATCGACCAG ATGCCAGCAG GTCGGAAGCC 10260 

TATTGTGACG CGCTGGATCA AACATGAGCA ACTACCTCAG GTCTTGACTT GGTTAGAGGG 10320 

GGAAATTCAA AAAGGTTCCC AAGTCTATGT CATCTCTCCT TTGATTGAAG AATCAGAAGC 10380 

TCTAGATTTG AAAAATGCCA TTGCCTTATC AGAGGAGTTG ACGACTCATT TTGCAGGCAA 10440 

GGCAGAGGTG GCTCTTCTAC ATGGTAGGAT GAAGAGTGAC GAAAAAGACC AGATCATGCA 10500 

GGATTTCAAG GAGAGAAAGA CGGATATTCT GGTTTCGACG ACGGTTATTG AGGTTGGGGT 10560 

CAACGTTCCC AATGCGACTG TCATGATTAT CATGGATGCC GATCGCTTCG GTCTCAGTCA 10620 

ACTTCACCAG CTTAGAGGTC GTGTCGGTCG GGGGGACAAG CAGTCCTACG CTGTTCTCGT 10680 

TGCTAATCCC AAGACGGATT CTGGGAAAGA CCGCATGCGT ATCATGACAG AAACGACCAA 10740 

TGGATTTGTC CTTGCGGAGG AAGATTTGAA AATGCGTGGT TCTGGTGAGA TTTTTGGAAC 10800 

CAGACAGTCA GGACTTCCAG AGTTCCAAGT GGCTGATATT ATCGAAGATT TTCCGATTTT 10860 

AGAAGAAGCA AGAAAGGTTG CTAGCTACAT TAGTTCTATA GAAGCTTGGC AAGAAGATCC 10920 

AGAGTGGCGC ATGATTGCCC TTCATCTGGA AAAGAAAGAA CATCTGGATT AAGCTTTCTC 10980 

TAAGGAAAAC TTATACTCAA TGAAAATCAA AGAGCAAACT AGGAAGCTAA CCGCAGGTTG 11040 

CTCAAAACAC TGTTTTGAGG TTGTGGATGA AACTGACGAA GTCAGCTCAA AACACCGTTT 11100 

TGAGGTGGCA GATAGAACTG ACGAAGTCAG TAACATATAT ATACGGTAAG GCGACGCTGA 11160 

CGTGGTTTGA AGAGATTTTC GAAGAGTATT AAGCTAGTTT TTAGGTTTGG CTCTTATACT 11220 

AGAGTCATCA AAAAGAAACG AGGACTCTCA TATGACAGTA ACGATTAAAG TAAATTACCA 11280 

AACCACTTTC CAAAAGAAGG AAGCAAAAAA CTAGTATAAA CAGAAGAGAG AGCGAAATGC 11340 

TCTTTTTTCG TTTCTAAAAC TACTTTCAGC CCATCATCCT AAAAGTAAAG AATCTAAATT 11400 

CACTTTCTAT TTACCCTTCT TTCTTGCATT GATTACATAG ATATGCTACA GTTGTGGTAA 11460 

CGATTACAAA ATAAAAGGAG CATGCTATGA AAAATCCAGC TTTGCTAGAA GAAATTAAGA 11520 

CCTATAGAGG AAGGGATGAG GTTCCGGAAG ACTTTGATGA TTTCTGGGAT GGGGAAGTGA 11580 
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AAAATGTTTC 


CACGCTTCCA 


TCCTACCACT 


TGGAGGAAAG 


AGATTTCCAC 


ATTCCTCAAG 


11640 


TCAAGTGCTA 


TGAGTTAACA 


TTTGAAGGAA 


GCAAGGAAGG 


AAAGGTCTAT 


GCACGCATTG 


11700 


TTCTTCCAAA 


GAGTGAGGAG 


AAGGTCCCAT 


TAATCTTCCA 


TTTTCATGGT 


TATATGGGAC 


11760 


GTGGCTGGGA 


CTGGGCCGAC 


ATGCTGGGCT 


TCACCGTAGC 


TGGTTACGGT 


GTTGTTTCCA 


11820 


TGGATGTGCG 


GGGCCAGTCA 


GGTTACTCAC 


AAGACGGCTT 


GCGTTCTCCT 


TTAGGAAATA 


11880 


CCGTGAAGGG 


GCATATTATC 


CGTGGTGCTG 


TGGAAGGTCG 


GGACCACCTC 


TTTTATAAGG 


11940 


ATGTTTATCT 


GGATATTTAC 


CAGTTGGTCG 


AAATTGTTGC 


TAGTCTGTCT 


CAGGTTGATG 


12000 


AGAAGCGTCT 


TTCTAGCTAT 


GGTGCCTCAC 


AAGGAGGGGC 


TCTAGCTCTA 


GTTGCAGCAG 


12060 


CGCTCAATCC 


TCGAATTCAG 


AAAACAGTTG 


CCATTTATCC 


CTTCTTGTCA 


GACTTCAGAC 


12120 


GGGTGATTGA 


GATTGGTAAT 


ACTAGCGAGG 


CTTACGACGA 


ACTTTTCCGT 


TATTTCAAGT 


12180 


TTCACGACCC 


CTTCCATGAA 


ACAGAGGAGG 


AAATCATGGC 


GACCCTTGCC 


TATATCGATG 


12240 


TCAAAAATCT 


TGCCCATCGT 


ATCCAAGGTG 


AGGTTAAGAT 


GATTACGGGC 


TTGGACGACG 


12300 


ATGTTTGCTA 


TCCCATTACC 


CAGTTTGCGA 


TTTATAATCG 


TCTGACCTGC 


GATAAAACCT 


12360 


ATCGCATCAT 


GCCTGAGTAT 


GCTCACGAAG 


CCATGAATGT 


ATTTGTCAAT 


GACCAAGTCT 


12420 


ACAACTGGCT 


CTGTGGAAGT 


GAGATTCCTT 


TTAAATATCT 


AAAATAAGGA 


GTCGACTCTA 


12480 


AGCACAAAAT 


CTTAAAAATT 


ACAAACACGC 


ATAGTATCAG 


GGGATTAAGA 


AAACTTTATA 


12540 


CTATGCGTTT 


TATCATGGAA 


ATATAGTAAA 


ATGAAATAAG 


AACAGGACAA 


ATCGATCAGG 


12600 


ACAGTCAAAT 


CGATTTCTAA 


CAATGTTTTA 


GAAACAAATG 


TGTACTATTC 


TAGTGTCAAT 


12660 


CTATTATATT 


TATAGAATTT 


TTTGTTGCTA 


GATTTGTCAA 


ATTGCTTAAA 


ATAATTTTTT 


12720 


TCAGAAAGCA 


AAAGCCGATA 


CCTATCGAGT 


AGGGTAGTTC 


TTGCTATCGT 


CAGGCTTGTC 


12780 


TGTAGGTGTT 


AATACTTTTC 


AAAAATCTCT 


TCAAACCACG 


TCAGCTTCGC 


CTTGC 


12835 



(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 5020 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 
GGGGATATGA AGAACAAAAG AATATTTAAA GACTTCCAAG CTTCAAAAAT GAGTTTAAAC 60 
ATTTACACAA GCCCCTTGTT AGCCTTTGTT TTTGTCTTCA TAGGAGAGTT TGTGGCTTTT 120 
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ACTTTGTATG GTATTGGCTT GTTAGCTCTC ATCGGACTTG CTAGAAATTT TGGAGAGGCT 180 

GGTCAAAATC TTGCAAGCTA CTTGCAGACC TTGCATCAGA GCTTGACGGA TAAAACAAGT 240 

GACTTTCGTT TAATTTTAGG ATTACTGGCC TTTGGTTATT CTTAACACTG TGTTCAGATG 300 

GACAAGAAAA GTTGAGAAAA GACCTATTCG AACCTTGGGA TTTTATAGAG AGAATTTCCT 360 

CAGCAATCTT CTGAAAGGAT TTAGTCTAGG CCTGGCACTT TTTCTTCTGA CCTTGTTAGG 420 

TTTAGTGGTC TTAGGTCAAT ATCGTTTGGA ATCCATTCAC TTGAATCCTT ATTCTCTTGC 480 

CTTTGTCGTC TTTACTATCC CATTTTGGAT TTTACAGGGG ACAGCAGAAG AAGTGGTGGC 540 

CCGTGCTTGG CTACTTCCTC AATTGGCCTC AAGAACCAAT CTAAAACTAG CTATTCTTAT 600 

ATCTAGCCTG TTCTTTACCC TGCTTCATAT GGGCAATTCT GGTCTCACCC CTCTATCTCT 660 

AGTAAATCTC TTTTTATTCG GAGTTGCCAT GGCTCTTTAC CTTCTCAAAA CTGATACAGT 720 

TTGGGGTGTT GCAGGTATTC ATGGTGCTTG GAATTTTGCT CAGGGTAATC TCTTTGGGAT 780 

TTTAGTTAGT GGTCAACCGT CAGaACGTCT CTGATGACCT TTTTACCACA AGGCAATCAA 840 

GATTGGCTAT CAGGTGGTTC TTTTGGCATA GAAGGTTCCA TTATGACAAG TCTGGTATTA 900 

CTACTGCTGA TTGTCTATCT TGCTAATAAA TTAAAGAAAG AAAATGAAAG GATGTGACTT 960 

CGGTCCGTCC TTTTCTTCGT GAAAATACTA TAAGTATGCT AAAATAGGAA TAGCACATGG 1020 

AGAGAGGATT CTTATGATCA ATCACATTAC AGATAATCAA TTTAAACTAG TATCAAAATA 1080 

TCAACCATCA GGAGATCAAC CCCAAGCTAT CGAGCAGTTG GTGGATAACA TTGAGGGGGG 1140 

AGAAAAAGCT CAGATTCTGA TGGGGGCGAC TGGAACAGGG AAGACCTATA CTATGAGTCA 1200 

GGTCATTTCT AAAGTCAATA AACCAACTCT GGTTATTGCC CACAATAAAA CTCTGGCTGG 1260 

TCAGCTCTAT GGGGAGTTTA AGGAATTTTT CCCTGAAAAT GCAGTTGAGT ATTTCGTATC 1320 

CTACTATGAT TATTACCAGC CAGAGGCCTA TGTCCCTTCT AGCGATACCT ATATTGAGAA 1380 

GGATAGTTCT GTCAATGACG AGATTGACAA GCTTCGCCAC TCAGCTACCT CAGCCCTTTT 1440 

GGAGCGTAAT GATGTTATTG TCGTGGCCTC AGTCTCTTGT ATCTATGGTT TGGGTTCGCC 1500 

CAAGGAATAC GCTGATAGTG TCGTTAGTCT CCGTCCTGGT CTAGAGATTT CTCGTGATAA 1560 

ACTCTTGAAT GACTTGGTCG ATATTCAGTT TGAACGTAAT GATATTGATT TCCAACGCGG 1620 

AAGATTTCGC GTTCGTGGGG ATGTGGTAGA GATTTTCCCA GCTTCCCGAG ATGAACATGC 1680 

CTTTCGAGTA GAATTTTTTG GAGACGAAAT TGACCGTATT CGTGAAGTTG AGGCTCTGAC 1740 

AGGTCAGGTG TTGGGAGAAG TGGATCATTT AGCGATTTTC CCAGCGACAC ACTTTGTGAC 18O0 

CAATGACGAC CACATGGAAG TTGCCATTGC AAAGATTCAG GCCGAGTTGG AAGAACAATT I860 

AGCTGTCTTT GAAAAGGAAG GTAAACTGCT TGAAGCCCAG CGTTTGAAAC AGCGGACAGA 1920 
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GTATGATATC GAAATGTTGC GTGAGATGGG CTATACCAAT GGGGTTGAAA ATTATTCTCG 1980 

CCACATGGAT GGACGGAGCG AAGGAGAGCC TCCTTATACG CTTCTCGACT TCTTCCCAGA 2040 

TGATTTCTTG ATTATGATTG ACGAGAGTCA TATGACCATA GGGCAAATCA AGGGCATGTA 2100 

CAATGGAGAC CGTTCGCGTA AAGAAATGCT GGTTAATTAT GGTTTCCGTT TGCCGTCTGC 2160 

TTTGGACAAT CGTCCTCTCC GTCGGGAGGA GTTTGAGAGT CACGTTCATC AGATTGTTTA 2220 

CGTTTCAGCG ACACCTGGTG ACTATGAAAA TGAACAGACC GAGACAGTGA TTGAGCAAAT 2280 

CATTCGTCCA ACGGGACTCT TGGATCCAGA GGTGGAAGTC CGTCCGACTA TGGGACAGAT 2340 

TGATGACCTC TTGGGTGAAA TCAATGCCCG CGTTGAAAAA AATGAGCGTA CCTTTATCAC 2400 

AACTTTGACC AAGAAAATGG CAGAGGATTT GACCGACTAC TTCAAGGAAA TGGGTATCAA 2460 

GGTCAAGTAC ATGCACTCGG ATATCAAGAC CTTGGAACGG ACGGAGATTA TCCGTGACCT 2520 

GCGCTTGGGT GTCTTTGATG TCTTGGTCGG AATTAACCTG CTCCGTGAAG GAATTGACGT 2580 

TCCTGAAGTG AGCCTCGTAG CTATTCTCGA TGCTGACAAG GAAGGTTTCC TTCGCAACGA 2640 

ACGTGGACTC ATCCAGACCA TTGGACGTGC TGCACGTAAT AGCGAAGGTC ATGTTATCAT 2700 

GTATGCGGAC ACGGTTACCC AGTCTATGCA ACGTGCTATC GATGAAACTG CCCGCCGTCG 27 60 

CAAAATCCAG ATGGCCTATA ATGAAGAACA TGGTATCGTT CCACAAACCA TCAAGAAAGA 2820 

AATCCGTGAC TTGATTGCTG TGACCAAGGC AGTTGCTAAG GAAGAAGACA AGGAAGTCGA 2880 

TATCAATAGC CTCAACAAAC AAGAGCGCAA AGAACTAGTC AAAAAGCTTG AGAAACAAAT 2940 

GCAAGAAGCA GTTGAAGTGC TTGACTTTGA ACTAGCAGCT CAGATTCGTG ATATGATGCT 3000 

GGAAGTCAAG GCCTTGGATT AGGGGAATAG TATGATTTAT TTAAGAAAGT TAAAGAAAGA 3060 

AGATTTGATG TCTTTATGGG AAATGGCTTA TTCACAGCTT AATCCAGTTT GGAAACAGTA 3120 

TGATGCTCCC TATTATGATG ATTATCAGTA TTTTTCAAAT TTTAAAGAAT TCGAACTACA 3180 

AAAATCAGAA TCCATTTTAA GCAACTCAAA TCGCCTTGGT ATTTTTGTTG ATGATAAACT 3240 

AGTTGGGACT GTTTCGCGTT ATTGGGTATG TAAAGAAACA AGATGGATGG AATTGGGAAT 3300 

TGGTATTTAT GATAAAAAAT TCTGGAACAC TGGTATTGGG AAAGTTGCTA TGTTGCAGTG 3360 

GATAGATAGG ACGTTTCAGG ATTACTTGGA GTTGGAGCAT CTGGGTTTGA CAACTTGGTC 3420 

AGGAAATATT GGTATGATGA AACTTGCTGA AAAATTAAGA ATGAAAAAAG AAGCTCATAT 3480 

TCCAAAAGTT CGTTATTATC AAGGTAAATA TTTTGACAGT ATTAAATATG GTATTTTGAG 3540 

AGAAGACTGG GAGAAAATAA ATGACGGTTA TTATCAAATC AATGGAAACT CCTGAAGAGA 3600 

TAGAAGGTAA ATCCTTCGTT CACTGGCAAA CGTGGAGAGA GGCTTATGAT GACCTTTTGC 3660 



WO 98/18931 



PCT/US97/19588 



966 



CTGCGGAATT 


TCAGGAGACA 


ATGACATTAG 


AAAGATGTCG 


ACTCTTTAGT 


CAAAAGTATC 


3720 


CAGAAAATAC 


ATTGATTGCG 


ATGGATGGTG 


TGAAGATAGT 


TGGTTTTATA 


AGTTATGGCA 


3780 


ACTGTCGTGA 


TGAGACTATT 


CAAGCTGGTG 


AAATTATTGC 


TTTATATGTT 


TTAAAAGACT 


3840 


ATTATGGAAA 


AGGAATCGCA 


CAAAAGTTAG 


TGAAAGCAGC 


TTTGACTGAT 


CTTAATCATT 


3900 


TTTCTGAAAT 


TTTCTTATGG 


GTATTGAAAG 


ATAACAAGCG 


CGCCATTGCT 


TTCTATCAAA 


3960 


AAATGGGTTT 


TACTTTTGAT 


GGACAAGAAA 


AAATACTTGA 


ACTTGGAAAG 


CCTATAAAGG 


4020 


AAAAACGGAT 


GGTATTCTAT 


TCTAAATAAT 


TCTCAAAAGT 


AAAAGCTAAT 


ATGGTACCAA . 


4080 


GTCTGAAAAT 


TTAATAAATT 


AGAAAGCGAG 


TAAATTTATG 


TCCCGTTCCC 


AATTAACAAT 


4140 


TTTAACAAAT 


ATCTGTCTGA 


TTGAAGACCT 


CGAAACTCAG 


CGCGTGGTGA 


TGCAGTATCG 


4200 


CGCCCCTGAA 


AACAATCGCT 


GGTCTGGTTA 


TGCCTTTCCT 


GGAGGTCATG 


TAGAAAATGA 


4260 


TGAGGCTTTT 


GCGGAGTCTG 


TCATTCGTGA 


AATCTACGAA 


GAAACAGGGT 


TGACTATCCA 


4320 


AAATCCTCAA 


CTTGTCGGCA 


TTAAAAATTG 


GCCACTAGAT 


ACAGGTGGGC 


GCTATATTGT 


4380 


CATTTGTTAT 


AAGGCGACTG 


AGTTCTCTGG 


TACCCTTCAA 


TCTTCAGAAG 


AGGGAGAAGT 


4440 


TTCTTGGGTG 


CAAAAAGACC 


AGATTCCAAA 


CTTAAATCTG 


GCCTATGATA 


TGCTACCATT 


4500 


GATGGAAATG 






AG AGTTTTT C 




GTACAGAAGA 


4560 


CGATTGGGAA 


AAGAAAATCT 


TCTAGTCTTT 


TACTAAATAA 


CCTAGCTGAT 


CCAAGGCCTC 


4620 


CTCGATATAG 


TGGAGGTCTT 


GTTGTGTCTC 


GGCTTCAACT 


AGGTGATAAT 


GAATACCATC 


46S0 


TGTTAACTCA 


GAAATTGGCT 


TAAAGTCAGA 


ACGTTCAACT 


TGTTCTAGAA 


AATGTTGCAC 


4740 


GTCGCGGCGA 


CAGGTCAGTT 


TTAGTAAGGT 


TTCAATCTCT 


CCATAAACAG 


GATGATCAAT 


4800 


CAAGATATTT 


TGAACGCGAC 


CACCATTATC 


TACGATAGCA 


AGTAATTCTC 


GTCCAATTTC 


4860 


TTCAACTTCA 


TGCTTGACCT 


TAAATAATTT 


GTGATGATAA 


GTATTTGCAT 


TAGCATCTTT 


4920 


ATAGATATAA 


CCACGATTGG 


TAGATAGAAT 


TGGAGATCCA 


TCAGCTCTTA 


AAATTGCAAT 


4980 


ATCTTGAACA 


ATAACTTGTC 


GAGTGACATG 


AAAGTGCTCA 






5020 



(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 965 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 
AAAAAGTGGC AATCCATTGA TTGGCCACTT CATTTAGAGA ATTATCGTCT CGCCCTTGAA 



60 
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GAAGAAGGTC GTGTAGTACT TGAGTTACTG CTATCGCTAG AACTACTACT TGAACTGCTG 120 

GAGCTGGATG GAGTTGGTAG ACTCCCCACA ATACTAGACC AAGCATTCTG ATAATCCGCA 180 

TCACTTCCGC CAATAGCAAA GCGATAACTT GTCGCTGGCG CTCCTGACTT ATTAGCCCAA 240 

TAGCTGGTAA CAGTCGAACC TGTGACCTCT ACTTCTTTTC CTTCAACAGA AACCTTCTCT 300 

GGTTTTTGAC CTGTTGATTT CAAGACTTCC GATTTCACTA CACTAGGATC TAAAGCAAAG 360 

CGCTCGTTCC CCCAAATGCT TGGGGAAGCT TGCTGAATCG CATTTACCAG ATGAGCCATG 420 

TAATTAGAGT TATTAGAATA ACCTGCTCTA CGTGACAATG AATGATTATC ATCATGCCCA 480 

ATCCAGCCAC CTAGGGTTAA TCTAGGTGTC GAAAGCATGA GCCACATATT TTCGTCTTGG 540 

TTGGTTGTAC CAGTCTTCCC AATCCAATCT GCATTAGCCA GAGTAGGATT TAAAGAAGTC 600 

AGGTTAGACT TGAAGGTTGT TGTCACACGA GAGGATAGAA CTTCTCGTAG CAATCCCTGC 660 

ATAATCGTCG CAGTAGCTTT TGAATAGACT TGAACCGGTT TATCCTGATA CTCATACACC 720 

ACTCTACCAT CTGCTGCTTC AATCTTTGAA ATCACATGCT TCTGATGATA AACTCCATTA 780 

TTAGCTAAGG TCTGATAGCC ATTGGTATGC TGGGCAACTG TGACTTCAAT ACCACCACCC 840 

ATTGGCAAGC TCTCAATACC GTACTCAGGA ATCTCGTAAC CCATCTTTTC CATATAACCC 900 

TTGACATCAA CACCCTTTTC ACGGAGCATA CGATAGGTCC AGTAAGCAGG GATATTCCAT 960 

GAATAGTTCA GAGCTTCTCC CAAGGTCATC ATTCCTGTTC CCTTGCTATT AGCATACATA 1020 

ATCGGATTGC CATTAGCAAA GTTTGTTGGA TAGTTAGATA GAATCGTTTC ACTTCCCATC 1080 

AAGCCCTGGT CAATAGCAAT ACCGTAGGCC AGCAAGGGCT TGGTAGTAGA AGCTGGCGAA 1140 

CGTTTGGTAT CAAAGGCATG ATTATTTTGA TTTTCTTGAT AATTACGACC ACCTACAAAG 1200 

CCTAGAATAG CACCTGTTTG GTTATCCATC AAGACATTCC CTACTTCTAC ACGACCTGTT 1260 

CCATCGTCTA AAAGATAGCC ATAATCAGCA ACCGCACTTT GCATGGCAGA ATGAATTTTC 1320 

TGATCTATGG TAGTAGTAAT CTTATAACCA CCATTTTCAA TTTCCTTGGC TGCCAAATCT 1380 

CGATAAAACT TCTGAGTTGC CTCATTTTTC AACTCCTTAG CGGAGACATT GTCTCTCTGA 144 0 

GCTAGATAGT CATACATACG TTCTTGAGCT TCTGCCAAAG TTGTAAAGTA TAAATAGTCT 1500 

CGTGAAATTC CTGTAACCGT GCCCGATGGT AAAAAGTCCT GTTTAAGGTC ATAATCCTTG 1560 

TACTGAGAAT ACTCGTCTTT GCTTAATGCA CCTGTACGAT ACATACTGTA AAGAACTGCC 1620 

TTAGCCCGTC TTAAGCCAAT TTCTAGGTCT TCATCACTCT TCAACTCCCC AGTATTTTCA 1680 

TAAGGAGAGT AAGTAATGGG ACTCTGTGGA AGTCCTGCTA AAAATGCTGC TTGAGGAACA 1740 

GTCAACTGAC TGGCATCTAC ACCGAAAATT CCCTCAGCTG CTTGCCGAGC CCCTGCAATA 1800 
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TTCTGTCCCT TATTATTTCG GCCAAAGGGA 
TCTTTATTCA TGGCGCGTTC CAAGGCAAGA 
AAGGTCGGCG CATCCCCAAC CACCTGCTGT 
CCACTAGAGG AACCCAAACC TACAAATTTC 
ACTACACCCT TATGTTCTTT AAAGTGTTCA 
TTTTCCGAAA TTTGCTCAGA TGAGATAGAA 
ATCACCGTCC CGTCCGAATA GGTAATCTCT 
ACCAATTCTT CTGTCTGAGG CACCCGAACC 
GCAATCCCAG CTCCCAACAT TCCTCCTAGA 
GCTTTTATAC TCAGTAAAAT AGCTGGGAAA 
TTGGTACTTG AACCTTTCTT GCCAGGTCTA 
AAAAATTCCA GCATTTTTCG TTTTAATTCA 
TTATCTATTA TACCACAAAA GGGAAATTTT 
GCTAGGCTAT TGCCCAAGTT TGTGATACAA 
AAAAACACAT GCACATTTTT GATGAGCTAA 
ATGAAGAAGC TTTGCGTAAA GCCCTAGAAG 
ATCCAACTGC TGACAGCCTT CACCTAGGCC 
TGCAACTAGC AGGTCACAAA CCTTATGCGC 
ATCCGTCCTT CAAAGATGCT GAACGTAGTC 
TCAAGTCTAT CCAAGGACAA CTTTCTCGTT 
CTGTCATGGT CAACAACTAC GACTGGTTTG 
ATATTGGAAA ATACTTCACG GTCAACTACA 
TCGAAACAGG AATTTCTTAC ACTGAGTTCG 
TCGTCCTTAA CCAAGACCAT AATGTCACTC 
ATATGACAGC TGGTACCGAA TTGCTTCGTC 
CTGTTCCACT AATCACAGAT GCAACTGGTA 
TCTGGCTCAA TCCCGAAAAG ACTTCTCCAT 
TGGACGCTGA CGCTGTTCGC TTCTTGAAAA 
AAGATATTCG TAAACAATTT GAAGCAGCGC 
CTCGTGAAGT TGTTACACTT GTTCACGGAG 



968 

GCCACATTGA GATAGGTCGT 
GCATCCACAA TCTCTGCCGC 
TTAATTAGTT GCTGGGTCAA 
CCCAAGGTCG CACGAATCAC 
TCTTCTGTCG CAATGATAGC 
GTGCGCAACA AATCACTCTC 
GAAATAGAAG AGATGTCCTT 
TTGTCAAATA AGGCCACTCC 
AAACCGAGTA CAAAGAGTAA 
ATGACTGACT TATCTAAGGT 
GCTGATTTTT TATTTTTTTG 
TTTAATTGAT TTTGCATGGA 
CAATAAAATA GCCACTTTCT 
TAGGTAGAAA CAATAATTTT 
AAGAGCGTGG TTTGATATTT 
AAGGTCAAGT TTCTTATTAT 
ACCTTGTCGC AATCTTGACA 
TCGTTGGCGG TGCTACAGGT 
TCCAAACAAA AGACACAGTA 
TTCTTGACTT TGAAAATGGC 
GCAGCATCAG CTTCATTGAC 
TGATGAGTAA GGAATCTGTT 
CTTACCAAAT CATGCAAGGG 
TTCAAATCGG TGGTTCTGAC 
GTAAGGCGGA CAAGACTGGT 
AGAAATTTGG TAAATCAGAA 
ACGAAATGTA CCAATTCTGG 
TCTTTACTTT CTTGTCACTT 
CACACGAACG CTTGGCTCAA 
AAGAAGCCTA CAAAGAAGCA 



TAAAATCTCA 1860 

CTTACGAGCC 1920 

GGTTGAACCC 1980 

CGCCTTGGGT 2040 

CTTCTTCAGA 2100 

TATGGAAGCA 2160 
GACCTGATTC . 2220 

GTATCCCAAA 2280 

GTTAAATAAG 2340 

TTTAGATTTT 2400 

TTTTTGCTGG 2460 

TTTCCTCACT 2520 

TCCCTATTCT 2580 

AAAAAGGAGA 2640 

CAAACGACTG 2700 

ACTGGCTACG 2760 

AGTCGTCGCT 2820 

CTCATCGGAG 2880 

GATGGCTGGG 2940 

GAAAACAAGG 3000 

TTCCTCCGTG 3060 

AAAAAACGGA 3120 

TATGACTTCT 3180 

CAGTGGGGAA 3240 

CACGTTATCA 3300 

GGAAATGCCG 3360 

ATGAACGTGA 3420 

GATGAGATTG 3480 

AAAGTCTTGG 3540 

CTTAACATCA 3600 
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CTGAGCAACT 


CTTTGCAGGA 


AACATCAAAA 


ACCTTTCTGT 


CAAAGAGCTC 


AAACAAGGAC 


3660 


TTCGTGGTGT 


GCCCAACTAC 


CAAGTACAGG 


CAGACGAAAA 


CAACAATATC 


GTGGAACTGC 


3720 


TCGTCTCATC 


TGGTATAGTT 


AACTCAAAAC 


GCCAAGCCCG 


TGAAGACGTC 


CAAAACGGAG 


3780 


CCATCTACGT 


AAACGGCGAC 


CGCATCCAAG 


AGCTTGACTA 


TGTCTTGAGT 


GACGCTGATA 


3840 


AGTTAGAGAA 


TGAACTGACT 


GTTATCCGTC 


GTGGGAAGAA 


AAAATACTTT 


GTATTGACTT 


3900 


ACTAAACTAT 


TCAACATTTA 


TCTATAAACA 


AAGGAGTTAA 


CCTCGAGAAA 


GGTAACTCCT 


3960 


TTTTGCTGTT 


AATAACTCTC 


ATCTATCTAT 


TTTTAATAGA 


CAGGCTACGC 


AGGACAATGC 


4020 


GCAAGGTTGT 


TAGATTATGT 


AAGATAGAGA 


GATTTGAAGG 


ACTGAACCAA 


TTAAATAAGC 


4080 


CAAAGCCAAT 


CAAACTACTA 


TTTACGACAA 


CGGTATCCTG 


AATATTTTTC 


TTGATGAGTG 


4140 


TTTGCAAAGA 


TGATGATAAC 


GAATCCAACT 


CTTGGAAGAA 


ATCCAAACGA 


TTATCTAACA 


4200 


ATAAGATATC 


ACTCATCTGC 


TTAGAAATAT 


CTGCACTCTC 


ATTCATCACC 


ACACCGATAT 


4260 


CTGATAGAGT 


TAAAGCCGCT 


GAGTCATTCA 


ATCCATCTCC 


AACCATCAAA 


ATAGTGTGAC 


4320 


CTGCTTTCTG 


CAGTTTCTCT 


ACTAACTCAA 


ATTTCCCATC 


AGGTTTCAAG 


TCTGTATAGA 


4380 


CCTGATCAAA 


GGGCAAATCT 


TTGACTAATT 


CCTCTGTCCT 


AATCAAGGTG 


TCTCCTGTTG 


4440 


CCAGAATCAA 


TTTTTTCCCC 


TGTGCCTTAA 


GTTTATCCAA 


GGCTGTTTTT 


GCTTCTTTTC 


4500 


TCAAAGGAGT 


ATGAATGCAG 


AACATTCCAA 


TCAATTCATT 


TTGATAAGCC 


AAGAATAAGA 


4560 


GATTGTAGTG 


ACTCTTGTAC 


TCTTCAATTA 


AAGCATTTTG 


TTCTGAACTG 


ATATGAATCT 


4620 


GCTCATCCTG 


CATCAAGACA 


TAATTCCCAA 


TAAGAACTGG 


TTGGCCATCT 


ATATGAGATT 


4680 


TGATCCCCTT 


GCTTGCGATA 


TATTGGAGTT 


TCCCATGCAT 


TTCCTCATGT 


TCAATTCCCT 


4740 


CTATCTCAGC 


TTGCTTGACG 


ATGGCATTAG 


CAATAGGATG ATAAATGTGT 


TCCTCAAGAC 


4800 


AGGCACTGAT 


TCTGAGAATA 


TCTTCCTCAC 


TATAGTCTCC 


AAAAGGTAAC 


ACCTTTTCAA 


4860 


CTATAGGATA 


ACTAGTTGTG 


ATTGTTCCTG 


TCTTATCAAA 


CAAGAAAGTA 


TCAACTTCCA 


4920 


GATATTTCTC 


CCTGTTGTGG 


CCTCTGGCTG 


TCATCTCTGT 


GCTGG 




4965 


(2) INFORMATION FOR SEQ ID NO: 144: 









(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3232 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 
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CAGGGGCGTA TTACGTGACA ATTCAATGTA GGCTGTCGCT ACTTGCGCCA AAACAAGGAT 60 

TCGATAATGT CGGATGATAC TAACGATTAA ACCGAGCAGA AAGGATCCCA AAATTCCCCA 120 

AACTGCAATA TGCAAGGTCA GAAAGAATGC CTTTTGATAT AGTGGTAGAT ATTGTTCAAC 180 

AATGGATCAA TCCAAAAATA GAACCTCCCA TCTAGAAATA ATACAGTTAT TGTAGCACTT 240 

AAAATCTTCT TTGGATAATA TCTATTTTTT ATTGCCGTTA TAAGGATTTT TATCATAGAC 300 

ATAAAATTTC TGAAATTTCC AAACAAAATA TTTTAAAAGT TTTGAAAAAG AGTTAAGATA 360 

TTTTTGTAAT ACACAAAGTA AACGCTTACT TATTAAGGAG GACATTTTAT GTCATACAAA 420 

ACAAGCAATG CAGAAGGTCA TGTAGATTTC ATCAATACCT ATGATTTGGA GCCAATGGCG 480 

CAACAAGTTA TTCCTAAAGC AGCATTTGGC TATATCGCTA GTGGGGCGGG AGATACTTTC 540 

ACTTCTTTCC AGTGATTTTA GCGTCAGGTT CTTTTTAGTT TTTAAAGATT ATCCGTGAAT 600 

TTCTTGCTTA TTTATGATAA AATGGGAGTG TCGCAAAAAA TGACTCATCG TATTCAATTT 660 

TGAGTAAAAC TAGGAGGATC CCATGTCTAC AGAACATATG GAAGAACTAA ATGACCAGCA 720 

GATCGTTCGC CGTGAAAAAA TGGCTGCGCT CCGCGAACAA GGAATCGATC CTTTCGGAAA 780 

ACGTTTTGAA CGTACTGCAA ATTCACAAGA ATTAAAAGAT AAATATGCCA ACCTCGATAA 840 

AGAACAATTA CACGATAAAA ACGAAACAGC TACTATCGCA GGACGCTTGA TAACCAAACG 900 

TGGTAAAGGA AAAGTTGGTT TTGCCCACCT TCAAGACCGC GAAGGCCAGA TTCAGATCTA 960 

CGTTCGTAAG GATGCTGTCG GTGAAGAAAA CTACGAAATC TTCAAAAAAG CAGACCTTGG 1020 

TGACTTCCTT GGTGTCGAAG GTGAAGTGAT GCGTACGGAT ATGGGAGAAC TCTCTATCAA 1080 

GGCAACCCAC ATCACACACT TGTCTAAGGC TCTTCGTCCT CTTCCTGAGA AATTCCATGG 1140 

TTTGACAGAC GTTGAAACAA TTTACCGTAA ACGTTACCTT GACTTGATTT CTAATCGTGA 1200 

AAGCTTTGAA CGCTTTGTCA CTCGTTCAAA AATCATCTCT GAAATCCGTC GTTACCTTGA 1260 

CCAAAAAGGA TTCCTTGAAG TGGAAACACC TGTTCTTCAT AATGAAGCCG GTGGTGCTGC 1320 

TGCCCGTCCA TTTATCACCC ACCACAATGC CCAAAACATT GACATGGTGC TTCGTATCGC 1380 

GACTGAGCTT CACTTAAAAC GCCTTATCGT GGGTGGTATG GAACGTGTCT ATGAAATTGG 1440 

CCGTATCTTC CGTAACGAAG GAATGGACGC TACTCATAAC CCTGAGTTCA CTTCTATCGA 1500 

AGTTTACCAA GCTTATGCAG ACTTCCAAGA CATCATGGAC TTGACTGAAG GCATTATCCA 1560 

ACACGCTGCT AAATCAGTCA AAGGTGATGG CCCAGTCAAC TACCAAGGTA CTGAAATCAA 1620 

GATTAACGAA CCATTTAAGC GTGTTCATAT GGTGGATGCT ATCAGAGAAA TTACTGGTGT 1680 

CGATTTCTGG CAAGACATGA CTTTGGAAGA AGCTAAAGCT ATCGCTGCTG AGAAGAAAGT 1740 

TCCAGTTGAG AAACACTACA CTGAGGTTGG TCACATCATC AATGCCTTCT TTGAAGAGTT 1800 
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TGTTGAAGAA ACTTTAATCC AACCAACCTT TGTCTATGGA CATCCAGTAG CTGTATCTCC 1860 

ACTCGCTAAG AAAAATCCTG AAGACCAACG CTTTACTGAC CGTTTCGAGC TCTTTATCAT 1920 

GACTAAGGAG TACGGTAATG CCTTTACTGA GTTGAACGAC CCAATCGACC AACTTAGCCG 1980 

TTTTGAAGCC CAAGCTAAAG CCAAAGAACT TGGTGATGAT GAAGCGACAG GAATCGACTA 2040 

TGACTACATT GAAGCTCTTG AATACGGTAT GCCACCAACA GGTGGTTTGG GAATCGGTAT 2100 

CGACCGTCTC TGCATGCTCC TCACTGATAC AACAACTATC CGTGATGTAT TGCTCTTCCC 2160 

AACAATGAAA TAAATTCTTA TCCTCTGGGT CTTATCAGAG GATTTTTTGA TTCAAAAAGA 2220 
GACTGAATTT AAGGAGAAAA TGAAGTGTAG TATATTGAAA TTGAAATAGT ACACTTTGAT . 2280 

TTCTAAGACA TTGTTAGAAA TTGGTTTAAA TTCCCTAAGC AATTTGTGCA TGTTTTATTT 2340 

CATTTTACGA TAGTACGCTG AAACTTTTCA AAAAGTACTA GAAATTGACT TGGATTCCCC 2400 

AATTGATTTG TTCAGATTCA CTATAAATAA AAAATTAATA AGTGGGATAG GAAGTTAGCG 2460 

TCAACTAGGA TAGTATCTTG CTTAAACAGT ATATATGGGA TTGATATAAG TCCATAGGTC 2520 

CTATTAGAGG ATGTTCTGGT GTCTTATTCA CTTGTTTTTT ATAGTATTAG TAGATAGAAT 2580 

CAGCAAATAA AAACCCAAAT CATTCATACC TCTCTCAACT AGATGTAACT TACAAAACCC 2640 

CTGACCTCAT GAGCCACTTT CTTCCTCCTC ATGAGGTCAG TTTTACTTTC TGCTGTTCCA 2700 

GTATCGTTTT TCCTCGCTAG ATTTCCTCAA AAGGGCAGAC TCCTCCCTTG GTGCGTCACA 2760 

CGATTTTTTC ATCTCGACTG TTCTTTAATG CATCATTAAC GACGCTTTTC TTCTAGGTGG 2820 

TTCATAAGGA ACAGGAAGAT TCAGGTTGAC TTTTCTAATC CTAGAATAAA GTGCTGAAAA 2880 

CAATTCGGAA TAGGCATAGA GACTAGACAA TTTGAGGAGC TGCTTGCGTC CTGTTCGAAC 2940 

ACATTTTCCC ACCACGTGAA GAAAAAGATG GCGGAAGCGT TTGATTGTTA AAGTTTGGAA 3000 

GTCACCTCCA GCTAGATGTT TGAGAAAAAG ATAGAGATTG TAGGCGATAC AGCTCATCAT 3060 

CATACGAACT TCGTTTTTGA TTAAGGTTGA ACTATCCGTT TTATCGCCAA AAAATCCCTC 3120 

CTTCATCTCC TTGATGAAAT TCTCGGCTTG ACCACGTCCA CGATAAAGCT GAAACTGGTC 3180 

TTGGCTTGTT CCACTCGTCA TATTTGTAAC GAGAGAAATA ACATCGTAGA AC 3232 
(2) INFORMATION FOR SEQ ID NO: 145: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10711 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 5: 

CCGGAGAAAA TGATGAAAAG TTCAAAACTA TTTGCCCTTG CGGGCGTGAC ATTATTGGCG 60 

GCGACTACTT TAGCTGCATG CTCTGGATCA GGTTCAAGCA CTAAAGGTGA GAAGACATTC 120 

TCATACATTT ATGAGACAGA CCCTGATAAC CTCAACTATT TGACAACTGC TAAGGCTGCG 180 

ACACAAATAT TACCAGTAAC GTGGTTGATG GTTTGCTAGA AAATGATCGC TACGGGAACT 240 

TTGTGCCGTC TATGGCTGAG GATTGGTCTG TATCCAAGGA TGGATTGACT TACACTTATA 300 

CTATCCGTAA GGATGCAAAA TGGTATACTT CTGAAGGTGA AGAATACGCG GCAGTCAAAG 360 

CTCAAGACTT TGTAACAGGA TTAAAATATG CTGCTGATAA AAAATCAGAT GCTCTTTACC 420 

TTGTTCAAGA ATCAATCAAA GGGTTGGATG CCTATGTAAA AGGGGAAATC AAAGATTTCT 480 

CACAAGTAGG AATTAAGGCT CTGGATGAAC AGACAGTTCA GTACACTTTG AACAAACCAG 540 

AAAGCTTCTG GAATTCTAAG ACAACCATGG GTGTGCTTGC GCCAGTTAAT GAAGAGTTTT 600 

TGAATTCAAA AGGAGATGAT TTTGCCAAAG CTACGGATCC AAGTAGTCTC TTGTATAACG 660 

GTCCTTATTT GTTGAAATCC ATTGTGACCA AATCCTCTGT TGAATTTGCG AAAAATCCGA 720 

ACTACTGGGA TAAGGACAAT GTGCATGTTG ACAAAGTTAA ATTGTCATTC TGGGATGGTC 780 

AAGATACCAG CAAACCTGCA GAAAACTTTA AAGATGGTAG CCTTACAGCA GCTCGTCTCT 840 

ATCCAACAAG TGCAAGTTTC GCAGAACTTG AGAAGAGTAT GAAGGACAAT ATTGTCTATA 900 

CTCAACAAGA CTCTATTACG TATCTAGTTG GTACAAATAT TGACCGTCAG TCCTATAAAT 960 

ACACATCTAA GACCAGCGAC GAACAAAAGG CATCGACTAA AAAGGCTCTC TTAAACAAGG 1020 

ATTTCCGTCA GGCTATTGCC TTTGGATTTG ACCGTACAGC CTATGCCTCT CAGTTGAATG 1080 

GACAAACTGG AGCAAGTAAA ATCTTGCGTA ATCTCTTTGT GCCACCAACA TTTGTTCAAG 1140 

CAGATGGTAA AAACTTTGGC GATATGGTCA AAGAGAAATT GGTCACTTAT GGGGATGAAT 1200 

GGAAGGATGT TAATCTTGCA GATTCTCAGG ATGGTCTTTA CAATCCAGAA AAAGCCAAGG 1260 

CTGAATTTGC TAAAGCTAAA TCAGCCTTAC AAGCAGAAGG AGTCCAATTC CCAATTCATT 1320 

TGGATATGCC AGTTGACCAA ACAGCAACTA CAAAAGTTCA GCGCGTCCAA TCTATGAAAC 1380 

AATCCTTGGA AGCAACTTTA GGAGCTGATA ATGTCATTAT TGATATTCAA CAACTACAAA 1440 

AAGACGAAGT AAACAATATT ACATATTTTG CTGAAAATGC TGCTGGCGAA GACTGGGATT 1500 

TATCAGATAA TGTCGGTTGG GGTCCAGACT TTGCCGATCC ATCAACCTAC CTTGATATTA 1560 
TCAAACCTTC TGTAGGAGAA AGTACTAAAA CATATTTAGG GTTTGACTCA GGGGAAGATA . 1620 

ATGTAGCTGC TAAAAAAGTA GGTCTATATG ACTACGAAAA ATTGGTTACT GAGGCTGGTG 1680 

ATGAGACTAC AGATGTTGCT AAACGCTATG ATAAATACGC TGCAGCCCAA GCTTGGTTGA 1740 
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CAGATAGTGC TTTGATTATT CCAACTACAT CTCGTACAGG GCGTCCAATC TTGTCTAAGA 1800 

TGGTACCATT TACAATACCA TTTGCATTGT CAGGAAATAA AGGTACAAGT GAACCAGTCT 1860 

TGTATAAATA CTTGGAACTT CAAGACAAGG CAGTCACTGT AGATGAATAC CAAAAAGCTC 1920 

AGGAAAAATG GATGAAAGAA AAAGAAGAGT CTAATAAAAA GGCTCAAGAA GATCTCGCAA 1980 

AACATGTGAA ATAACTGTTG CAAAATATAA GAAAGGATTT AGTATTTCCC TTGAATGCTG 2040 

AATCCTTTTT TACATTTGTA AAGAAAGATT CTAAAATGTA CGGACCCCCA AAAGTTGGAG 2100 

CCTCTTTTTG TCAGAATAGA GAAAATTTTT GTTAATTTTA CTTGTTTCCT ATTGCTTTCT 2160 

CAGCTATTAT TTGTTATATT AAAAGTATAA TTATTTTTTA TTTATCAGAG TTAAGCATTG 2220 

CACTTTCAGA GGAAGGAGTA TTTTTTAAAA AGAAAATGTA AACGTTTGCT CAAAAATGAA 2280 

AGGATTTAGA AGTTTATGAA TAAAGGATTA TTTGAAAAAC GTTGTAAATA TAGTATTCGG 2340 

AAATTTTCAT TAGGTGTTGC TTCTGTTATG ATTGGAGCTG CATTCTTTGG GACAAGTCCG 2400 

GTTCTTGCAG ATAGCGTGCA GTCTGGTTCC ACGGCGAACT TACCAGCTGA TTTAGCTACT 2460 

GCTCTTGCAA CAGCAAAAGA GAATGATGGG CGTGATTTTG AAGCGCCTAA GGTGGGAGAA 2520 

GACCAAGGTT CTCCAGAAGT TACAGATGGA CCTAAGACAG AAGAAGAACT ATTAGCACTT 2580 

GAAAAAGAAA AACCGGCTGA AGAAAAACCA AAAGAGGATA AACCTGCAGC TGCTAAACCT 2640 

GAAACACCTA AGACGGTAAC CCCTGAATGG CAAACGGTAG CGAATAAAGA GCAACAGGGA 2700 

ACAGTCACTA TCCGAGAAGA AAAAGGTGTC CGCTACAACC AACTATCCTC AACTGCTCAA 2760 

AATGATAACG CAGGCAAACC AGCCCTGTTT GAAAAGAAGG GCTTGACCGT TGATGCCAAT 2820 

GGAAATGCAA CTGTTGATTT AACCTTCAAA GATGATTCTG AAAAGGGCAA ATCACGCTTT 2880 

GGTGTCTTTT TGAAATTTAA AGATACCAAG AATAATGTTT TTGTCGGTTA TGACAAGGAT 2940 

GGCTGGTTCT GGGAGTATAA ATCTCCAACA ACTAGCACTT GGTATAGAGG TAGTCGTGTT 3000 

GCTGCTCCTG AAACAGGATC AACAAACCGT CTCTCTATCA CTCTCAAGTC AGACGGTCAG 3060 

CTAAATGCCA GCAATAATGA TGTCAATCTC TTTGACACAG TGACTCTACC AGCTGCGGTC 3120 

AATGACCATC TTAAAAATGA GAAGAAGATT CTTCTCAAGG CGGGCTCTTA TGACGATGAG 3180 

CGAACAGTTG TTAGCGTTAA AACGGATAAC CAAGAGGGGG TAAAAACAGA GGATACCCCT 3240 

GCTGAAAAAG AAACAGGTCC TGAAGTTGAT GATAGCAAGG TGACTTATGA CACGATTCAG 3300 

TCTAAGGTCC TCAAAGCAGT GATTGACCAA GCCTTCCCTC GTGTCAAGGA ATACAGCTTG 3360 

AACGGGCATA CTTTGCCAGG ACAGGTGCAA CAGTTCAACC AAGTCTTTAT CAATAACCAC 3420 

CGAATCACCC CTGAAGTCAC TTATAAGAAA ATCAATGAGA CAACAGCAGA GTACTTGATG 3480 
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AAGCTTCGCG ATGATGCTCA CTTAATCAAT GCGGAAATGA CAGTACGCTT GCAAGTTGTA 3540 

GACAATCAAT TGCACTTTGA TGTGACTAAG ATTGTCAACC ACAATCAAGT CACTCCAGGT 3600 

CAAAAGATTG ATGACGAAAG CAAACTACTT TCTTCTATTA GTTTCCTCGG CAATGCTTTA 3660 

GTCTCTGTTT CTAGTAATCA AACTGGTGCT AAGTTTGATG GGGCAACCAT GTCAAACAAT 3720 

ACGCATGTCA GCGGAGATGA TCATATCGAT GTAACCAATC CAATGAAGGA TTTGGCTAAG 3780 

GGTTACATGT ATGGATTTGT TTCTACAGAT AAGCTTGCTG CTGGTGTTTG GAGTAACTCT 3840 

CAAAACAGCT ATGGTGGTGG TTCGAATGAC TGGACTCGTT TGACAGCTTA TAAAGAAACA 3900 

GTCGGAAATG CCAACTATGT AGGAATCCAC AGCTCTGAAT GGCAATGGGA AAAAGCTTAT 3960 

AAGGGCATTG TTTTCCCAGA ATACACGAAG GAACTTCCAA GTGCTAAGGT TGTTATCACT 4020 

GAAGATGCCA ATGCAGACAA GAACGTTGAT TGGCAAGATG GTGCCATTGC TTATCGTAGC 4080 

ATTATGAACA ATCCTCAAGG TTGGGAAAAA GTTAAGGATA TCACAGCTTA CCGTATCGCG 4140 

ATGAACTTTG GTTCTCAAGC ACAAAACCCA TTCCTTATGA CCTTGGATGG TATCAAGAAA 4200 

ATCAATCTCC ATACAGATGG TCTTGGGCAA GGTGTTCTCC TTAAAGGATA TGGTAGCGAA 4260 

GGCCATGACT CTGGTCACTT GAACTATGCT GATATTGGTA AGCGTATCGG TGGTGTCGAA 4320 

GACTTCAAGA CCCTAATTGA GAAGGCTAAG AAATATGGAG CTCATCTAGG TATCCACGTT 4380 

AACGCTTCAG AAACTTATCC TGAGTCTAAA TACTTCAATG AAAAAATTCT CCGTAAGAAT 4440 

CCAGATGGAA GCTATAGCTA TGGTTGGAAC TGGCTAGATC AAGGTATCAA CATTGATGCT 4500 

GCCTATGACC TAGCTCATGG TCGTTTGGCA CGTTGGGAAG ATTTGAAGAA AAAACTTGGT 4 560 

GACGGTCTCG ACTTTATCTA TGTGGACGTT TGGGGTAATG GTCAATCAGG TGATAACGGT 4620 

GCCTGGGCTA CCCACGTTCT TGCTAAAGAA ATTAACAAAC AAGGCTGGCG CTTTGCGATC 4 680 

GAGTGGGGCC ATGGTGGTGA GTACGACTCT ACCTTCCATC ACTGGGCAGC TGACTTGACC 4740 

TACGGTGGCT ACACCAATAA AGGTATCAAC AGTGCCATCA CCCGCTTTAT CCGTAACCAC 4800 

CAAAAAGATG CTTGGGTAGG GGACTACAGA AGTTATGGTG GTGCAGCCAA CTATCCACTG 4860 

CTAGGTGGCT ACAGCATGAA AGACTTTGAA GGCTGGCAGG GAAGAAGTGA CTACAATGGC 4920 

TATGTAACCA ACTTATTTGC CCATGACGTC ATGACTAAGT ACTTCCAACA CTTCACTGTA 4980 

AGTAAATGGG AAAATGGTAC ACCGGTGACT ATGACCGATA ACGGTAGCAC CTATAAATGG 5040 

ACTCCAGAAA TGCGAGTGGA ATTGGTAGAT GCTGACAATA ATAAAGTAGT TGTAACTCGT 5100 

AAGTCAAATG ATGTCAATAG TCCACAATAT CGCGAACGTA CAGTAACGCT CAACGGACGT 5160 

GTCATCCAAG ATGGTTCAGC TTACTTGACT CCTTGGAACT GGGATGCAAA TGGTAAGAAA 5220 

CTTTCTACTG ATAAGGAAAA GATGTACTAC TTCAATACGC AGGCCGGTGC AACAACTTGG 5280 
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ACCCTTCCAA GCGATTGGGC AAAGAGCAAG GTTTACCTTT ACAAGCTAAC TGACCAAGGT 5340 

AAGACAGAAG AGCAAGAACT AACTGTAAAA GATGGTAAAA TTACCCTAGA TCTTCTAGCA 5400 

AATCAACCAT ACGTTCTCTA TCGTTCGAAA CAAACTAATC CTGAAATGTC ATGGAGTGAA 5460 

GGCATGCACA TCTATGACCA AGGATTTAAT AGCGGTACCT TGAAACATTG GACCATTTCA 5520 

GGCGATGCTT CTAAGGCAGA AATTGTCAAG TCTCAAGGGG CAAACGATAT GCTTCGTATT 5580 

CAAGGAAACA AAGAAAAAGT TAGTCTCACT CAGAAATTAA CTGGCTTGAA ACCAAATACC 5640 

AAGTATGCCG TTTATGTTGG TGTAGATAAC CGTAGTAATG CCAAGGCAAG TATCACTGTG 5700 

AATACTGGTG AAAAAGAAGT GACTACTTAT ACCAATAAGT CTCTCGCGCT CAACTATGTT 5760 

AAGGCCTACG CCCACAATAC ACGTCGTGAC AATGCTACAG TTGACGATAC AAGTTACTTC 5820 

CAAAACATGT ACGCCTTCTT TACAACTGGA GCGGACGTCT CAAATGTTAC TCTGACATTG 5880 

AGTCGTGAAG CTGGTGATCA AGCAACTTAC TTTGATGAAA TTCGTACCTT TGAAAACAAT 5940 

TCAAGCATGT ACGGAGACAA GCATGATACA GGTAAAGGCA CCTTCAAGCA AGACTTTGAA 6000 

AATGTTGCTC AGGGTATCTT CCCATTTGTA GTGGGTGGTG TCGAAGGTGT TGAAGATAAC 6060 

CGCACTCACT TGTCTGAAAA ACACAATCCA TATACACAAC GTGGTTGGAA TGGTAAGAAA 6120 

GTCGATGATG TTATCGAAGG AAATTGGTCA CTCAAGACAA ATGGACTAGT GAGCCGTCGT 6180 

AACTTGGTTT ACCAAACCAT CCCACAAAAC TTCCGTTTTG AAGCAGGTAA GACCTACCGT 6240 

GTAACCTTTG AATACGAAGC AGGATCAGAC AATACCTATG CTTTTGTAGT CGGTAAGGGA 6300 

GAATTCCAGT CAGGTOGTCG TGGTACTCAA GCAAGCAACT TGGAAATGCA TGAATTGCCA 6360 

AATACTTGGA CAGATTCTAA GAAAGCCAAG AAGGCAACCT TCCTTGTGAC AGGTGCAGAA 6420 

ACAGGCGATA CTTGGGTAGG TATCTACTCA ACTGGAAATG CAAGTAATAC TCGTGGTGAT 6480 

TCTGGTGGAA ATGCCAACTT CCGTGGTTAT AACGACTTCA TGATGGATAA TCTTCAAATC 6540 

GAAGAAATTA CCCTAACAGG TAAGATGTTG ACAGAAAATG CTCTGAAGAA CTACTTGCCA 6600 

ACGGTTGCCA TGACTAACTA CACCAAAGAG TCTATGGATG CTTTGAAAGA GGCGGTCTTT 6660 

AACCTCAGTC AGGCCGATGA TGATATCAGT GTGGAAGAAG CGCGTGCAGA GATTGCCAAG 6720 

ATTGAAGCTT TGAAGAATGC TTTGGTTCAG AAGAAGACGG CTTTGGTAGC AGATGACTTT 6780 

GCAAGTCTTA CAGCTCCTGC TCAGGCTCAA GAAGGTCTTG CAAATGCCTT TGATGGCAAT 6840 

GTGTCTAGTC TATGGCATAC ATCTTGGAAT GGTGGAGATG TAGGCAAGCC TGCAACTATG 6900 

GTCTTGAAAG AACCAACTGA AATCACAGGA CTTCGCTATG TTCCGCGTGG ATCAGGTTCA 6960 

AATGGTAACT TGCGAGATGT GAAACTTGTT GTGACAGATG AGTCTGGCAA GGAGCATACC 7020 
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TTTACTGCAA CTGATTGGCC AAATAACAAC AAACCAAAAG ATATTGACTT TGGTAAGACA 7080 

ATCAAGGCTA AGAAAATTGT CCTTACTGGT ACCAAGACAT ACGGAGATGG TGGAGATAAA 7140 

TACCAATCTG CAGCGGAACT TATCTTTACT CGTCCACAGG TAGCAGAAAC ACCTCTTGAC 7200 

TTGTCAGGCT ATGAAGCAGC TTTGGTTAAG GCTCAGAAAT TAACAGACAA AGACAATCAA 7260 

GAGGAAGTAG CTAGCGTTCA GGCAAGCATG AAATATGCGA CGGATAACCA TCTCTTGACG 7320 

GAAAGAATGG TGGAATACTT TGCAGATTAT CTCAACCAAT TAAAAGATTC TGCTACGAAA 7380 

CCAGATGCTC CAACTGTAGA GAAACCTGAG TTTAAACTTA GATCTTTAGC TTCCGAGCAA 7440 

GGTAAGACGC CAGATTATAA GCAAGAAATA GCTAGACCAG AAACACCTGA ACAAATCTTG 7500 

CCAGCAACAG GTGAGAGTCA ATCTGACACA GCCCTCATCC TAGCAAGTGT TAGTCTAGCC 7560 

CTATCTGCTC TCTTTGTAGT AAAAACGAAG AAAGACTAGT ATTTAGTAAA ACCTCTTAAC 7620 

AAGATTACGG AAGCAGTCTC TATCTTTTCC AATGAGGTTT ATAGTACAGA AAAAGCCTGA 7680 

GAAGATGTCT TCTCAGGCTT TTGTTAAGCA CATAAATACA ATAGTGCTAT GACAAAATCA 7740 

CCCAGAAAAA TCTGGGTGAT AAATGTTATG GTTGTGCTGG TTGAGGATTC TGATTTTGTT 7800 

GATCAGGGGT TGTATTTGAT TGTTGCGTAT TATTGTTAGG ATTGGTAGTC GTACTATTAT 7860 

TTGTGCTTGG AGTGGTTGAG CTAGACTGTG AAGTTGAACT ATCTGATGAT GAGCTTGAAC 7920 

TTTCAGTTGA TGGGGGTTGT TGTGGAGCAG GTGAGTTCCA CGTAGAACGA GCACCATTTT 7980 

TAAATACGAA TTCTCCATTT CTGTAGAGCC CCTCTGGTAT ATTCCAATCT TCTGGATTGC 8040 

TTCCTTCAGA CAGGTAGGTC ATCATAGAGC GGTAAACTTT GGCAGCGACC GTAAGGCCAT 8100 

TGCCTACAAG TGGTGTCAGA CGGTTAGAAT AGCCTGTCCA TACAGCCATT GAATATTTAC 8160 

GCGTATAGCC AGCAAATAGT TCATCAGGTG CTACAAATTG AGAGGTCTTG ATGTGGTTTT 8220 

CAATTTCCTC GTCTGTATAG TTAGAGGTTC CTGTTTTACC AGCCTGAGGG AGCCAAGCAA 8280 

GATAGGCATT TCGTCCAGTT CCATAAGTCA AGACTGTTTT CATCATGTCG GTCATCATAT 8340 

AGGCTGTCGT TTCCTTCATG GCACGAGTTC CGACATTAGA GAACTCTTTT TCACTCCCAT 8400 

CACTAAAGAC GACTTTATGG ATATACATTG GTTTATAGTA AGTTCCACCA TTTGCAAAGG 8460 

CAGCGTAAGC AGCAGCCATC TTTTCACTAC TTGCTCCATA TTTTTTGTCT GATTCGGTTG 8520 

TGTTACTTGA AATGGCATTT GAGTAGTGAA TACTTGGGTA GTCGATTCCT AGACCATTTA 8580 

GGAAAGTCTT GGCGCGGTTG AGTCCGACCT TGTTTAGAGT TTCCACGGCT GGGACGTTTC 8640 

GCGATTGTTG CAGGGCGTAT TGCAAGGTGA TGTTGCCAAA GTAGCCCCTA TCCCAGTTAT 8700 

AAACAGGAGT ATTTGTCCCA GGGTAGTTAT AGGGCTCATC GTGAACGATA GTAGCAGTTG 8760 

AATCGTAGAC ACCGTACTCC AAGGCAGGAG CATAGTCTGT GATCGGTTTC ATAGTTGATC 8820 
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CCCAGTCGCG GTTTGTTTCT ACTGCTTGGT TAATTCCGAA GGAAACATTA CTTGACTGAT 8880 

GGCGTGCTCC TAGCTGGGCA ATGACTTTAC CGTTAGAAAC ATCAACAATG GTAGAAGCGA 8940 

CTTGCAATTC ATCGTCTGGA TAGGCAACGT ATTCGTCTGT ATTGTAAATA TCCCACAGAT 9000 

GTTTTTGAGC TTCTTGGTCT ACATTTGTGT AGACATCCAT CCCAGTTGTG AGTAGGTTAT 9060 

AGCCTGTTTC TTCTTCAACT TGATTGATGA CTTCCTTGAG GTAATTATCC ATGTAAGCAG 9120 

GGTAATTACT TGCTGATTTG AGACTTTGTA GTCCATCAGT AATTGGTGTA TTGACTGCTT 9180 

TCTCATACTG TTCAGCAGAG ATGTAGCCTT GATTTTTCAT TTCAGATAAG ACCAAGTTTC 9240 

GGCGGTCTTG GGCTGCTTCT GGATGTGAAT AGGGGTCATA TTGGTTTGGT GCCTGAGGCA 9300 

TTCCAGCCAG CAAGGCTAAC TGAGGTAAAC TTAAATTATT GAGGTCTTTA CCATAGTAGT 9360 

TTTGAGCTGC TGTCTGCATT CCATAGTTCC CATTAGACAT GTAGACCTTA TTTATATAGT 9420 

AGGTCAAGAT TTCTTGCTTG GTTGCTTTTT GTTCTAACTG AATCGCTAAC CAAGCTTCCT 9480 

GAGCCTTACG AGAAATAGTC TGGTCGGAAG TCGAAGTTGA AAAGTAAGTC AACTTAATCA 9540 

ACTGTTGGGT GAGAGTTGAT CCACCTTGGA GGGAATTGCT TTGCAGATTG CGCAAGAAAG 9600 

CTCCCAGGAT ACGGATGGTA TCAATCCCCC TGTGGTCGAA GAAGCGATGG TCTTCGATAG 9660 

AAACGATTGC CTTAACCAAA TCTGTGGGAA TATCATTAGC TTGGGCATTG ACGCGGCGTT 9720 

CAGAACCCAA GTCAGCAATG AGTTGATTTT TATTGTCGTA GATTTTACTA GAAGTTGTTG 9780 

CAACTAGTTT ACTCTCGGAT AGGCTAGGAG CCTTGCTAAC GTAGTAGAAA AAAACTCCTC 9840 

CGCCTAAGAC AATGGCTGCG ATAACCAAGC TTAAGAAGCT AATGCTCAGA TACTTGATTA 9900 

GGCGCAGAAT CGTTGGTTTG TTCATCTTGT TTTACCACCT AATAAATGTT CTTTGATAAC 9960 

ATTGAGATAA GGAATTTGAG GGAAGGCACC AGCCTTGATT TCATATCCAT ATTCTCGAAT 10020 

ATATTCAAGT GGCATTGATT TTTGTCCCTT ATCTTGATGA TAGAAGCGAA TCAAATCGAA 10080 

TGCCGGCAAT AAGTAGGTTT CTTGCTGAGA AGAAAAGTGA AGAAGGACAA AGCAGATTCC 10140 

TTGTTGGGCA AGGACTTGTT CCATATGCTG AATCTGATGT GGATGAAAAT TTTTCATCGG 10200 

AATCGCACGT TTTTGTTTTG TTTCCTTGAC TTCAAAGTCG ATGTAATATC CATTATAAAC 10260 

GCCAGAATAG TCCGTCGTTG AAGCTTGTCG AAAATAGGCT TCAACAATCT TGGCACGACT 10320 

TCGTTGTGGA TAGTCCACTT GTACGATTTG AATAGGAGTT GGTTTCTTAT GTATAACAGC 10380 

CAAGCCCTGA GACAAATAGT AGTCGTTGGT AGCATTGATC ATCTTTTCAA AGGGTACCGA 10440 

GCTCGAATTC GTAATCATGT CATAGCTGTT TCCTGTGTGA AATTGTTATC CGCTCACAAT 10500 

TCCACACAAC ATACGAGCCG GAAGCATAAA GTGTAAAGCC TGGGGTGCCT AATGAGTGAG 10560 
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CTAACTCACA TTAATTGCGT TGCGCTCACT GCCCGCTTTC CAGTCGGGAA ACCTGTCGTG 10620 
CCAGCTGCAT TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA TTGGGCGCTC 10680 
TTCCGCTTCC TCGCTCACTG ACTCGCTGCG C 10711 
(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 11887 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

TACATTCATT CCATCGGCTA CTCCATAATA CTTAGATAAA ACCATAGCTG AAGTCGAATA 60 

CGGATACTGT AAAGTATTAT CAATTTTAAT CAAATCATCA TTACCGATAA TACTTCTGAT 120 

TGCTTTTGGT AGTATGAACC ATACGTTGGT GAAATCTCAG ATAATGAAGA ATCATTAGAC 180 

TCTGGACCTT TTTCTAGTGT CTCACTTACC TCATATTCTT CACCCTTACT AGAAATAACA 240 

CTCAAAGCAG ATACTGTCGA TAACTGGCTA GCCAATAAAG TACTCGCAAT AATTGAAATA 300 

CCCAATTTTT TATAAACAGT TTTCTTCATT ATTGTATCCT CCTAATGTAA TTATAGCGTA 360 

CTATTCTAAA TTTCTTAATC TACTATAGAA TCAAGAAATC TACCACCTTC TTTAAATACC 420 

CTCCATTATC ACATAAACAG GTAAACTTTT CAATTAATGA CTGCGCTTTT CAATCACGCT 480 

AGAGGTACTT GCTTGCTTCT TTGATACTAA GTTCAGCCAT TCTTTCCTTG TTTTTCTCAA 540 

TAAAGCATGT TACCCAAGTG GGATTCGTTT TGGAGTAGTC TCGCAGAGTC CAGCCAATGG 600 

CTTTATTGAT AAAAAATTCT GTTTGGTTCA AGTTATGAAG GAGAATCTTT TCCATTAATT 660 

GAGTATTGGT CTTCTCTTTT CTTAACAACT GGTGGTCAAT AGCGACACGT CTCAGCCAGA 720 

TATTATCTGA TAGGCTCCAT TTTATACTCA ATGAAAATCA AAGAGCAAAC TAGGAAGCTA 780 

GCCGCAGTTG CTCAAAACAC TGTTTTGAGG TTGCAGATAG AGCTGACGTG GTTTGAAGAG 840 

ATTTTCGAAG AGTATTAAGA TTATTTCTTC TAGTTCAGGG TGTTCATACA CCAAACTCCC 900 

TACTACTCGA TCTAGGATAT CTACCGTGTC CCACAAGGAT TTTGTCACGA CTAACTGCTC 960 

TAGCTTAGGC AAATCGGTTT CCTTTAGATA AGACTGCATT GCTTTCAAAT AGTTAGCAGC 1020 

CACATATTGG TATTTTCTAG GATCCTTTTC CCAGCAAGTG TCTGCAAAAT CCCAATCGAT 1080 

AATCTTTGTT TTTTTCGCTT CTGGAAAATA TTTTATAGAG TTTATTTCTT TCAGGCACCG 1140 

CAATACCTAG AAAAGAAAAT TGATGGCGCA TATAGGCTTC CATGGACCTT GCTTTTTTAG 1200 

AGTCTTTTGC TGCTTCTAGC TCCTCAAGTA AATCTGCTAA ACTCATCTAA AACTCCTCTT 1260 
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GCCCCACCAA ATGGTGCTGA AAGGCATAGA CAGCCGCCTG GGTACGATCG CTGACTTCAA 1320 

GTTTGGCAAG AATATTGGAC ACGTGGGTCT TGACCGTCTT GAGAGAGATA AAGAGGTCAT 1380 

CTGCGATGCG CTGATTTTCG TAGCCCTTGG CGATGAGTTG GAGAACATCT CGCTCACGCG 1440 

CAGTCAATTC TTCATGAAGT TCCATATGAT TGCGGTGGTA TTCAACCTTC TTGCTAACCT 1500 

CTTGCTCAAT GGCCAGCTCG CCAGCAGCTA CCTTACTGAC GGCATGAAGC AATTCATCTG 1560 

CACTAGAAGT CTTGAGCATA TAGCCTTTGG CACCAGCATC TAAGACTGGC ATGATTTTTT 1620 

CATTGTCCAA ATAAGAGGTC ACAATCAAAA TCTTGGCTTC AGGCCATTCT TTAAGGATTG 1680 

CTAAGGTCGC GTCAATCCCA TTCATCTCAG GCATGACAAT ATCCATGACA ATGACATCTG 1740 

GACGCAGTTC CAAGGCCAAG TCAATCCCTT GAGACCCGTT GGACGCCTCA CCCACAACTT 1800 

CTACATCGTC TTGGAGGTCA AAGTAGCTTT TCAAGCCCAA TCGGACCATT TCATGGTCAT 1860 

CTACTAGTAA AATTTTCATC TTTACTCCTT TATCATTCCT TATCTAACAG GGGAATACGG 1920 

ATATCAACCG CCAGCCCTTG CTTGGGAGCT GTCAAGAGTT GAACTGTTCC AGCCATATCT 1980 

TCAACCCGCT CCTTGATATT TCGCAGTCCA TAACTCAAGT CGTCTAAGCT CCCTAACTGG 2040 

AAACCAATCC CATTGTCCAC CACCTTCAGT TGCAATTCAA CATCTGTCTG ATAGAGGTAG 2100 

ACATCTAGGC AAGATGCCTG GGCATGGCGG AGGGTATTGC TAATCAACTC TTGCAGGATA 2160 

CGGAAGATAT GCTCCTCGAT TTTCTTAGGC AATTTCGTCA TATTCTGCTT GAGACTAACC 2220 

CTAAGATCAC TCTTGTCCTC AAGCTCTTTT AAAAGAATTT GAATCCCTTC TATCAAGCTC 2280 

TTCTGCTCCA GTTCAACTGG TCGCAAATGC AAGAGCAAAA CCCGCAAATC CTTCTGGGCT 2340 

GTTTCTAAAA TAGCTGTGAC ACTCTGCAAC TGGGTCTGCA TCTTTTCTCT ATCCAATTTC 2400 

AAAGCCTGCT GACTGATACC CGATAAAATC ATGTGGGCCG CAAACAACTC CTGACTGACT 2460 

GTATCGTGCA AATCCCGAGC AATTCGCTTC CGTTCCTTCT CGATGATTTC CTCTTCCTGA 2520 

GCAAGGCTCT GATTTTCAGC TTTTTGAAGA GCCTCTGTCA AAAGGTTAAG TTTACCTGAT 2580 

AAGGACTTGA AACTGGCATC CAAATCTGGA TCTGCAACCT GAACCACTTC TTGCCCTGCT 2640 

AATAAACGCT TGAGATTAGC CTGCATTTTT CTTAGAGAAA GCTCTTCGAT CCCTCGCCAA 2700 

AACAGGGCTA AGAGACAGGT CATGGACATG CTGAAAACCA ACAATAAAAA GACAAATTTT 2760 

TCTGTTTTTT CGACATCGTG CAAAAAGATA GACCAGTCAA AATCAAGTAT TTCCAGCAAG 2820 

CTGTGGGAGA AAAAAAAGAC AAATAGGAAG GAGGTGAGAG CAATAATGAC ATAGGCTTGT 2880 

TTTTTCATCC TCTAACCACC TCCACATCAC CAATCATAGT GGTCAAGAAA ATCTTGACAC 2940 

TCTTGTTACT CTTGAGATAG TCTTTTGTTT CTTGATGATA GTGTTCATTG CGGAGGGCTC 3000 
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GCTTGGGCTG GTTGAAAAAA ATCAAATCCC CATAGAGACA 
CCACATCTAC AGGTACGATG ATTTTGGTCG TTCCTACCAT 
TGTCATGATT GGTTAAGATG ACCCTCTCCA GATGAATAGT 
AGAGATTGAT ATCATCGAAT TGGCAAGTCT GGTAGCTTGA 
ACCAACGATT TTTCTCCTTC TTAACCGTCA CGACCTCTTC 
CTTTTTCCTG GTTCATCATC GGGTAAAGAA GAAAGAGGCT 
TAGCTAGAAT CACAAAAGGA TTGAGCATAA CGATGAAAAA 
CTAAAAGAAG ATTATTTCCC TCTTTACCAG TGTAGTAGCG 
ATAGTATCAG CAGAAAACGC GAAAAATGCT CTGATACCAT 
GAAGACAGGC TTCGATAAAT AAAAAGATTT TAAATTTTCT 
TCTATTTTAT CACAATTCAA AAAAGTCACC TCAGTCTGAG 
TACGCCTTTT TCATCTGATC CTTTGCTTCT TTTAATTTTC 
TTTTGTAGAT CTGCTATGGT GGCACAGTTA AGGGAACACA 
TTCCAGCCTT GGACAATGCC AATCACTTCT TCAACTGTGT 
ACGGTTCGTG ACAATCCCAC AGCCTTAGCA CCAAAAACCA 
GGATTCCGAA CCCCTCCACT AACCAAGAGT TCGACCTTAT 
AGAAGGGCCT GCATGGTAGA CTGACCCCAT TGATTGAGGT 
CGGTTTTCGA TATAGGCAAA GCTGGTGCCA CCACGACCCG 
CCGAATTCAT AGGCTCTTTC GATTGTCTTG GCATCCATTC 
ACAATAGGAA CGGGAATTTG CTTGCTATAA TCTGCTAGAT 
TTCCTTTCTC CCTCGGGCAT GAGTAATTCC TGCATGACAT 
ACAGGATTCA TCTCTTCTAC AGTCTGAAGT CCTAACTCGA 
TTGGTTCCAA GGAGGAGATT GGGATGACTA GACTTGACAG 
TTTTTGAGGG CTGCGCTATA AGAACCCGTT ACAAATAAAA 
TGAGCCAGCT TTTGATTGAT TTCTCTTCCC TTATTACTTC 
TAAAAAGGAA AGTCCCACTT TCGACCAGCA AACTCTGTCG 
TTGTAAAGAG GCAAGGAAGA ATGAATCAGC TCCACCTCAT 
TTCTGCTCAA GGGCATAGAG GATATGCTCG TCCTTACGAT 
TTCTTGATAT AAGAGCTCAA TCCCCAGATC GGCCCAACGA 
TTGCGCATCA AAACTCAGGG CGATGCCACA GTCACCACCA 



GTTAACGCTG AGACTGACTT 3060 

CTTTCTGAGG ATAATGACAT 3120 

GTCCTTGCCC ATGAAGCGAA 3180 

AAAATGATGA AGATTTCCAA 3240 

AAAAACCAAA TTGGTCTGCT 3300 

ATAGATAACC GCAACAAAAA 3360 

GAAGAGAATG GTTGCCGCTA 3420 

AATCAAAAGC AAAAAGAGGA 3480 

CAAAATCAGA GCTCCTGTCA 3540 

CATAGGTTCA TCCTCTCCCT 3600 

GATGGAAAAA AGGCGCTGGT 3660 

CATAAAGAAG ATAGTCTACT 3720 

TAATCAAGCG TAGATCTGCT 3780 

AGGTTTCAAC CAATTCCAGA 3840 

AGCACTTAAT CATATCCAGC 3900 

CTTTCCATTC TTGGGCATTG 3960 

AATCACGCTG GCCACTACGA 4020 

ATAGGTCCAC TGTACGAACA 4080 

CAAAGCCCAC TTCCTTGAGG 4140 

GCGATTGCCA GCTTCTAAAC 4200 

TGACATGCAC TTGCAATAGA 4260 

CAGGCTTGTC CAATCCAATA 4320 

AAAAAGAATC ATCCGTTGGA 4380 

TACCACAGGA TTCCGCCACC 4440 

CACCAGTCAT GGCATTGATA 4500 

AAAGATCGAT TTCATCCAGA 4560 

CAAAGCTATT ATAGGAACTT 4620 

TTGTCGTCAT GTCCTATCCT 4680 

TTTTTTAAGG TTTTGGTTGA 4740 

CCAGCACCAC TACTCTTGGC 4800 
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AACGGTCTGC AAATCTTGAC TGGCTTCTTT CAACTGTCTA AGCAAAGGCG TGTAAATATC 4860 

TGTACTCAAG CCTTCTAAAA GCTTGCTGGC TACTTCTACT TGATCGATAA TCTTTTCTGA 4920 

TTTCCCCTGT TCCAAGGCTT CTACCAGAGA AGTCACCGTT TCTTTTGAGG AAGTTAAAAA 4980 

ATTTTGATTG ATATTTTGCT TGATTTGCTG GACCATGTGA CTCGATACAG CCACTTCCTT 5040 

GGTCCATCCC ACTAAGAAAT CACATTCTAA AGTTGGTTTC ACTTGTGAAA TTGAAAAGCC 5100 

CCAATCACGC TCCAGAACTG TCGCCAAGTT TTCTTCTTCT AACCAAGCAG CCACCTTCTG 5160 

GCGATCAAAT GACTGGTAGA GAACCAAATC CTCTGCCACA ATACAGGCAA GGTCGCCCAT 5220 

GGAACCATTG TCTCCTCGCT TAAGCAAGAC AGCGCTAGTC AGCTTGAACA AGAGCTCCTG 5280 

ATCAACAGAA ACATCATACA GAGCCAGTAA AGCCTTGACA ACCAAGACAA CGACGCTGCC 5340 

ACTAGAACCT AGACCAAACT TTTTCCCTTC TCGTTCCATT TTGCCACAGA TTTCTAGAGA 5400 

AAAAGGTCTT AAATTCTGAC CACGAACAGC GAGGAAGTCT CCCATCAAAG CAATCGTTTC 5460 

TTGAATCAAG CTATAGTCAG GATTAGGCCT TAAGTCCACT GCGAAATCAA ACATATCTGA 5520 

ATAGATACGG TAGCTGTCAG AAAAAGCAAT CTCAGCCCTC ATATAGATGG GAATATCCTT 5580 

TATCAAAGCT AACTGCCCTG GCTCTAAAAT AGCATATTCA CCTGCCCAAT AGAGTTTTCC 5640 

GCAAGTTTTA ACAGCAATCA TCTTGACTCA AATCCTTTGT TTTTGACACA ATCAAGCGAT 5700 

AACGATGACC GAAAATTTCT GATAAATGCT CCAAGTCTTT CTCCTGACAG AAGACCTTAA 5760 

CATTGGGACC AGCATCCATG GTAAAGTAGC AGGCCTCTCC TTTCTCACGA AGCTGGCGAA 5820 

CAAAGGCCAT AGCCTCATAA GAGGCATCCG TCAGATAAGA AAAGGCTGGA CTAGCAGTCT 5880 

TTGTCGTAGC ATGCATAGCC AGGGCATTTT TCTCCGTTAA TTCTCCAATC TTGGCAAAAT 5940 

CATTTTCCTT GAGATAAATC AGCATATCCT GATAGTCCTT. CTCAGACTGA CGAACCCAGT 6000 

CGTCGAAAGT CGTCGAGGTT TCCACACAAA GTTTCATCCC GTCACGGCTA GAGATTGGTT 6060 

TTTTCTTGTC CTCTAGCACC AACATAATCA TAGCTAGTTT CAAGTCTGTC TCTACAGGGT 6120 

AAATTTCTCC ACTATCCTTA TCCCAGGCTC CTAGTGGTCC ATAAAAACTC CGAGAAGAAG 6180 

AACCTGAGGC AAATTTGGCT TCCTGTGCCA ACTGACTTCT ATCCAATCCA AGCTTGAAAT 6240 

AAGCATTACA AGCCTTGACC AGGGCGGACA AACCACTAGA ACTTGAGGAC AGACCCGCTG 6300 

CCGTAGGCAT ATTGTTTTGA GTATCGATAC GGACAAAGCC CTCACCAGCT GGACGATAAC 6360 
GGTCAATAAT CTTACTCATC TTGGCATGCT CGACCTCATT TTGTAGCTGA CCATTGATGT ~ 6420 

AAAATTCGTC AGCTGTTACA TTGGCTGGTA AAGGCGACAA GGTCGTCTCT GTATACATAT 6480 

TTTCCAAAGT TAGAGAAATA CTGCT AG TAG CAGGCACCAT CTCTTTTTCT TTTTTCTTTC 6540 
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CCCAATATTT GATAATAGCA ATATTTGCGT AGGAACGTAC TGTTACAGGC TCTCTATCCA 6600 

TGTCTGAACA GCTCCTTTCT CTTCTAATCT TTCTGCTAGT TCTTGTGCGT GTGTCAAATT 6660 

GGTTACCAAG GCTATGATAC AACCTCCTAG CCCACCACCG CTCATCTTGG CACCCAGAGC 6720 

ACCATGGCTA AGAGTCGTTT CAACCAAAAA GTCTGCCTCA GGGCTACTGA CTCCAATTTC 6780 

TTTTAAATGT AAATGCGCTT GACTGAGGAT TTGTCCCAGT CCTTCAGCAT CTTTTTGTGA 6840 

AATCGCAACT TCTGCTTGCT GGGTTAATTC TCCCAAGGCA TGCAAAAACG GTAGGGCATC 6900 

CTTGCCCTTA TTTTGAACCA CTTGGATGGC TTCACGAGTA TGACCATAAA CACCCGTATC 6960 

GGCAATCACC AAATAGGCGG ATAAATCCAT CTCAAGTTCT GTAAATCCTA CGTTCTTGAT 7020 

AAAGCGAATA GGTTGGTCAC TAAGACAGGT CTTAGCATCC AAACCACTAG GATTCATATG 7080 

GGCAATCATT TCAGCTCGAT TGACCAAGAT TTCTAGTACA TCATGAGGCA GATCAGCCTG 7140 

ATAGTAGTCA AATACTGCAC GAATGGCCGC TATGCTGATA GCCGCTGACG AACCCATCCC 7200 

CCGTTTCTCA GGGATAGCCG AGTCAATCTC ACAACGAATG CAGGCTTCTG TGATATTCAA 7260 

ATACTCCAGT GAGGCATAAA CCGCCATGGA CAAGGTATCC TCCTCATAAA GGCGCCAAGG 7320 

ACTCTCTGCA GGAACTACCT TACAGGTCAC CTCCACCTCC AAAAGAGGCA GGGAAATGGC 7380 

AGGATAACCG TAAACGACCG CATGTTCCCC TATTAAAATT ATCTTACTAT GTGCCTGACC 7440 

GACACCAACT TTTTTTGTCA TTTTTTCCTT TTACTAGACG AAAAAACGTC TTATTTTTCA 7500 

TACAAGTATT AATTCTTTCC TATCTATTTT ATTATATTTT CACAAAAAAA GCGATTGTTT 7560 

CCATTCACAA TCGCTTCTTT CATTATTGAA CCCATTCGCC ATTATAGTTG ACAGAATAGC 7620 

CATCTACGGT CGTATTCACT GCCAAGGCAC CTGAGCGCTA TAAGCGTAGT ACCATCTGCC 7680 

ATTGACCTGG AACCAACCTG TCGTCATAGA ACGACGAAAG AAACTCCATA CCATTAAGTA 7740 

AAGAGGAAAG TCGTGAGGGA GCATGCGCCA TTGACAACCT GTTTTAGTGA CGTACAAAGT 7800 

CTCATTAACA AGTACTCGTT TCGGCCATTT ATAGGTGCGG TGTTTGGAGA AATAGGGTTC 7860 

AATCTTCGCC CATTCTTGAT CGTTTAAATC AGTATCATAT GCTTTGCGTA TCATAACTCT 7920 

AGCTTAACAT TTTTTTGTGA ATACAGGTTC TAAATAATCG ACCACGAAAA TTTCTTAAGT 7980 

GGAAAACGCC TTATGAAGTA TGCTACGGGA AAGTTATGCA CTTAATTTGA CAATTCAAGA 8040 

TGTAAAAATA TATACTATAG TAGATTGAAA CTAGAATAGT ACACCTCTAC TTCTAAAATA 8100 

TTGTTAGAAA TCGATTTGAC TGTCCTGATC GATTTATCCT GTTATTATCT CATTTTACTA 8160 

TAATATTTGA TAAGTTATCC TAAAAGTATT ATTATGTTGT TGTGTTATAG ATTGATTGAA 8220 

TCTAACTAAA GGATCCTATT CAATTACTAG AACTATCACA TACTCAAGGT CAGCTCACAG 8280 

ATGAGCAACT ATTTTGGTTA CAATGTCTAC TAAATTTAAG TCAAACAAAT AATTTAGTCA 8340 
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AAATTAAAAA AATAGAGGAA CATAAATATG ATTACAAAAC AGAATGTAAT AGTGTTCTAC 8400 

AATTTTTACT AGATAAAACT GTAAATTCTG AAGGAAGGAT CACTTCTTCA ACAGAATTTG 8460 

GAAATTTCGT AAGTAATTTA TCATTCCAAC ACGGAATAGC TGGACTACTG TTTCCTCTAA 8520 

ATAAATTGTA CCCCCCAGAA CTGGATTCTA AAATACTCTC TATCATCAAG AAGGCAGTGA 8580 

CAATTAGAAC GACACACACA TATGAATATC AATACTCACT GCTATTTGGT GATGCAGGCT 8640 

ATCTATGGTT ACTCCTACAT TTATTTTCTA TCAGTAAAAA TCAATACTAT CTACAATTAG 8700 

CAAACGTCAC CGCTAAAAAA TTAATAGAGA ATTATGATAC TCTAGAGGAA ATAGACTTTG 8760 

CATTGGGAAA ATCTGGTGTC CTATTATCAT TAATAAAATA CTATCAATTT ACCAATGACA 8820 

ATACTCTTAA AATTTTCATC CACAATAGTA TAGGGGAAAT TTATCATTAT TTCCTACAAA 8880 

GAGATACAGC CAAAGAAAGC ATTTTAGACT ATAGCTTTGC TCATGGATAT TGTGGAATTG 8940 

CATATGCTTT ATTTGCCTAT TCTAAAGTCT TAGAACCTTC TATGTTTTAT AATGATCTCC 9000 

ATACATTCCA TACTGAATTA AAAAAATTAT TAGAAAAAGT TACTTCTAAT ACTGAAAATT 9060 

TAGGAAATTT ACAACTTTCT TGGTGCAAAG GAATTTCCGG AATAATCTTA TATCTTTGTA 9120 

TGTACGATTG TGACGGAAAC AAAGATATTA TTAGTAAATA TCAAGAATTT GTTTTTAACC 9180 

ATCATCTAAA AATGATGACA GGATATTGCC ACGGAATAAC TAGCTTACTA CAAACCACTG 9240 

TCTACAATCA AAACAAATTA CTGATGAAAA AAATCCAACA GGTAATTTTA GCATGTTCTG 9300 

AACGAGATGA TCACGGTTTA CTGATGTTTC AAGGAGATAG TGGTAAAGCA GATTTGTTTG 9360 

ACTTCGGAAT AGGAAGCATG GGGTATATTG GTGTCTATTA AATAATAAAT TCCCATTCGA 9420 

TGTGCAGACA TAAGGAGAAA AGTATGAAAT TATTTTGGAC AAACAACATA TATAGACAGT 9480 

TGCTGCTAAA CAGCTGTTTT TCATCATTCG GCGACAGTAT TTTCTACCTC GCCATTATCA 9540 

ATTATGTGGC TCAGTACAAT TTCGCTCCGC TAGCGATTTT ACTGATTTCC ATTTCAGAGA 9600 

TGGTTCCCCT ACTATCGCAA CTCTTTCTCG GGATTCTAGG AGATTTTCAA GAAAATAGAG 9660 

TCAAACACGC ACTCTGGATT GCCAAAATCA AAATCCTGCT CTACGCTATT TTGACAGTAT 9720 

TTCTCGTCTT GTCGCCCTTT TCATTAGTTT CAGTCATTAT GATTGTCATC ATCAACCTCA 9780 

TCTCTGACAC CTTGAGCTAC CTGTCTGCCT ACATGATGAA CGCCCTCTAC ATCAGTGTAA 9840 

TTAAGGACGA CCTGCATGAT GCCATGGGGT TCAGGCAGTC TCTGATGAGG GTTGTCCGTA 9900 

TTGTCGCCAA TCTGGCTGGC GCATTCCTTA TCAATGTTAT AAGTATTCAA ACTATTTCCC 9960 

TTATCAACAC TCTGACTTTT GTCATTGCCT TTTTGGGCCT GTATGTTATT CGACATACCT 10020 

TGTATGAGGT TGAAAAAAGA ATTGAAATGT CACATACAGC ACTGAGTTTT AAGAAATATT 10080 
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TTCAACATCT TAAACAGTCG CTGGCTGTGC TCCTGAGGTT AAAAGATACC GTCATACTAC 10140 

TGTTTCTGAC GACCAGTATG ATTGCCATCT TGGATGTGTC CCCTCGGCTG ATTGCCCTCC 10200 

GCTTCATCCA ACAGACACTA GCACAACTGA GCATTGGGCA ACTCCTCGCC CTGCTCTCCA 10260 

TCATCATGTC TTGTGGAGCT ATCCTTGGCA ATATGACCAG CAGTAATCTA TTTAAAAATA 10320 

TCCGTTTCAC GCACCTCTTG GTTTTCTGTG AGATTTCCCT ATTGACTCTA ATAACTAGTA 10380 

TCCTTTGTCA AGCCTATATC GTAATTTTCA TGACCAGTTT CATCAGTTCT ACGATTATCG 10440 

GCATTCTCAG CCCTCGCCTA CAAGCAGCTG TCTTTGCCCA TATCCCCAGT GACAAGATGG 10500 

GGACGGTGGG CTCTGCTCTG AGCACAGTGG ACATTCTCGC CCCGTCCCTG CTCTCCCTAT 10560 

TAGCCCTATC CATAGCATCG GGCGTTTCGG TGCAGTTAGC ATTGATATTT TTGTATCTTA 10620 

TTTTAATTGC TCTTATCTTT TGTCAATGGT TAGTCAAGTT CAACACTCAT AACTAACGAA 10680 

AAAGCATGTG TAGATTTCAC ATGCTTTTAA TCTCCCCAAT CGTCAGGTCA AGTACAACAA 10740 

AGTCACTTCT TTGATTAAGC GAGTGTTCTA ATATAATTAT AAGCGCCCTG TCATTACCGA 10800 

ACCCATTCGC CATTATAGTT GACAGAATAG CCATCTACGG TCGTATTCAC TGCCAAAGCA 10860 

CCTGAGCTAT AAGCATAGTA CCAGTTGCCA TTGACCTGGA ACCAACCTGT CTTCATGTCT 10920 

CCATTACCTG CATTTAGGTA GTACCAAGTT GAACCATCTT GATACCAACC AGTTGCCATA 10980 

GCTCCTGATG AACGGAGATA GTACCATTTG TTCCCAAGGT TTTGCCAACC TGTTTTCATA 11040 

TCGCCATTTG GGTGGTCTAA ATAATACCAA GTGGTACCTT CCTGATACCA GCCAGTGGCC 11100 

ATTGCTCCTG AGGAACGGAG GTAGTACCAC TTATTACCTA GATATTGCCA ACCTGTTTGC 11160 

ATAATACCAG TTGTTGGATC TAGGTAGTAC CAAGTCGAAT CATCGTTTAT CCACCCCGCA 11220 

CGTCTTTCAC CACCAAGGTA GTTTTCTCCA TTAATTTCCG TCTTAGCTAG ATAATACCAG 11280 

TTAGACTGAT CATAAAGCCA ACCTGTCTCT AAAGAATGAT TTTGATTAAA GTAATAGTTC 11340 

GTATAATAAC GCTTCTCTTC TTTATCTTCT GAATCTTCAC GTTTTTCCCC GTACTTTCTT 11400 

CCAACACTGT CTTTAGTTTT AATCTCTAAT GTTTTCCAAC CAACAAACTC TTGTAGCACT 11460 

CCATTTTTAT CGAAGTAGTA CCACTCTGAC TTTGGAAAAC CTTCTAATCT GATACCATTT 11520 

GGGTAAGGAC CAATTGTACT ACCTTTAGAT GGAAACGGGA TATATTGCCA GCCGACAACC 11580 

ATCTCTCCAG ATAGAGAATC AAAATAATAG TACTTACCAT CAATCACTCG CCAGTAGGTT 11640 

TCTTTGAGGT CCCCCTTTTT GTAGTAGGTT CTTCCGTTTT CTTGGACAAA CTGCCATCCT 11700 

TCAGAATCAT CTGCAAATAC TGTACTGGTC CCTAGCAAAC CAAAGAAAAA TACTGTCAGT 11760 

CCAACTTGCA TAGTTTTTTT CAAAATTTTC ATCTATATAC CCTCCAATAT TAAATCCACT 11820 

CACCAGATGA GGCGAAATTA TAAACTTTAC CATCGATAGT TTGGCTACCT GTAACCATTG 11880 
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CTCCAGG 11887 
(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11340 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

CCGGTATGTT CTGGAATACT ACCAATCTAA GCTGGCTGTG CCCTACAGTT TTACAACCCT 60 

GTACGAATAC CTTAAGGAAT ATGACCGATT TTTCAGCTGG GTTTTGGAGT CTGGTATTTC 120 

AAACGCTGAT AAAATATCCG ATATTCCTTT ATCAGTTTTG GAAAATATGT CTAAGAAAGA 180 

CATGGAATCC TTTATCCTTT ATCTACGTGA ACGTCCCTTG CTGAATGCTA ATACAACAAA 240 

ACAAGGTGTT TCACAGACAA CTATCAATCG AACCTTATCA GCACTTTCTA GTCTTTACAA 300 

GTATCTAACC GAGGAGGTTG AAAACGATCA GGGGGAACCT TATTTCTATC GTAATGTAAT 360 

GAAAAAAGTT TCCACCAAGA AAAAGAAAGA AACCCTTGCT GCCAGAGCTG AAAATATCAA 420 

GCAAAAACTC TTTCTAGGTG ATGAAACAGA AGGTTTTCTA ACTTATATCG ATCAAGAGCA 4 80 

CCCACAACAG CTTTCAAATC GAGCTCTCTC ATCATTCAAC AAAAATAAAG AACGAGATTT 540 

AGCCATTATT GCCCTTCTCT TGGCATCTGG TGTTCGCTTA TCTGAAGCTG TTAATCTAGA 600 

TCTAAGAGAT CTCAATCTAA AAATGATGGT TATTGATGTT ACTCGAAAAG GTTGCAAACG 660 

TGACTCAGTC AATGTCGCTG CTTTTGCTAA ACCTTATTTA GAGAATTATC TGGCCATTCG 720 

GAATCAACGC TATAAAACGG AAAAAACAGA TACAGCCCTT TTTTTAACTC TCTACAGAGG 780 

TGTTCCTAAT CGTATCGATG CTTCTAGCGT TGAGAAAATG GTTGCTAAAT ACTCAGAGGA 840 

TTTTAAAGTG CGTGTAACAC CCCATAAACT GCGCCATACA CTAGCAACTA GGCTCTATGA 900 

TGCGACTAAA TCACAAGTTT TAGTC AGTC A CCAACTAGGA CATGCTAGCA CACAAGTCAC 960 

TGACCTCTAT ACCCATATTG TTAGTGATGA ACAAAAGAAT GCTCTGGATA GTTTATGATT 1020 

TTACGTATTT TAAATTATGT AAATAAATAT CAAAAAAAGA AGTTGGCCAA CTTCTTTTTG 1080 

ATTTATCCAA CTACCGCTTC AGCGATTTCT TCACGGCTAA TACCAGCGAA GTAGCGTGTG 1140 

ATATCAATGG TTTTTAGCGC CTTAAGAACA TCTTCGCGTT CGTATTTCAC CCCACGAAGG 1200 

ACATCTTCTA CTGCAGCAAC GTCTTCAATA CCAAAGAAGT CACCATAAAT CTTGATGTCT 1260 

TGGATTTTTG ATTCAGTAAC GTTAGCAAAG ACTTCAACCT TACCACTAGT GAATTTGATT 1320 
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CCACGACGGA CGTTAAATTC AGGTGATTTA CCATAGTTCC AGTCCCAAGT TCCAAACTTA 1380 

GTATCCTTGA TGCGATTGAT TTCGGCCAAT TCTTCTTCTG AAAAGACGTA TTCAGTCATC 1440 

TCTGGGTACT CTTTTTTCAT GTATTCCAAG AGTAAATCAC GGAATTTTTC GACTGTGATT 1500 

TTTTTTGGTA ATTCATTGAT AATATTGGTT ACACGGGCAC GGACGGATTT CACACCTTTT 1560 

GATTCAAATT TATCTTTTGA AACCTTAAGG GCATTTGCGA GGACTGACAA ATCAACGTCA 1620 

AAGAGCAAGC AACCGTGGTG CATGATACGG CCGTTGATAT AGGCTTGGGC ATTGCCACAG 1680 

AACTTCTTAC CATCAATCTC AAGGTCATTA CGACCTGTGA ACTCAGCTTT AACCCCAAGT 1740 

TGAGCCAGGG TATTGATAAC CGGAGTTGAG AAGCTCTTGA AGTCAAATGC CTTATTTTCA 1800 

TCTTCTTTGG AGATGATCGT GTAGTTGAGG TTATTTAAAT CGTGGTAAAC AGCTCCACCA 1860 

CCACTAATAC GGCGAACTAC CTCAATACCA TTTTCGCGAA CATAATCACG GTTGATTTCT 1920 

TCGATAGTGT TCTGGTGACG ACCAACAATG ATAGATGGCT TGTTAATCCA AAGTAGGAAG 1980 

ATTTGATCCT CATCCAAAAG GTGTTTAAAG GCGTATTCTT CCAAGGCAAT ATTAAAAGCA 2040 

GTGTCATTTG AATGATTGAT AATGTATTTC ATGATATCCC TTTACTTTAT ATGATAGAAA 2100 

CTGGAAATAA CCTTCCAGTC TAATCTATCT TCGTTTTATT TTTTCTTAGG TGAATGGATG 2160 

GCCATTCCTA GAACATCTGC AAACGCTTCG TACATCACTT CAGAGTAAGT TGGGTGCCCG 2220 

TGGATGGTCT TCAGCATTTC CTCAACAGTG ATTTCCATTT CGATGATGCT TGATGCTTCG 2280 

TTTATTAATT CTGCGGCTGC AGGACCAATA ATGTGTACAC CAAGGATTTC TCCGTATTTC 2340 

TTATCAGCGA TAACTTTTAC GAAACCTTGA GCTGCGTCAG ATGCAATAGC ACGACCGTTA 2400 

GCAGCAAAGT TAAACTTACC GATGGCAACA TCGTATTTCT CACGGGCTTG TTCTTCTGTC 2460 

AAACCTACTG CTGCTACTTC AGGGAGAGTG TAGATGGCTG CAGGAGTCAA ATTCAATTTG 2 520 

GCAACTGCAT GATTTCCTTT AAGGGCATTT TCAGCGGAAA CTTCACCCAT GCGGAAAGCT 2580 

GCGTGAGCCA ACATCTTAGT ACCGTTGATG TCACCTGGTG CATAAATGCC TGGAACTGAA 2640 

GTTTCCATGT ATTCGTTGAC CTTGATACAA CCACGATCCA ATTCAAACTC AACCTCTCXA 2700 

ATACCTTCAA GGTCTGGCAT ACGACCAATT GAAAGAAGAG CTTTGCTTGC GATGATATCG 2760 

TCTTTTCCTT CAACCTTGAT ACGAAGTTGA CCATTTTCCT CAATGATTTC TTGCAGTTTA 2820 

GTACCAGTCA AGATGGTCAT TCCTTTACGC TCAAGAATCA AGCGAAGGTT CTTAGAAACT 2880 

TCCACATCCA TAGCTGGAAC TATACGGTCC ATCATTTCGA TAACAGTCAC TTTTGAACCA 2940 

AATGTCATGA AGGCCTGACC GAGTTCGATA CCGACAACTC CACCACCGAT GATAACAAGG 3000 

CTTTCTGGCA CTTCGTTCAT TTCAAGAATG TCATCACTAG TCATGACAAG TGGAGATTCC 3060 

ATACCAGGGA CGTTGATCTT GTTGACTTTT GAACCACCAG CAAGAATGAT TTTCTTGGTT 3120 
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TCAAGCAATT CAGAACCATT TACCAAGACG 
TTATGAACAG TAACTCCGTA GCTACGAAGA 
ACAACTTTAG ATTTAGTTTC TAAAAGTTTT 
ATCACGATAC CACGATTTGC AGCATGACCG 
TAGGTCTTGG TTGGAATACA TCCACGGTTT 
ACAAGGGCAA CCTTACCGCC GAATTGGGCA 
CCTCCACCAA TCACAACGAT ATCAAAAGCA 
CTTGCTACAG GTACAGGGCT AGCTTCTGGC 
CTTTCTTCAC CAAGGTAACC GATAACTTCC 
AGAATGGCAA TCAAGTACCC ATCTTCTTCG 
ATGATTTCCA AAAGGATTTC TCCTTCTTTT 
ACGATTTGTC CTTCTGTCAT ATCCACGCCG 
TCTTCCTTCC TTTATCTATA TCTTAAAAAT 
ATTGGCGTTT CAATCAACTC TTTCAAGTCC 
ACGACACGGT GGTCAATGGT TAATCCTAAA 
TTGACGACAA CTGGCTTCTC GATTGTCGAA 
TTAATAATCG GACCAAAGGA CTGAACACCA 
GAATTTTGTA ACTCACTTGG AGCCAATTTA 
AAGGCTACAA CCAGTTCTGA AAGACTCATC 
AATCCATTAT CCATCCCAAC TGCCATGGCA 
TTGCCATCTT CTGTCAATGA AGCGTTGATG 
GCAAGCGAAA GAAGGTCTGT TACAGTAGTC 
AGAACCTTCT TACGAAGAGC CAACATTTCA 
GTTGGCGCAG TCAAGTAAGA TTCAACCATG 
ATTGGAATAC GCTCGATTTT ACCATATGGT 
TCAATCTGAG CAGGAGATTT GATGCTATCG 
ACATCCTTCT TCATGATTTT ACCACGATGA 
TTATGTTCGA GGGCAATTCG TTTTGCAAGT 
TAAGTTTCCA CGTCTTCTTT GTGGACACGA 



987 

TTCTTGTCTT TAGTGATTGT 
AGTCCTGCAA CACCACCAAC 
TCCATATCAA CAGTGAAGTT 
ATATTTTCAA TAATTTCAGC 
AAGCAGGTTC CACCAAGTTC 
GCTTTAATGG CTGCAACATA 
TCATCGCTCT TACCATCATC 
GATGCTGCTC CAGCTGTTGG 
GTTACAGGGA CAGTTTCACC 
GCTTCCAATT CCATGCTGAC 
ACAAATTCTC CGACTTTTTT 
GCTTTTGGCA TAATTACTTC 
GAATACTCTT GCTCTTAAAT 
TTCATAAACT TAGCACCAGC 
CTCATGATTG GGCGAATCAC 
CTGACACCAA GGATAGCTGA 
AACATTCCCA AATTACTGAT 
CCATCCAAGG TACGGCCAAT 
TTCTCAGCAT TGTAAACAAC 
AGATTGACAT AGTTGTGAGT 
TATGGGTGTT TCATAAGAGT 
TTCTTCCCAG TTGCTTCCAT 
GTCATATCAA CTTCATAGTT 
CGTTGGGCAA TAACCTTACG 
GTTACGTTAT CAGGGACTTC 
TTTTCGATAT TTTCAGGAAG 
CCGGTTCCTT GGATTTCCTG 
GGCGAAATGC GAACCACGTT 
CCGTTTGCAC CTGAGCCAGA 



ACCAATTCCT 3180 

AAGAGTATTA 3240 

AGGATTTTCA 3300 

GTTATGAAGG 3360 

AGATTTCTCA 3420 

ACCAGCAGGA 3480 

GTTTGAGGTA 3540 

GATGTTTTCC 3600 

ATCTCCTTTG 3660 

TTTATCAGTC 3720 

ATTCCATTGG 3780 

TAAGGCCATG 3840 

TAACATTGAG 3900 

CATACCATCT 3960 

AATTTCACCA 4020 

GTTGGGTTGG 4080 

TGTGAATGTT 4140 

AACATCCTTA 4200 

AGGTGTCATC 4260 

GATAATAGTC 4320 

CTTAACAACT 4380 

GATTGGCTCA 4440 

GAGGGTGAAG 4500 

CATTGGTGTC 4560 

TTCCACTTTT 4620 

CAGGGCCAAA 4680 

CCAAGCAATG 4740 

TGTGTCTTTA 4800 

AACGTCGTAG 4860 
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AGGTTTATCC CTAAATCATC CGCTAACTTT CTAGCTGCAG GAGTCGCTCT TAGCTTGTCA 4920 

TCAGCCATGA CCTCTCCAAT TCTATTTATG ATACAAAGGG CGTCAAAAGC GACTGAAAAA 4980 

TAGGAAATCG ACGATGGCTT CGATGAAGCC AAGGAGATTT ATCTTTTTTC CGATCTTTTA 5040 

GCCCGTGCTC TAATCTAAGA TATTAATGAC GAAGAGCTCT GCACCTAAAA GATACAAAGT 5100 

TTCTCGTCAG CTTTATTTTA TTTACATAAC TTATCTTATG TAACCCTATT CTTTGTTATA 5160 

AGTTTTTCGG ATTGCATCTT TGATACTTTC AACTGTTGGA ATCATTGCAT TTTCTAGGTT 5220 

TTGTGCATAA GGCATCGGCA CATCTTCTCC TGCACAACGG CGAATTGGTG CATCTAGATA 5280 

GTCAAATGCT TCTGATTCTG AAATAATAGC TGAAATTTCA CCGATATAGC CACTTGTTTT 5340 

GTGGGCATCG TTGACCAGAA CAACCTTACC AGTCTTCTTC ACTGAGTTTA TGATGATATC 5400 

CTTATCAAGC GGAACAAGGG TACGTGGGTC AACAATTTCA ACTGAAATTC CTTCTTCTGC 5460 

TAATTCTTCA GCAGCTTGAA CCACACGGCG AAGCATTTTT CCATAAGTAA CAACTGTTAC 5520 

ATCCGTTCCT TGGCGTTTGA TTTCACCAAC CCCAAGTGGA ATTGTGTAGT CTGGATCAAC 5580 

TGGCACTTCC CCTTTTTGGT TAAATTCTGA CTTGTACTCA AGTATAATAA CTGGGTTGTT 5640 

ATCACGGATA GAAGACTTAA GCAGGCCTTT CATGTCCGCA GGTGTTCCAG GTGCCACAAC 5700 

CTTAAGTCCT GGAATGTGAG TAAACCAAGA CTCTAGAGAT TGTGAGTGCT GGGCGGCAGA 5760 

GCCAACTCCG TTACCAGCTG CACAACGAAC AGTCATTGGA ACCTGACCTT TACCACCAAA 5820 

CATGTAACGT GTTTTAGCAG CTTGGTTGAC GATATTGTCC ATGGCAATAA CAGAGAAGTC 5880 

CATGAAGGTC ATATCGACGA TTGGACGAAG TCCTGTCATG GCTGCTCCTG CTGCTGCTCC 5940 

AGAGATGGCA GCTTCAGAAA TCGGACAGTC ACGGACACGT TCTGGACCAA ATTCTTCAAG 6000 

CATTCCAACA GAAGTACCGA AGTCTCCTCC GAAGACACCG ACGTCTTCTC CCATCAAGAA 6060 

CACATTTTCA TCGCGACGCA TTTCCTCAGA CATAGCAAGG ATAATGGTGT CACGGAAGGA 6120 

CATTGTTTTT GTTTCCATTT TATCTCTTTC TCCTTAGTCT GCGTAAATAT CTTCAAAGGC 6180 

TGATTCAAGC GGTGGGAATG GGCTTTCCTC TGCAAATTTA ACAGAAGCTT CTACTGCTTC 6240 

CTTTACTTGC GCTTGGATTT CTTCCAATTC TTCGGCACTT GCAATGTTAT TTTCAATAAG 6300 

GTAATTGCGG AGGTTTTCGA TTGGATCTTT TTGTTTCCAC AATTCCACTT CTTCACGCGT 6360 

ACGATATTTA CCAGGGTCAG ATGATGAGTG ACCGAGCCAG CGATAAGTTA CACTTTCAAT 6420 

CAAGACTGGA CCATTGCCAC TGCGAACATG GTCCACAGCT TTCTGAAATC CTTCATAGAC 6480 

ATCGATGACA TTGTTACCGT CTTCGATGAA CATTCCAGGA ATTCCATAAG CGGCGCTACG 6540 

TTGATGGATA TGTTCTATAT TGGTCATTTT CTTGATATCC GCAGAGATAC CGTAACCGTT 6600 

GTTAATGCAA TAGAAAATGA CTGGCAGGTT CCAGATAGAA GCCATGTTCA CTGCTTCGTG 6660 
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GAAAACACCT TCATTGGTCG CACCATCTCC AAAGAAGCAG ACAACGATTT TACCGGTATT 6720 

TTGCATTTGC TGACTGAGGG CTGCACCGAC AGCGATCCCC ATACCACCAC CTACGATACC 6780 

ATTGGCACCA AGGTTCCCAG CATCAAGGTC AGCGATATGC ATAGATCCAC CTTTCCCTTT 6840 

ACAGGTTCCA GTGTATTTAC CAAGGATTTC AGCCATCATT CCGTTGAGGT CAATCCCTTT 6900 

AGCAATAGCT TGCCCGTGTC CACGGTGGTT TGAGGTAATC AGATCATCTG GATTGAGAGC 6960 

TAACATAGCC CCCACGTTAG CTGCCTCTTC ACCAACAGAA AAGTGCGTCA TTCCTGGCAC 7020 

TTTCCCTTTC TTTACTAATT GTGCAATTTT TAAGTCCATG CGACGGATTT CTTCCATCTT 7080 

ACGGAACATT TCTAGCAAAA GATTTTTATC TAAAGTTGAC ATCTTCTTGC CTTTCTAACT 7140 

TTCTTCTTAC CTTACTATTT TACCGCTTTT GGCAAATACT GTCAAAGTTT TTCTAAAAGA 7200 

AATTTCACAA AATAAAAAAG AAAACCCCGT GAAAACAAGG GATTTTCTTG TCAAGAATAT 7260 

TTTTTCACAA ACTTTTTAGC ATTTGGATTT TGCTAAAGAT TCAAATCTCT TCATAATCAC 7320 

AGTTAAACGC CAACGGTAGA GCGCCCCGCT C ACAATC AAA CTAATAATCA AGCCGATCCA 7380 

GTAAGAATAA GCTCCAAAAT CTGTTAGGGA ATCAAATAGC GTAnCACAGG GATTGCTACG 7440 

CCCCAATAAC CAAGCAAACC AAGGTAAAAA GGAATAACTG TATCCTTATA CCCCCGCAAA 7500 

ATTCCCTGAA GCGGCGCCGC AAAGGTATCT GCTAACTGGA AGAAAAGACT ATAAGTTAAA 7560 

AAACGCACTG TCAAATCGAT AAATTTTGGG TCGTTACCAT AAAGACTGGC CACATTTCCC 7620 

CTAAAAATGT AAAGGAAGGT TAAGGTGAAG GCCGCAAAAA TGAGGGCAGT CCATCTTCCT 7680 

AGACCAATAT AGGTTTTCGC ATCATCAAAT CGCTTGGCTC CCACTTCATA GGAAACGACA 7740 

ATAGCCATAG CCGATGAGAT ACTCATAGGA AAGGCGTACA TAAGACTTGA AAAGTTCATA 7800 

GCTGACTGGT GACTAGCTAT AATCAAGGGC GAAAACTTAG CCATAATCAA GCCAACCACT 7860 

GAAAAGATAG CCACTTCCGC GAAGACAGTT CCCCCAATAG GCAGACCTAA ACGAACTCCT 7920 

TCCTTAATTT TATCCATATT AAGTGGAATT CGTTTCTCAA GGTGTAAGGC TTTGAGCTTC 7980 

TCCTGTTTAA ATAAAACCAG AACAGAAATC CCAAGCAAGA CCCAGTAGGC CAAGGATGTT 8040 

CCTAAACCAG CACCAGCCCC TCCCAGTTCT GGAACACCAA AGGCACCGTA AATCAAGAGA 8100 

TAGTTAAATC CGCTATTGAG AGGGAGTAAC AAAAGCATGA GGTACATGGA CAGTTTGGTC 8160 

AAGCCCAGCG AATCCAGCAA GGAACGAATG ACGCTAAAGA GCAACAAGGG GATAATCCCG 8220 

ATAGATAAAA ACCAAAGATA GCGAACCGCT ACTGCCGCTA CTGCTGCTTC TAACCCAATA 8280 

TGATTCAAGA TTATTGGTGC CAAGAAAAGT ACCATCCCCA GCAAGACCAC AGATAGGCCC 8340 

AAGGCCAAAT AAATAAATTG GTAAAAATCA GACGCAACTT CTTCCTTTTT GCCTCGACCA 8400 
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AGATGGTGAC CAATGATAGG CACCAAGGCT GACACAATCC CTGTTAGAAA TGTAAAGAAA 8460 

GGATTCCAGA TACTGGTTGC CATAGATACA CCAGCCAAGT CCATAGTGTT GTATTGACCT 8520 

GTCATTGCAG TATCAACAAA AGAGGCAGAA TAATTGGCAA ATTGGTAGAT CAGGATTGGG 8580 

AAGAAAATTT TTAAAAATAA TACTAACTTC TCTCGTAAAC ACTTTGTCTT ATACATACTT 8640 

CTCTTTCTAT TCTGATTTAT CTAAACCAAA GAGTTTCAGA CCATAGTTTT TCAAACTTAG 8700 

CGGAGGTTTA TTAGATTTTG AAGTAGTATG CCAACACGCA CATGTACGAC AATAATAGCT 8760 

TCTAACTAAA CCTCCGTTAT CATATTGAAC CGCATGGTCA GCTTTTTCTT TAGTTTCATA 8820 

TTGAATTTTG GAACGATTAG CTGCGGGACA GTAAATTCCA CTATTAGATT TCGCTTGTCT 8880 

CTCCCTACGT TTTCGAAAAT AATTCATATT CTAACTCCTA TCAAGCTTGA TAGACGATTT 8940 

GTCCCTTACA GATGGTATAT TTAACCTGCC CTTTTAAGGT TTCACCGATG AATGGTGAAT 9000 

TAGCTGCTTT GGAAGCAAAA TGGGAGTCCA CAAAGCGGTC AGCCTTGGCA TCAAAAATAG 9060 

TGATATCTGC TGGACCATTC TCAGCCAAGT AACCTGCTTC AAAGTTGTAA AGCTTGGCTG 9120 

GGTTGTATGT CATTTTTTCA AGTAATTCCA TCAAGCTCAA CTCACCAGCT TCTACTAAAT 9180 

AGGTCAAGCT GAGAGACAGG GATGTTTCTA AGCCAGTCAT ACCAGATGGC GCTTTGGTAA 9240 

TATCCTCAAC ATTTTTTTCA TCTACATGAT GAGGCGCGTG GTCAGTCGCA ATAACTGTGA 9300 

TGACACCTGA TTTGAGACCT TCGATAACGG CACGACGGTC TGATTCCAAA CGAAGCGGTG 9360 

GATTCATCTT AGCATTGCTA CCTTGTGTTA AAAGAAGTGC TTCTGTCTTA GAGAAATGCT 9420 

GTGGCGCTAC TTCTGCTGTG ACTTCTGCAC CTAACCCCTG AGCAAACTCC ACTACTTTAA 9480 

CACTTTCTTC CTTAGACAAA TGCTGGATGT GAACATGGGC TTTAGTTGCA TAGGCAATCA 9540 

TGACATCACG CGCCATCATA GCGTACTCAG CCACCCCAGT AGCACCGCAG ATATGGAAAT 9600 

GTTCTCTAGC AATATTTTCA TTAAAGCCAA GAACACCGTT CAAACCTGGA TCTTCCTCAT 9660 

GAAGGCTGAT AAAGGTATTG AGTTTTTTGG CTTCCTCCAT GGCTTCCTTG ACAATCTTAC 9720 

TGCTCTCAAG CGGAATACCG TCATCAGAGA AACCAACCGC ACCAGCTTCT AAGAGTGCCT 9780 

TAAAGTCAGT CAAGTTTTTA CCATTAAAGT TTTTAGTAAT GGTCGCAACT GTCTTGACAT 9840 

TAATCTTCTC TTTGGCAGCT GACTGGAGAA CTGCTTGCAA AGTCTCCACG TCTGAAATGG 9900 

TTGGACTGGT ATTAGCCATC ATGACGACAG TAGTAAAACC ACCTGCAGCG GCTGCTAGGG 9960 

CACCAGTATG AATGTCTTCT TTATGTGTTT GACCAGGTTC ACGGAAATGA ACATGAATAT 10020 

CGACCAAGCC AGGAGCAACC ACAAGACCAG TAGCATCAAT CGTTTCTGCT CCTTCTTCCG 10080 

TGATCTCAGA CGCAATTTTG ATAATTTTCC CATCTTGAAC TAAGACATCA CAAACTTGAT 10140 

CCAAACCAGA CTTGGGATCC ATTACACGAC CATTTTTGAT TAGTAGCATC TGCTTTCTCC 10200 
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TTTATTCATA GAAATCAACT TGGGTATCCA ACAATTTATC CCCATCATAA ACAAACTTGG 10260 

CTGAAAAGAA GGGTTTATCC TCTAAAAGCC ACTC AAC AAA GGTGTGGTCA CCTTCCCAAG 10320 

TCGGCTTGCT CAAAACCTCA TCATAGGGAA CCCATTCTAG CGTCCCCTCA TTGCAGTCAA 10380 

TCAAGTCGCC CTCAAACTCC GTCACCTTAA AAACATAGGT GTACCAGTCT AAATCTGGTG 10440 

TAAATTCAGG AAAAGTGATG ACACCTTTTA GAACTGGCTT GGCTTTGAGC CCTGTTTCTT 10500 

CAAGGATTTC ACGCGCCGCG CATTCCTGGG GCGTCTCTCC TCTCTCTAGC TTACCACCCA 10560 

CACCAATCCA TTTCCCTTCA TGGACATCAT TGGGTTTCTT ATTACGATGG AGCATGAGCA 10620 

GTTCTTTCCC ATTATCAATG TAGCAAATCG TCGCTAACTG AGGCATATTT TCTCCTTATC 10680 

TAAGCCAATC GATTGGCTCT TGTCCTGTCT CTTTTAAGAA TGCATTGGCC TTGGAAAAGG 10740 

GCTTGGAACC CCAAAATCCT CTATAAACCG ACAAAGGACT TGGATGGGCT GATTCGATAA 10800 

TCAAGTGATG AGGATTGGTA ACTAATGCCT TCTTCTTACG TGCATAAGCT CCCCAGAGTA 10860 

CAAAAACGAC TGGTCTATCT AGATGATTGA CCACCTGAAT CACAGCATCA GTAAAAGGCT 10920 

CCCAGATTTG ACCAGCATGA CCATTGGCCT GTCCAGCAGG AACAGTCAAA CAAGCATTAA 10980 

GAAGCAAGAC TCCTTGCTCA GCCCAAGCTG TCAAATCATG AGATTTCTTA ACTCCGATAT 11040 

CATCTGACAA TTCTTTCAAG ATATTTTGCA AGGATGGTGG AGCTGGGATA GAGTCAGGTA 11100 

CAGAAAAACT CAAGCCCTGC GCTTGACCTG GTCCGTGATA GGGGTCTTGC CCTAGAATTA 11160 

CCACCTTAAC TTCTTCAAGC AGTGTTGTCA AGAGAGCCTG AAAAACCTTT TCCTTGGGTG 11220 

GATAAATAAT CCCCTGAGAA TAGACCTGCT CCATAAACTG ATTGATTTTC CCGAAATAAC 11280 

CCTCAGGTAA TTGCGCCTTA ATCAAAGCAT GCCAAGACGA GTGTTCCATA GCCGACTCGG 11340 
(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12127 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

AAAAAATAGA CTTGTTAGAC TATAAATGTA GTAAGCCTAC ACAAGAAAAA TACATAGAGA 60 

TAAAGGTGAT TATTATGAAA TTCAAAAAAA TGCTTACTCT TGCAGCCATT GGCTTATCAG 120 

GATTTGGGCT TGTTGCCTGT GGCAATCAGT CAGCTGCTTC CAAACAGTCA GCTTCAGGAA 180 

CGATTGAGGT GATTTCACGA GAAAATGGCT CTGGGACACG GGGTGCCTTC ACAGAAATCA 240 
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CAGGGATTCT CAAAAAAGAC GGTGATAAAA AAATTGACAA CACTGCCAAA ACAGCTGTGA 300 

TTCAAAATAG TACAGAAGGT GTTCTCTCAG CAGTTCAAGG GAATGCTAAT GCTATCGGCT 360 

ACATCTCCTT GGGATCTTTA ACGAAATCTG TCAAGGCTTT AGAGATTGAT GGTGTCAAGG 420 

CTAGTCGAGA CACAGTTTTA GATGGTGAAT ACCCTCTTCA ACGTCCCTTC AACATTGTTT 480 

GGTCTTCTAA TCTTTCCAAG CTAGGTCAAG ATTTTATCAG CTTTATCCAC TCCAAACAAG 540 

GTCAACAAGT GGTCACAGAT AATAAATTTA TTGAAGCTAA AACCGAAACC ACGGAATATA 600 

CAAGCCAACA CTTATCAGGC AAGTTGTCTG TTGTAGGTTC CACTTCAGTA TCTTCTTTAA 660 

TGGAAAAATT AGCAGAAGCT TATAAAAAAG AAAATCCAGA AGTTACGATT GATATTACCT 72 0 

CTAATGGGTC TTCAGCAGGT ATTACCGCTG TTAAGGAGAA AACCGCTGAT ATTGGTATGG 780 

TTTCTAGGGA ATTAACTCCT GAAGAAGGTA AGAGTCTCAC CCATGATGCT ATTGCTTTAG 840 

ACGGTATTGC TGTTGTGGTC AATAATGACA ATAAGGCAAG CCAAGTCAGT ATGGCTGAAC 900 

TTGCAGACGT TTTTAGTGGC AAATTAACCA CCTGGGACAA GATTAAATAA AATGTTTGCT 960 

CCATAAATCT CTAAAGAGAT GCAGACGTTT CATCGTACAA TAAGATAAAG AAGGCAAGTA 1020 

GGGAGGTGTC GTATCTCCCT TACTTTCTTC ACTAGAAAGG ACAAGATGTG ACAAAACAAG 1080 

CCTTCAAAGA AGCAGTTTTT AGGGCAATTT TTTTCATGAG TGCAACAGTA GCTGTTGTAG 1140 

CTATTTTGCT AATCTGTTTC TTTATTTTTA GTAATGGCTT ACCTTTCATA GCTAACTACG 1200 

GCTTTGCCCG TTTTTTATTA GGCAGTGATT GGTCGCCAAC GAACATTCCG GCAAGCTATG 1260 

GTATTTTACC AATGATCGTT GGTTCCTTAT TAATTACCTT AGGAGCGATT GTGATTGGGG 1320 

TGCCAACAGG CATCTTGACA TCGGTGTTTA TGGTTTATTA TTGTCCAAAG CCCGTCTATG 1380 

GCTTCTTAAA ATCAGCTATC AACTTGATGG CAGCCATTCC ATCTATTGTT TATGGTTTTT 1440 

TCGGCCTACA ATTATTGGTG CCTTGGATTA GAAGCTTTTT AGGAAATGGC ATGAGTGTCC 1500 

TAACCGCTTC GTTACTATTA GGAATAATGA TTTTGCCAAC CATTATCAGT TTGTCAGAAT 1560 

CTGCTATCCG AACAGTTCCC AAAACGTATT ATTCTGGTAG CTTGGCTCTA GGAGCTAGTC 1620 

ATGAACGGAG TATTTTTAGT GTCATCTTGC CAGCTGCGAG ATCTGGTATT TTATCAGCAG 1680 

TTATTTTAGG AATCGGTCGC GCAGTAGGTG AAACCATGGC AGTTATTTTG GTGGCAGGCA 1740 

ACCAGCCGAT TATTCCAAGT GGACTCTTTT CAGGAACCAG AACCTTAACA ACCAATATTG 1800 

TTCTGGAAAT GGCTTACGCA TCAGGTCAGC ATAGGGAAGC CCTTATTGCA ACCTCAGCAG 1860 

TTCTCTTTTT CCTTATTCTC TTGATTAATG CCTACTTTGC CTACTTGAAA GGAAAATCAT 1920 

CTTATGAGTA AATACCTGCT AAAACTTCTC GTTTATTGTT TTTCAGCTTT AACCTTTGGC 1980 

TCTCTCTTTT TAATCATTGG TTTTATCCTC ATCAAAGGCT TACCTCATCT AAGTCTATCC 2040 
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CTCTTTTCTT GGACTTATAC TTCTGAGAAC ATTTCCCTTA TGCCAGCGAT TATTTCCACC 2100 

GTTATTCTGG TCTTTGGTGC TCTTCTTTTA GCCTTGCCCA TAGGGATTTT TGCTGGTTTT 2160 

TATCTTGTGG AATATACAAA AAAAGATTCC CTTTGTGTTA AAATCATGCG ATTGGCCTCA 2220 

GATACCTTAT CTGGGATTCC TTCCATTGTT TTTGGTCTGT TTGGCATGCT CTTCTTTGTA 2280 

GTCTTCTTAG GTTTTCAATA CTCTCTGTTA TCAGGAATCT TAACCTCAGT TATCATGGTG 2340 

TTGCCAGTCA TTATTCGCTC AACAGAAGAA GCCCTTTTAT CTGTTAGTGA TAGCATGCGT 2400 

CAAGCAAGTT ATGGACTTGG GGCAGGTAAG TTACGGACTG TTTTTAGAAT TGTTCTACCA 2460 

GTTGCCATGC CAGGTATTTT AGCTGGAGTG ATACTAGCTA TTGGCCGTAT CGTTGGTGAA 2520 

ACAGCTGCCC TCATGTATAC ATTAGGTACC TCTACCAATA CGCCAAGTAG TCTCATGTCT 2580 

TCAGGCCGTT CTCTAGCCCT ACATATGTAT ATGCTGTCAA GTGAGGGGCT ACATGTCAAT 2640 

GAAGCCTATG CTACCGGCGT GATTTTGATT ATTACTGTTT TAATGATAAA TACTCTATCA 2700 

AGCTTATTAT CTCGAAAACT TGTGAAAGGA GCTTCCTAGT ATGGGAACAT TTTCAGTCAG 2760 

ACACCTAGAC TTATTTTACG GGGATTTTCA AGCCTTAAAA AATATTTCGA TTCAATTACC 2820 

AGAAAGACAG ATTACTGCCT TGATAGGCCC ATCTGGTTGT GGCAAATCAA CTTTTCTAAA 2880 

AACCCTTAAC CGGATGAACG ATTTGGTTCC TTCTTGCCAT ATTGAAGGCC AAGTCCTCTT 2940 

AGATGAGCAA GATATTTATA GTAGCAAATT CAACCTTAAT CAGCTACGTA AGCGTGTAGG 3000 

GATGGTTTTT CAACAGCCTA ATCCCTTTGC CATGTCTATC TATGATAACG TGGCTTATGG 3060 

CCCAAGGACA CATGGTATTC GAGACAAAAA ACAATTAGAT GCCTTAGTGG AGAAATCTTT 3120 

AAAAGGGGCA GCCATTTGGG AAGAAGTCAA AGATGATCTT AAAAAGAGTG CCATGTCCTT 3180 

ATCTGGCGGT CAGCAGCAAC GCCTTTGCAT TGCGCGAGCT TTAGCAGTAG AACCTGATAT 3240 

TCTGTTAATG GATGAGCCGA CTTCAGCCTT AGACCCTATC TCCACTTTAA AAATTGAAGA 3300 

CCTCATTCAG CAACTAAAAA AGGATTATAC GATTATCATT GTTACCCATA ACATGCAACA 3360 

AGCTTCACGT ATTTCAGATA AAACTGCTTT TTTCTTAACA GGAGAAATTT GCGAATTTGG 3420 

AGATACCGTT GACGTGTTTA CCAATCCAAA AGATCAGCGC ACAGAAGACT ATATTTCAGG 3480 

ACGGTTCGGA TAAGGAAGGA AAAACCTATG AGAAATCAAT TTGACTTAGA ATTGCATGAA 3540 

TTAGAACAAT CCTTTTTAGG ACTAGGGCAA CTTGTCCTTG AAACAGCTTC AAAAGCCTTA 3600 

CTGGCCTTAG CCTCCAAAGA CAAGGAGATG GCAGAGCTAA TTATCAATAA GGATCATGCT 3660 

ATCAACCAAG GTCAAAGCGC TATCGAATTG ACCTGTGCCC GTTTGTTGGC CTTGCAGCAG 3720 

CCACAAGTGT CTGACCTTCG ATTTGTGATT AGCATCATGT CTTCTTGTTC AGACCTTGAA 3780 
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CGTATGGGAG ACCATATGGC AGGCATTGCC AAAGCTGTTT TGCAACTAAA AGAAAATCAA 3840 

CTAGCCCCTG ACGAAGAACA GTTACACCAA ATGGGTAAAT TATCCCTCAG CATGCTAGCC 3900 

GATTTATTGG TTGCCTTTCC TTTGCACCAA GCCTCAAAAG CTATTAGTAT TGCTCAAAAA 3960 

GATGAACAGA TTGACCAATA TTATTATGCC TTATCAAAGG AAATCATTGG ACTTATGAAA 4020 

GACCAAGAAA CCTCAATTCC CAATGGAACT CAATACCTTT ATATCATAGG GCATCTGGAA 4080 

CGCTCGCTGA TTACATTGCT AACATTTGTG AACGCCTAGT CTACCTAGAA ACAGGAGAAC 4140 

TAGTGGATTT GAATTAATTC AACTAATCCT TAAAAGAGAA GAGTACGATT AAGTACTCTT 4200 

TTTTATGGTT GTAAAAAAGT TCATTTGACC AATTTAAGCA GTGTAGATAG TGAGGAGTTG 4260 

TTTCAATTCT ATCGTGAACG AGGGAATGCT GAAAACTTTA TCAAAGAAAG GAAAGCAGGA 4320 

TTCTTTGGGG ATAAGACAGA TAGTTCGACC ATGATTAAGA ATGAAGTACG TATGATGATG 4380 

GGCTGTCTGG CTTATAATCT CTACCTCTTT TTAAAGCAGC TAGCTGGTGA TGAAGTAAAG 4440 

TCCTTGACTA TCAAGCGTTT TCGACGTCTC TTCCTTCATA TTGCCGGAAA ATATGTCTCT 4500 

ACTGCTAGAC GACATATTCT CAAATTCTCA AGTCTATACG CCTATTCAAA ACAGTTTCAA 4560 

GCCTTATTTG ATACAATCTG CCAGATAAAT CTGATACTCC CTGTTCCATA TAGAGCTAGA 4620 

GGGCAGGGGA AAACATGCCT AACAGAATAA GTCACCTTAT TTTAAAAATC GAGCATCAAA 4680 

CCAAGGGAGG AGTCTGCCCT TTTTTAGGAA AAAATCAAGA CAAATCTCCT CAATTATGTC 4740 

TCGAACATCA GAAATTAAGC AAAATCACCA GAAGGACAGT ATTTCAACTA GCTTTTCTGG 4800 

TAATTTTTGA ACTGTGTAGT TCGTTAGTGC CAGATATGAA TAATTTGGGA TGATAAATCT 4860 

TTCTTCCTCA GGTAGCCTAT CATAATACTC TTCAAAAATC TTATCAAAAA CACTCTCTTT 492 0 

CTTTTGGGCG ATAGTTTCAT CTTCGTATGT AGGAGTCCTC ATCAAGAAAT ACTTCAATTC 4980 

TAGGTATTCC TTATCCAACT CTATATAACT TGGCATCAAC TTGTAATCTT CAACCCCCAA 5040 

ACGTTCAGCA ATATATTTTA ACTTTGTTAG TATTGGTCTG GATTCTCCAT TTTCAATTCT 5100 

AATTAATTGA CGGATACTTA ATTCAGACTC ATCACCACAA AATTCTGAAC GACTGATT1 V 5160 

TTTAGCCAAA CGTAATCTTT TAATTTTTTC GCCAAACTCT CGCAACCTAC AAGAACTTCC 3220 

I 

TGAGTTGTTT ACCTCTATTA TAAGCATATA CTGAATCAAA CTATCTATCA GATTTCTTCT 5280 

CACTTTAACT AAAGACTAAG AGTTTATCCC TTCGTCTCGG TTTTTGTGTA TTTTTCCACC 5340 

ATACCCCAGT AATGCAAGTG CAAAATCCCC TAGAATATGA TAGAATAAGA GAAAGAACTC 5400 

TATCAAGGAG GAAATCATGG AAAAACAAAC CGTCGCCGTC TTGGGGCCTG GTTCTTGGGG 5460 

AACCGCCCTT TCACAAGTCT TAAATGACAA TGGACACGAG GTACGTATTT GGGGAAATCT 5520 

TCCCGAGCAA ATCAATGAAA TTAATACACA CCATACTAAT AAGCACTACT TTAAAGATGT 5580 
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CGTTCTAGAC GAAAATATCA TTGCCTACAC CGACTTAGCA GAAACATTGA AAGATGTGGA 5640 

TGCGATTTTG TTTGTTGTCC CAACAAAAGT GACACGACTT GTTGCCCAGC AAGTTGCACA 5700 

AACCTTGGAC CATAAGGTTA TCATCATGCA CGCATCAAAG GGATTAGAAC CTGATAGCCA 5760 

TAAACGATTA TCAACCATTC TTGAAGAAGA AATTCCTGAA CATCTCCGTA GTGATATCGT 5820 

CGTTGTTTCA GGGCCTAGTC ATGCAGAAGA GACCATTGTG CGTGACCTAA CTTTAATAAC 5880 

TGCTGCTTCT AAAGATTTAC AAACAGCTCA ATACGTTCAG AAGCTATTTA GTAATCACTA 5940 

CTTCCGACTT TATACCAATA CGGATGTTAT CGGGGTTGAA ACTGCTGGTG CTCTTAAAAA 6000 

TATTATTGCT GTCGGTGCTG GAGCTTTACA TGGTCTTGGA TTTGGTGATA ATGCTAAGGC 6060 

AGCCATCATC GCTCGAGGTT TAGCAGAAAT CACCCGCCTA GGGGTAGCAC TCGGGGCCAG 6120 

TCCATTGACC TATAGCGGCT TATCTGGTGT GGGAGATTTG ATCGTAACGG GAACTTCCAT 6180 

CCACTCTCGT AACTGGAGAG CTGGAGATGC TCTCGGACGA GGAGAATCCC TAGCTGATAT 6240 

AGAAGCTAAT ATGGGCATGG TAATCGAAGG AATTTCAACG ACTCGAGCAG CCTATGAACT 6300 

AGCCCAAGAA CTTGGAGTCT ATATGCCCAT TACACAGGCT ATTTACCAAG TTATTTATCA 6360 

CGGAACCAAT ATCAAAGATG CCATTTATGA CATCATGAAC AATGAATTTA AAGCAGAAAA 6420 

TGAGTGGTCT TAACCCTCTA TAGAAAGGAT TTTTATGACA TCAAAAGTTA GAAAGGCAGT 6480 

CATCCCTGCT GCTGGACTAG GAACTCGATT TTTACCAGCA ACCAAGGCCC TTGCCAAAGA 6540 

AATGTTGCCA ATCGTAGACA AACCAACTAT CCAGTTTATC GTGGAAGAAG CTCTCAAATC 6600 

AGGTATTGAA GATATTCTAG TTGTCACTGG TAAATCAAAA CGTTCTATTG AGGACCACTT 6660 

TGATTCAAAC TTCGAATTGG AATATAACCT CAAAGAAAAA GGGAAAACAG ATCTTTTGAA 6720 

GCTAGTTGAT AAAACAACTG ACATGCGTCT GCATTTTATC CGCCAAACTC ATCCACGCGG 6780 

TCTCGGAGAT GCTGTTTTGC AAGCCAAGGC TTTCGTCGGA AATGAACCTT TTGTCGTTAT 6840 

GCTTGGTGAT GACTTGATGG ATATCACAGA CGAAAAGGCT GTTCCACTTA CCAAACAACT 6900 

CATGGATGAC TACGAGCGTA CCCACGCGTC TACTATCGCT GTCATGCCAG TCCCTCATGA 6960 

CGAAGTATCT GCTTACGGGG TTATTGCTCC GCAAGGCGAA GGAAAAGATG GTCTTTACAG 7020 

TGTTGAAACC TTTGTTGAAA AACCAGCTCC AGAGGACGCT CCTAGCGACC TTGCTATTAT 7080 

CGGACGCTAC CTCCTCACGC CTGAAATTTT TGAGATTCTC GAAAAGCAAG CTCCAGGTGC 7140 

AGGAAATGAA ATTCAGCTGA CAGATGCAAT CGACACCCTC AATAAAACAC AACGTGTATT 7200 

TGCTCGTGAG TTCAAAGGGG CTCGTTACGA TGTCGGAGAC AAGTTTGGCT TCATGAAAAC 7260 

ATCCATCGAC TACGCCCTCA AACACCCACA AGTCAAAGAT GATTTGAAGA ATTACCTCAT 7320 
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CCAACTTGGA AAAGAATTGA CTGAGAAGGA ATAACAAAAT CATTTATATA AAGATTAGCC 7380 

ACACATAAAT TAAGTAAATT CTCTACTTGA ATCTACCTAT TTAATAAAAA CTAATGAAAA 7440 

CGCTATACTT GTATTTGTTT TTTCATTAAA ATAAGAGTAG AATAAATTAG TATAGTAAAA 7500 

CAAAAAAGCA CCGAATCGGT GCGCACTTTT TCAAGTTGTG TACGGACAAA GCCTTATTTT 7560 

AACTTTGCTA TGTTGTTTCT AATGGTTCCA AAATAATAAA TAATTTTAAA TTTGACTTAA 7620 

CTGTTGGAGT AGTCATGGTT AAATTAAATC AACCGAGCCG AACATAAGTT GTTTAATTTT 7680 

GTGGAAGCTA TTAATAAAAA TATAATAAGG GAGAAAGATA GGTGTAATTT TAATTTTAAA 7740 

GTAATTGCGG ACACTATCAA AGAAAAAGAT TATGGAGAAC AAATTTGTAG AATTTATCGA 7800 

AAACAATAAA AAAGTAATCA TTTCATCAGT TGCAGTTGGT GTTGTATTGG TATTAGGGTT 7860 

TGGATGGTAT TCATATAACC AACAACAAGC AGAACAACAA GCAAAAATTG TACAATTAGA 7920 

AAAAGATAGC AAATCAGACA AAGAACAAGT TGATAAACTA TTTGAATCAT TTGATGCATC 7980 

TTCAGATGAA TCTATTTCTA AATTAAAAGA ACTATCTGAA ACTTCACTTA AAACCGATGC 8040 

AGGTAAAGAC TATCTTAATA ACAAAGTCAA AGAATCATCT AAAGCAATTG TAGATTTTCA 8100 

TTTGCAAAAA GGTTTGGCTT ATGATGTTAA AGATTCAGAT GACAAATTTA AAGATAAAGC 8160 

AACTCTTGAA ACAAATGTAA AAGAAATTAC AAAACAAATT GATTTTATCA AAAAAGTTGA 8220 

TGAAACTTTT AAACAAGAGA ATTTGGAAGA AACTCTTAAA TCTCTAAATG ATCTTGTTGA 8280 

TAAATATCAA AAACAAATCG AACTTTTGAA GAAAGAAGAA GAAAAAGCTG CTGAAAAAGC 8340 

TGCTGAAAAA GCAAAGGAAT CTTCTAGTCA AAGTAATTCT TCTGGTAGTG CTTCTAATGA 8400 

GTCTTATAAT GGATCTTCCA ATTCAAATGT AGATTATAGT TCATCTGAAC AAACTAATGG 8460 

ATATTCAAAT AATTATGGCG GTCAAGATTA TTCTGGTTCA GGAGATAGTT CAACAAATGG 8520 

TGGATCATCA GAACAATATT CATCTAGCAA TTCAAACAGC GGAGCAAATA ATGTCTACAG 8580 

ATATAAAGGC ACTGGTGCTG ACGGCTATCA AAGATACTAC TACAAAGATC ATAATAATGG 8640 

AGATGTGTAT GATGACGATG GAAATTACCT TGGGAACTTT GGTGGCGGCA TTGCAGAACC 8700 

TAGTCAACGC TAATAACTAT TTTAGAGCTG TGTTGTTTCG AATGGTTCCA AAACACATTA 8760 

AAAGCTACTC ATTTTTTAAG TAGCTTTTTT CTTATTCAAG TTTACATATT ATACTCAATG 8820 

AAAATCAAAT TCAAACCACG TCAGCATCGC CTTACCGTAG GTATGGTTAC TGACTTCGTC 8880 

AGTTTCATCT ACAACCTCAA AACCATGTTT TGAGCTGACT TCGTCAGTTC TATCTACAAC 8940 

CTCAAAGCAG TGCTTTGAGC AACCTGCGGC TAGCTTCCTA GTTTGCTCTT TGATTTTCAT 9000 

TGAGTATTAG TCGTCACAAT CCCATTCCCT TGTAGAAAAG CAAAATGGCG AGTCCTACGA 9060 

ACAAGACTAC CGCTCCTAAT CTCTGGCTGG TGTTATACAT CCGTTTTTCT CCTCTAACTG 9120 
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GAAAGATAAC TGCTAGAAAT GCGCCACCAA CTGCACCACC GATATGGCCT GCTAGGCTGA 9180 

TTCCTGGAAT CAGAACACTT CCAATAATGT TAACCACAAA AAGTGTCAGA TAGGATTGCC 9240 

CTAGCTGTTG GATATAAGGA TTGCGAGTTG CATAGCGAAG AACAATAATC GCGGCAAATA 9300 

GCCCATAAAG AGAGGTAGAG GCGCCTGCTG CTAAGGATTT AGGACTAAAT ACAAAAACAA 9360 

AGAGATTGCC CATCATTCCT GATAAAAGAT AGAGAAAGAA AAACTGCTTA GAACCGAAAA 9420 

TCTCCTCTAC CTGCCTTCCA AGATAATAAA GTGAAAGCAT ATTAACAATG AAATGTTCCC 9480 

ACCCAATATG AACAAAAATG GCAGACAAGA GACGCCAAAC CTGCTCGGGA AAGAGGCGAA 9540 

TAGCTGGCCC ATACATGGCT CCAAATCGAA ATAATGTATC TGCCCTGTCA AAGTTTCCGC 9600 

CTGCAGTGAC CAACATTAGT AAAAATACCA AGGCCGTCAC TAAGAGGAAG AAACTCGTCA 9660 

CAGGGTAACG TCTATCAAAG ATTTCCTTCA TCAATTAATA CCTCCTGAAC AGGAATATCA 9720 

TGGTTTTCAG GTATAAAGTC CTGAATTTGA CAAGGATATA TCGTACTCAA AGTACGACCA 9780 

GAAAAATGTT CCAGATAGCG GTCATAATAG CCTCCACCGT ATCCTATCCG ATATCCTTTC 9840 

GTCGTAAAAG CCAGACCAGG AACATGAATC AAATCAATCT GAGATGCATC CACCACTTCC 9900 

AAATCTCCCT GTAGCTCCAG TAAGGCAAAG AAAGTTTTTA CCAACTGTTG CGGATCATAG 9960 

ACCACAAAGT CCATGCGCCC CTTGGGATAA GTTTTGGGTA TTAAAACCTT CTTGCCGTCC 1002 0 

TTCAGCGCCT GCTCAATCAG TTCCTGCGTT TGAAACTCAT GAGAAAAAGA GAGGTAGGTT 10080 

GCGATGACCT TGGCTTCTTG ATAAAAGGGG TGTTGTAAAA GCCGCTCGGT TAAAGCTTGG 10140 

TCTATAGCCT GTTTTTGCTC TTGAGATATA GCCTTCATTT CATGCAAGAC TTGCTTGCGT 10200 

AATTCCGATT TCATAGACAA GCCCTCTATT CTGCTGCCTT CTTTTTCAGG AAACTAGACA 10260 

CCGCAGCCAC CCCAATAGCT AAGACTTCTT CCTTAGGACT CATTTGAGGG TGATGAAGAG 10320 

CGTAGGGACT ATCGATACCT AGCCAAAACA TCACGCCATC AACCTTTGAA AGGAGATAAC 10380 

CAAAGTCCTC GCCTGTCATA GCAGGTTCGA TATCAATCAA CTCGATTCCG TCTTTTTCGT 10440 

CAAAGAAGTC CATCAGTTCA CGCGCCAAGG CTGGATTGTT CTCAACAGGT AGGTATCCAC 10500 

CTTGTTTGAG TTCCACTTCG ACTTCCATAT CAAAGGCAGC TGCAACCCCT TCTGCAACTG 10560 

TTTTTACCCT CTTTTGCACC AAGAGACTCA TGTCCTGTGT CAAGGCACGA ATAGTTCCAT 10620 

GTAAAAAAGC TGTGTCTGTG ATGACATTGT TGGTGGTTCC AGCTTGAAAA ACGCCGAAGG 10680 

TCACCACTGC TCCCTCGATT GGGTTGACAT TGCGGCTAAC AACTGACTGC ACTTGGGTCA 10740 

CAAAGTAACT AGCCGCCACC AAGGCGTCAT TGGCTTCATG AGGAAAAGCT GCGTGGCCAC 10800 >. 

CTTTGCCTTT GAAACGGATC TTCACCTCGC AAGTTCCTGC AAAGAGTGTA TGAGTATTAG 10860 
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TCGCAATCTG GCCGACTTTC AAATCTGGAC GAACATGGAG ACCATAGAAT TGATCTGGCA 10920 

ACCAATCTCC AAAAGCACCG TCCTCATACA TGAGCATACC ACCAGCTTCA TTTTCTTCAG 10980 

CAGGCTGAAA TAGAAAGAGC AGATTATTCT TGGGTTGCTC CTCAAGGGCG CGCTCAAGAC 11040 

AGCCTAAGGC AATGGTCATA TGAAAATCAT GGACACAGGC ATGCATGCGA CCTTGGTGTT 11100 

GAGAAGCAAA AGGTAGACCT GTTTGTTCGA CGATAGGCAG GCCATCAATA TCTGTCCGCC 11160 

AACCAATGGT TCGCTCCGGC TGACTTCCCT GCAGGTAGAC CAAAATCCCT GTCCGCCAAG 11220 

TACGAATTTG AACAAAATCC TTGCCCGTAG TCAATTTCTC AATCACATCC AGCAAATAAG 11280 

CCTGAGTCTT GAACTCCTCC AAGCCAATCT CTGGAATCTG GTGTAAATCT CGTCTAGTCT 11340 
GAATCAAATC TAACATCTAT CTGTCCTCCG ATATAGCAGA AAGAGGCTGG AAAAAGGGTT . 11400 

CCGCCTCTTT TTTACTTTTA CAATTACAAG GTACGAAGCG CATCCTCTAG CGCTGTTTTT 11460 

TGTTGAGTTT GGGCATCAAT TTCTTTGATA ATACGAGCTG GAACACCTGC TACTACCACG 11520 

TTTTCTGGGA CATCTTGGGT AACAATAGCT CCTGCTGCGA CAACTGAACC ACTACCGATT 11580 

TGGACTCCTT CGATAACCAC TGCATTAGCA CCGATAAGAA CATTGTCTCC GACACGGACT 11640 

GGTTCAGCAC TAGCTGGCTC AATCACACCT GCCAAAACTG CACCTGCACC AACGTGGCTA 11700 

TTTTTTCCAA CGATGGCACG GCCACCAAGG ATGGCACCCA TGTCAATCAT GGTTCCAGCA 11760 

CCGATTTCAG CACCGATATT GATAACAGAT CCCATCATGA TAACAGCATT GTCACCAATT 11820 

TCCACCTGGT CACGGATAAT CGCACCTGGC TCGATACGAG CGTTGATAGC ACGCTTATCT 11880 

AGCAAAGGAA CTGCAGAATT ACGAGCATCT TGCTCGACAA CATAATCTTG ATTTTCTACC 11940 

AAACCTTCAA GAAGCGGAGC CACATCCTTC CAGTCTCCGA ATAGGACATT TCCTAGTTTG 12000 

ACAACAGAGC TAGGCACAGC AGTTGCGAGT TGCCCCTCAA AGGTTACTTT GACACTGGTT 12060 

TTCTTTTCAG CATTGGCGAT AAATTGGATA ATTTCTTGAG CGTTCATTTT TGTAGCAGTC 12120 

ATAGGTG 12127 
(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 12566 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

CCATCCTTCT GTTGATGTGA CAGGAATGAT GATAAATCAA CCAGTAGCTA GTCGCGAAGA 60 

GGTGACAGAG GCTTTGAGTC ACTTGGCGGT AGAGCACAAT AGTCTCATTG CTCGTCGAAT 120 
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CGTTGAGCCA AATGAAGCTG GAGAAACACG CTTTACCTAT GCCACTTATG GTGAGGGAAA 180 

GCTTCCAGAA GGTCTGACCA TTTCCTCCAA GGAGAGTGCA GAAACGAGTG ATTTATTAGG 240 

GTCTTACTTG ATTGTATCAG GAAGTTTGGA TGGAGTGAGC TTACAGACCA CCTTGAAAGA 300 

GCTTGGTTAT CAAGGCTTTG TTTCGAATGG AGAAGATCCA TTTTCGATAG TCTTACTATT 360 

GACGGCCACC CCTATGGTGC TACTGAGTTT AGCTATTTTT CTGCTGACCT TTATGAGTCT 420 

GACCCTGATT TATCGGATCA AATCCCTTCG TCAGGCAGGG ATTCGCTTAA TAGCTGGTGA 480 

GAGCTTGTTT GGAGTTGCTC TCAGACCAGT GTTAGAAGAT GTGAGACAGC TTATCTGCTC 540 

AGTGCTGGTA TCCAGTCTTT TGGGATTGGG GATTCTCTGG TATCAAGGTG CCTTGTTTAT 600 

GGCAACGGTG CAACTGGTCA TCATTGCTCT TCTACTTTAT GGATTGACCT TGGCAGGGAT 660 

TTCTACCTTA CTAAGTGTCG TCTATCTACT TGGTTTACAG GAAAATAGTC TGGTGGATCT 720 

ATTGAAAGGG AAACTCCCTC TCAAACGTAT GATGACATTG ATGATGGTGG GGCAACTCTT 780 

AG CTGTATTG GTGGTCGGAT CGAGTGCGAC AGCTCTCCTA CCCCACTACC GTGAAATGCA 840 

GGAAATGGAG AGAGCTAGCA ATAAATGGAG CCAGTCCTCA GACCGTTACC GTCTATCCTT 900 

TGGTTGGTCT AGTGCATTTG CCGATGAAGA AGGAACGCGT AAGGATAATC GTGAGTGGCA 960 

GACATTTACT GAAGAACGGT TAGCCAATAC AGACTCTTTT TATATTATGA GCAATGTTGA 1020 

CAATTTCTCA GATGGAGCAG AAGTGGACCT AGATGGCAAT CGTCTCAGTG ACTACACACC 1080 

GTCAGGGAAT GTTATCTATG TCTCACCGCG CTATCTGATA GAAGAAAAGA TTACCGTTTC 1140 

TTCAGAGTTT ATGGACAAGA TGCAAAACTT GTCTGAGGGA GAGTTTGGGC TGATCTTGCC 1200 

TGAGAGCTTG CGAGAGCAGT CTGTCTACTA CCAAGGATTG TTTACAGATT ACCTGCAAAA 1260 

CTTTTCATCT GAAAGTGTAG AAGTGACGAG TCAGAAACAC TACCTCCCAC AGGTAAGGCT 1320 

AGCTTTTACA GAAACAGGAC AGGAACGTTT CCTCTATAAT GATGGGTACA AGACAACACG 1380 

CCAGTACCTA AAAGATCCGA TTATTGTAGT TCTAACGCCG CAAGCGACTG GAACAAGACC 1440 

TGTTGCAGGG ATGTTGTGGG GAACTACGGC TAATAGTGCC TTGAAACTAG ATCGATATGG 1500 

AGACAGCATC ACAGCTCTAA AAGAGAAAGG TCTGTATCAC AAGGTTTCTT ACTTGGTAAA 1560 

AAGCCAGCTA TTTTTTGCCA AGGTACTAAA TGACAAACGG GTGGAGTTTT ACTCTCTCCT 1620 

TATTGGGACG ATTTTGACCC TGTCTACGGC TATCTTGTTA TTTGATTCCA TGAATCTTCT 1680 

CTATTTTGAG CAGTTCAGAC GGGAACTTAT GATTAAACGT CTTGCTGGTA TGACAATCTA 1740 

TGAGCTTCAT GGCAAGTATT TACTGGCGCA AGGAGGAGTT CTCTTGCTTG GCCTAGTCCT 1800 

ATCTAGTATT TTGACAAGAG ATGGTTTGAT TAGCGCTCTA GTTGTAGCTT TGTTTACGCT 1860 
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TAACGCCCTC TTGATTTTAG TAAGGCAGGA CAAAAAAGAA GAAGCTGGTA GCATGGCAGT 1920 

ATTGAAAGGA AAATAAGATG ATTGATATTC AAGGATTGGA AAAGAAATTT AATGACCGCG 1980 

CGATTTTCTC TGGTTTGAAT CTCAAGCTGG AGAAGGGCAA GGTTTATGCC TTAATCGGAA 2040 

AGAGTGGAAG CGGAAAGACG ACGCTGCTGA ATATCTTGGG AAAGCTAGAA AAGATAGATG 2100 

GTGGAAGGGT TCTCTATCAG GGGAAAGATT TAAAAACCAT TCCCACTCGT GAGTATTTTC 2160 

GAGACCAGAT GGGCTATCTC TTTCAAAATT TCGGCCTCTT AGAAAACCAA TCAATCAAAG 2220 

AAAATTTGGA TTTGGGTTTT GTTGGTCAGA AAATCTCAAA AGTAGAACGT TTGGAAAGGC 2280 

AAGTGGGGGC TTTAGAAAAA GTTAATCTAG GGTATTTGGA TTTAGAACAA AAAATCTATA 2340 

CTTTATCTGG GGGAGAGGCC CAACGAGTTG CCCTTGCTAA GACTATTTTG AAAAATCCAC 2400 

CCTTGATTTT GGCAGATGAA CCAACAGCAG CTCTTGATCC TGAAAATTCA GAGGAGGTTA 2460 

TGAATCTCTT GGTGGATTTG AAAGATGAAA ATCGAATTAT CATCATTGCG ACCCATAATC 2520 

CCCTAGTCTG GAATAAGGCT GATGAAATCA TTGATATGAG GAAACTTGCT CATGTGTGAA 2580 

AAAATCCGTA TTCGCAGGGT ATCTGATTAT CCTAGTGCCA GAGGTGGTTT AGAAGATATC 2640 

CTCATCATGG AAAATATGAC CAATCATCTC CTTTTGGTTC AAATCCGAGT GCATGGCTAT 2700 

TTGCTTGATT TTGCTAGTAT TGAAGGGCAA AGGCAAAAGC ATTATCGTTT GAAAAATTTA 2760 

CCTCAGACGG TTGAACTGAC AGTGGATGAT GTGGAGGAGG ATGTGGATTT GACCCTACCT 2820 

GAAAATCGAA GTTATCAAGA AGCTGATTTT TTTGAACGCA TGTTTCGAGA GAACTGCTAA 2880 

GGCCACTTTT AAAGATTTCC AAGACTATCT TTCTTCATGA GGAAAGATAG TTTTTTGGTA 2940 

TGATTTTCAT TCCCAAAATA CAAGGGGAAT GTGTTACAAT AGTAGTAACA GATAATAGAA 3000 

AAGAGAAT AG ATGAGAATTG CAGATTATAG CGTGACCAAG GCAGTGCTGG AGCGTCACGG 3060 

TTTTACCTTT AAAAAGTCCT TTGGGCAAAA TTTTTTGACG GATACCAATA TCCTTCAAAA 3120 

AATTGTGGAT ACGGCTGAAA TTGATGATCA GGTCAATGTC ATCGAAATCG GGCCAGGTAT 3180 

TGGTGCCTTG ACAGAATTTT TGGCTGAGCG TGCAGCCCAA GTCATGGCTT TTGAGATTGA 3240 

CCACCGTTTG GTGCCAATTT TGGCAGATAC CCTGCGTGAT TTTGATAATG TGACCGTAGT 3300 

TAACGAAGAT ATTCTCAAGG TTGATTTGGC GCAACATATC CAGAATTTTA AAAATCCTGA 3360 

CCTGCCAATC AAGGTAGTGG CTAATTTGCC TTACTACATC ACGACGCCTA TTCTCATGCA 3420 

CTTGATTGAG AGTGGCATTC CTTTTTGTGA GTTTGTGGTC ATGATGCAGA AAGAAGTAGC 34 80 

GGACCGCATT TCAGCCCAGC CTAACACCAA GGCTTACGGT AGCTTGTCTA TCGCCGTGCA 3540 

GTATTACATG ACAGCCAAGG TTGCCTTTAT CGTGCCTCGT ACGGTCTTTG TGCCAGCGCC 3600 

AAATGTGGAT TCAGCCATCT TGAAAATGGT GCGTCGTCCA GAGCCAGCCG TAGCAGTAGA 3660 
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AGATGAGAAC TTTTTCTTTA AGGTTTCCAA GGCTAGTTTT ACCCATCGCC GCAAGACCTT 3720 

GTGGAATAAC TTGACAGGTT ACTTTGGTAA GACTGAAGAG GTCAAGGACA AGCTGACCAA 3780 

GGCTTTGGAC CAGGCAGGCT TGTCACCAAG TGTGCGTGGG GAAGCTCTCA GCTTGGCAGA 3840 

ATTTGCCGGT CTAGCAGACG CACTTAAAGG GCAAGGACTC TAAGATGCAG GGACAAATCA 3900 

TTAAAGCCTT GGCAGGTTTC TACTATGTGG AGAGTGATGG CCAGGTTTAT CAAACACGCG 3960 

CGCGTGGGAA TTTCCGTAAA AAAGGCCATA CCCCTTATGT TGGGGACTGG GTAGATTTCT 4020 

CTGCCGAGGA AAATTCAGAA GGCTATATCC TCAAAATTCA CGAACGGAAA AACAGTCTGG 4080 

TTCGTCCGCC TATTGTCAAT ATCGATCAAG CTGTAGTAAT CATGTCCGTC AAGGAACCTG 4140 

ATTTTAACAG CAATTTGCTG GATCGTTTCT TGGTTCTTTT GGAGCACAAG GGCATCCATC 4200 

CCATTGTCTA TATTTCCAAA ATGGATTTGT TGGAAGATAG GGGAGAACTG GATTTTTACC 4260 

AGCAGACCTA TGGTGACATC GGCTATGACT TTGTGACCAG TAAAGAGGAA CTCCTGTCTT 4320 

TGTTAACAGG CAAGGTTACG GTCTTTATGG GGCAGACAGG TGTTGGGAAG TCAACTCTTC 4380 

TCAATAAAAT CGCACCAGAC CTCAATCTTG AAACGGGAGA AATTTCAGAC AGTCTAGGTC 4440 

GCGGTCGCCA TACCACTCGA GCTGTTAGTT TTTACAATCT CAACGGGGGT AAAATCGCAG 4500 

ATACACCAGG ATTTTCATCC TTGGACTATG AAGTATCAAG GGCTGAAGAC CTCAATCAGG 4560 

CTTTCCCAGA GATTGCTACT GTTAGCCGAG ATTGTAAGTT CCGTACTTGT ACCCATACCC 4 620 

ATGAGCCGTC TTGTGCCGTC AAACCAGCTG TTGAAGAGGG TGTTATTGCA ACCTTCCGTT 4680 

TTGACAATTA CCTGCAATTC CTTAGTGAAA TTGAAAATCG TAGAGAAACC TATAAAAAAG 4740 

TCAGCAAAAA AATTCCAAAA TAAGGAGAAA CCTATGTCTC AATACAAGAT TGCTCCGTCA 4800 

ATTCTGGCAG CAGATTATGC CAACTTTGAA CGTGAAATCA AACGTCTAGA AGCAACTGGG 4860 

GCAGAATATG CCCATATCGA TATCATGGAC AGTCATTTTG TACCGCAAAT CAGTTTTGGT 4920 

GCAGGTGTGG TCGAGAGCCT TCGTCCTCAT AGTAAGATGG TTTTCGATTG CCACTTGATG 4980 

GTGTCAAACC CTGAGCATCA TCTGGAAGAT TTTGCGCGTG CAGGTGCAGA CATCATCAGT 5040 

ATCCATGTAG AAGCAACGCC TCATATTCAT GGCGCCCTCC AAAAAATTCG TTCACTCGGA 5100 

GTTAAGCCTT CAGTCGTTAT CAATCCTGGC ACATCAGTTG AAGCCATCAA GCACGTCCTT 5160 

CATCTAGTTG ACCAAGTTTT AGTCATGACG GTTAATCCAG GTTTTGGTGG GCAAGCCTTT 5220 

CTGCCAGAAA CCATGGATAA GGTCCGTGAG TTGGTTGCTC TTCGTGAGGA AAAAGGTTTG 5280 

AACTTTGAAA TCGAAGTGGA TGGTGGGATT GATGACCAAA CTATTGCTCA AGCCAAAGAA 5340 

GCCGGTGCGA CTGTTTTTGT AGCAGGTTCC TATGTCTTTA AGGGAGAAGT CAATGAGCGA 5400 
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GTACAAACTC TCAGAAAACA ACTGGACTAG GGTTGCAGTT TTTGCAGGCG GAAACCGCGG 5460 

TCATTATCGG ACAGATTTTG ATGCTTTTGT TGGGGTGGAT CGAGGCTCGC TCTGGGTCTT 5520 

GGAAGAAGAC TTACCTCTTG CTCTAGCAGT CGGAGATTTT GATTCTGTGA CGGAAGAAGA 5580 

GCGACAGGTG ATTCAAAAAG GTGCCCAGTA TTTTGTCCAA GCACGACCAG AAAAGGATGA 5640 

TACAGATCTG GAATTGGCTC TCTTAACCAT CTTTGAACAA AATCCTCAGG CTCAGGTCAC 5700 

TATTTTCGGT GCCTTGGGTG GCCGTATTGA CCATATGTTG GCCAATGTCT TTCTGCCTAG 5760 

CAATCCTAAG TTGGCACCCT ATATGCATCA AATAGAAATT GAGGATGGGC AAAACTTGAT 5820 

TACTTATTGT CCAGAAGGAA TCAGTCAGCT AGAACCTCGT TCAGACTACG ACTATCTAGC 5880 

CTTTATGCCA GTTCGGGATA GCCAGCTGAC TATTCTTGGA GCCAAGTATG AGTTGACAGA 5940 

GGAAAATTTT TTCTTTAAAA AAGTGTACGC TTCTAACGAA TATATAGATA GGGAAGTGTC 6000 

GGTAACTTGC CCAGATGGTT ATGTGGTCGT ACTGCATAGC AAGGACAGGA GGTAGGATGG 6060 

AAAGTTTACT TATTCTATTA TTAATTGCCA ATCTAGCTGG TCTCTTTCTG ATTTGGCAAA 6120 

GGCAGGATAG GCAGGAGAAA CACTTAAGTA AGAGCTTGGA GGATCAGGCA GATCATTTGT 6180 

CAGACCAGTT GGATTACCGC TTTGACCAAG CCAGACAAGC CAGCCAGTTA GACCAAAAAG 6240 

ATTTGGAAGT GGTTGTCAGC GACCGTTTGC AAGAAGTGCG GATTGAATTG CACCAAGGTC 6300 

TGACCCAAGT CCGTCAAGAA ATGACAGATA ATCTCCTCCA AACTAGAGAC AAGACAGACC 6360 

AACGTCTCCA AGCCTTGCAG GAATCAAATG AGCAACGTTT GGAACAAATG CGCCAGACGG 6420 
TCGAGGAAAA ACTAGAAAAG ACCTTGCAGA CACGCTTACA GGCTTCCTTT GAGACAGTTT ' 6480 

CTAAACAACT GGAGTCTGTC AATCGTGGCC TTGGAGAAAT GCAGACAGTT GCCCGTGATG 6540 

TCGGAGCTCT TAACAAGGTT CTCTCTGGAA CCAAGACGCG AGGGATTCTG GGAGAATTGC 6600 

AACTGGGGCA AATTATTGAA GACATCATGA CACCTGCCCA GTACGAACGA GAATACGCAA 6660 

CGGTTGAAAA CTCTAGTGAA CGAGTGGAGT ATGCCATCAA GTTACCCGGA CAAGGCGACC 6720 

AAGAATACGT CTATCTGCCA ATTGACTCTA AGTTTCCACT GGCAGATTAT TACCGCTTGG 6780 

AAGAAGCCTA TGAGACAGGT GACAAGGATG AGATTGAACG CTGTCGTAAG TCACTCCTAG 6840 

CAAGCGTCAA GCGCTTTGCT AGGGATATTA GGAACAAGTA CATAGCACCA CCTCGGACGA 6900 

CCAATTTTGG AGTTTTGTTT GTTCCGACAG AAGGTCTCTA CTCAGAAATC GTCCGCAATC 6960 

CGGTCTTCTT TGATGATTTG AGACGGGAAG AACAGATTAT TGTTGCAGGA CCAAGTACCC 7020 

TATCAGCCCT TCTTAACTCC CTATCAGTTG GTTTCAAGAC CCTTAATATC CAAAAGAGTG 7080 

CCGACCATAT CAGCAAGACT CTTGCCAGTG TCAAGACCGA GTTTGGCAAG TTTGGTGGTA 7140 

TTCTGGTCAA GGCACAAAAA CATCTCCAAC ATGCCTCTGG CAATATTGAT GAATTATTAA 7200 
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ACCGTCGTAC CATAGCTATC GAGCGGACGC TCCGTCACAT TGAGTTGTCA GAAGGTGAGC 7260 

CTGCGCTTGA TCTACTCCAT TTTCAAGAAA ATGAGGAAGA ATATGAAGAT TAGTCACATG 7320 

AAAAAAGATG AGTTATTTGA AGGCTTTTAC CTAATCAAAT CAGCTGACCT GAGGCAAACT 7380 

CGAGCTGGGA AAAACTACCT AGCCTTTACC TTCCAAGATG ATAGTGGCGA GATTGATGGG 7440 

AAGCTCTGGG ATGCCCAACC TCATAACATT GAGGCCTTTA CCGCAGGTAA GGTTGTCCAC 7500 

ATGAAAGGAC GCCGAGAAGT TTATAACAAT ACCCCTCAAG TCAATCAAAT TACTCTCCGC 7560 

CTGCCTCAAG CTGGTGAACC CAATGACCCA GCTGATTTCA AGGTCAAGTC ACCAGTTGAT 7620 

GTCAAGGAAA TTCGTGACTA CATGTCGCAA ATGATTTTCA AAATTGAAAA TCCTGTCTGG 7680 

CAACGGATTG TCCGAAATCT CTACACCAAG TATGATAAGG AATTCTACTC CTATCCAGCT 7740 

GCCAAGACCA ACCACCATGC CTTTGAAACG GGCTTGGCCT ATCATACGGC GACCATGGTG 7800 

CGTTTGGCAG ACGCTATTAG CGAAGTTTAT CCTCAGCTCA ATAAGAGCCT GCTCTATGCG 7860 

GGGATTATGT TGCATGACTT AGCTAAGGTC ATCGAGTTGA CGGGGCCAGA CCAGACAGAG 7920 

TACACAGTGC GAGGTAATCT TCTTGGACAT ATCGCTCTCA TTGATAGCGA AATTACCAAG 7980 

ACAGTTATGG AACTCGGCAT CGATGATACC AAGGAAGAAG TCGTTTTGCT TCGTCATGTC 8040 

ATCCTCAGTC ACCACGGCTT GCTTGAGTAT GGAAGCCCAG TCCGTCCACG CATTATGGAA 8100 

GCAGAGATTA TCCATATGAT TGACAATCTG GATGCAAGCA TGATGATGAT GTCAACAGCT 8160 

CTTGCTTTGG TGGATAAAGG AGAGATGACC AATAAAATCT TCGCTATGGA TAATCGTTCC 8220 

TTCTATAAAC CAGATTTAGA TTAATAATTT AAGAAAAATG AGCATTTTTT AGGATAAGAA 8280 

TGTTCGTTTT TTTATGTGAA TATGGTATAA TAAGTAAAAG ACAAAAATGA ATACTCTTCG 8340 

AAAATCTCTT CAAACTAGGG TAGTATCGCC TTGTCGTATG TATATATGCA GGTATATTAC 8400 

AGGGTTTGTC AGTTCTATTG ACAATCTCAA AACAGTGTTT TGAACCACCA GCGACCAGCT 84 60 

TTCTAGTTTG CTTTTTGATT TTTTGAATAA AAATGGAATA GGAAATAGAA ATGAAATTAA 8520 

GAAGAAGTGA TCGGATGGTT GTCATTTCCA ACTATTTGAT TAATAATCCT TATAAACTAA 8580 

CTAGTCTCAA TACTTTTGCT GAAAAGTATG AGTCTGCTAA ATCATCCATC TCAGAAGATA 8640 

TCGTCATTAT CAAACGCGCC TTTGAGGAAA TTGAAATCGG TCATATCC AG ACAGTGACTG 8700 

GGGCTGGCGG AGGTGTCATC TTCACACCGT CTATTTCGAG TCAGGATGCT AAGGAAATGG 8760 

TTGAAGACTT GCGTACCAAG TTGTCAGAAA GTGACCGTAT CTTGCCAGGT GGTTATATCT 8820 

ATCTGTCTGA TTTGCTTAGC ACACCAGCCA TCTTGAAAAA TATTGGTCGT ATTATTGCCA 8880 

AAAGCTTTAT GGACCAAAAA ATTGACGCGG TTATGACCGT AGCAACTAAG GGTGTGCCAC 8940 



WO 98/18931 



PCIYUS97/19588 



1004 



TTGCAAATGC 


AGTTGCCAAT 


GTCCTCAATG 


TCTCTTTTGT 


CATTGTGCGC 


CGTGACCTGA 


9000 


AAATTACCGA 


AGGTTCAACT 


GTTAGCGTCA 


ACTATGTTTC 


AGGTTCAAGT 


GGTGACCGTA 


9060 


TCGAGAAAAT 


GTTCCTTTCA 


AAACGTAGTC 


TTAAGGCAGG 


CAGCCGTGTC 


TTGATTGTGG 


9120 


ATGACTTCTT 


GAAAGGTGGC 


GGAACGGTCA 


ATGGTATGAT 


TAGTCTCTTG 


CGCGAGTTCG 


9180 


ACTCAGAACT 


GGCAGGTGTA 


GCGGTCTTTG 


CGGACAATGC 


CCAAGAAGAA 


CGTGAAAAGC 


9240 


AGTTTGACTA 


CAAGTCACTC 


TTGAAGGTAA 


CCAATATTGA 


TGTCAAGAAC 


CAAGCCATCG 


9300 


ATGTTGAGGT 


TGGCAATATC 


TTTGACGAAG 


ATAAATAAGA 


GATAGAACTA 


AAGGTTGGAA 


9360 


CGATTGTCCC 


AGCCTTTCTT 


TGCAAACAGA 


ATAGAAGGAA 


GCTTATGAAA ACACCATTTA 


9420 


TCAATAGAGA 


AGAGTTAGAA 


GCGATTGTTG 


CCGAGTTCCC 


GACTCCCTTT 


CACTTGTATG 


9480 


ATGAGAAGGG 


GATTCGTGAG 


AAGGCAAGAG 


CCGTCAACCA 


AGCTTTTTCG 


TGGAACAAGG 


9540 


GCTTTAAGGA 


ATATTTTGCA 


GTTAAGGCTA 


CTCCAACTCC 


AGCTATTTTG 


AAAATTCTCC 


9600 


AAGAAGAAGG TTGTGGTGTG GACTGCTCTA GTTATGTAGA GCTTTTGATG AGCCATAAAC 


9660 


TGGACTTTCT 


GGGTTCTGAG 


ATTATGTTCT 


CTTCCAACAA 


CACGCCAGAC 


AAGGAATACG 


9720 


CCTATGCACG 


TGAATTGGGT 


GCGACCATTA 


ACTTGGATGC 


CTTTGAAGAT 


ATTGAACATC 


9780 


TGGAGAGAGT 


AGCAGGCATT 


CCAGAAATCA 


TCTCTTGTCG 


TTATAATCCT 


GGAGGCGTTT 


9840 


TTGAACTGGG 


GACAGACATT 


ATGGACAATC 


CTGGGGAGGC 


TAAGTTTGGC 


ATGACCAAGG 


9900 


ACCAGCTCTT 


TGAAGCCTTT 


GCTATCTTGA 


AGGAAAAAGG 


AGCCAAGACT 


TTTGGGATTC 


9960 


ACTCCTTCCT 


AGCGTCCAAT 


ACCGTGACCC 


ATCTCTATTA 


TCCAGAGTTG 


GCTCGTCAGC 


10020 


TCTTTGAACT 


GGCTGTTGAA 


ATCAAGGAAA 


AGTTGGGCAT 


TTCGCTAGAC 


TTTATCAATC 


10080 


TTTCTGGCGG 


TATTGGTGTT 


AATTATCATC 


CAGACCAGGA 


GCCGAACGAT 


ATCGCCTTGA 


10140 


TTGGTGAGGG 


AGTTCGTAAG 


GTGTATGAAG 


AGGTTCTTAC 


GTCAGCAGGT 


CTTGGTCAGG 


10200 


TCAAGATTTT 


CACCGAATTG 


GGTCGTTTTA 


TGCTGGCACC 


TCACGGTGCT 


CTAGTCACAA 


10260 


GAGTCACTCA 


TAAGAAGGAA 


ACCTACCGTA 


CCTATCTAGG 


TGTGGATGCC 


TCAGCAGTCA 


10320 


ACCTCATGCG 


TCCAGCTATG 


TACGGAGCTT 


ACCATCATAT 


TAGCAACGTG 


ACCCATCCAG 


10380 


ATGGACCAGC 


TGAAGTGGTA 


GATGTGGTCG 


GTTCACTCTG 


TGAAAACAAT 


GATAAATTTG 


10440 


CAGTTAATCG 


CGAACTGCCT 


CATACAGAAA 


TCGGTGATTT 


GCTGGTCATT 


CATGATACAG 


10500 


GTGCCCACGG 


ATTTTCAATG 


GGCTACCAGT 


ATAATGCCAA 


ATTACGTTCT 


GCGGAAATCC 


10560 


TCTATACCGA 


AGAAGGTAAA 


GCCCGTCAAA 


TCCGCCGTGC 


AGAGCGCCCT 


GAGGACTATT 


10620 


TTGCAACCTT 


ATATGGCTTC 


GATTTTGAAG 


AATAATCTGA 


TAATAGATTG 


AAAATGAAAT 


10680 


TGAAAAACAG 


ATTGCTTTCT 


AAAAAATAGG 


CAAAAATCTT 


GTTTTTCCTT 


CAAGTCGTGA 


10740 



